Search CORE

1,721,008 research outputs found

Tracking experts that learn by evolving past posteriors

Author: Koolen-Wijkstra Wouter
Erven Tim
Publication venue
Publication date: 01/02/2009
Field of study

Discovering the truth by conducting experiments

Author: Koolen-Wijkstra Wouter
Koolen-Wijkstra W.M. (Wouter)
Publication venue
Publication date: 01/12/2006
Field of study

CWI's Institutional Repository

Combining strategies efficiently : radio interview BNR Denktank. 25.01.2011

Author: Koolen-Wijkstra Wouter
Koolen-Wijkstra W.M. (Wouter)
Publication venue
Publication date: 01/01/2011
Field of study

CWI's Institutional Repository

Robust and adaptive methods for sequential decision making

Author: Koolen-Wijkstra Wouter
Koolen-Wijkstra W.M. (Wouter)
Publication venue
Publication date: 01/10/2016
Field of study

CWI's Institutional Repository

MetaGrad

Author: Koolen-Wijkstra Wouter
Koolen-Wijkstra W.M. (Wouter)
Publication venue
Publication date: 01/01/2016
Field of study

CWI's Institutional Repository

Purex_games

Author: Koolen-Wijkstra Wouter
Koolen-Wijkstra W.M. (Wouter)
Publication venue
Publication date: 27/10/2019
Field of study

CWI's Institutional Repository

SIM-PL

Author: Koolen-Wijkstra Wouter
Koolen-Wijkstra W.M. (Wouter)
Publication venue
Publication date: 01/01/2009
Field of study

CWI's Institutional Repository

Metagrad: Adaptation using multiple learning rates in online learning

Author: Koolen-Wijkstra Wouter
Hoeven Dirk
Erven Tim
Publication venue
Publication date: 01/07/2021
Field of study

We provide a new adaptive method for online convex optimization, MetaGrad, that is ro- bust to general convex losses but achieves faster rates for a broad class of special functions, including exp-concave and strongly convex functions, but also various types of stochastic and non-stochastic functions without any curvature. We prove this by drawing a connec- tion to the Bernstein condition, which is known to imply fast rates in offline statistical learning. MetaGrad further adapts automatically to the size of the gradients. Its main fea- ture is that it simultaneously considers multiple learning rates, which are weighted directly proportional to their empirical performance on the data using a new meta-algorithm. We provide three versions of MetaGrad. The full matrix version maintains a full covariance matrix and is applicable to learning tasks for which we can afford update time quadratic in the dimension. The other two versions provide speed-ups for high-dimensional learning tasks with an update time that is linear in the dimension: one is based on sketching, the other on running a separate copy of the basic algorithm per coordinate. We evaluate all versions of MetaGrad on benchmark online classification and regression tasks, on which they consistently outperform both online gradient descent and AdaGrad

Squint

Author: Koolen-Wijkstra Wouter
Koolen-Wijkstra W.M. (Wouter)
Publication venue
Publication date: 01/12/2015
Field of study

CWI's Institutional Repository

Author Instructions

Author: Instructions Author
Publication venue
Publication date: 04/11/2013
Field of study

Crossref

Cartographic Perspectives (E-Journal - North American Cartographic Information Society, NACIS)