Exponentiated Gradient versus Gradient Descent for Linear Predictors

We consider two algorithms for on-line prediction based on a linear model. The algorithms are the well-known gradient descent (GD) algorithm and a new algorithm, which we call EG ±. They both maintain a weight vector using simple updates. For the GD algorithm, the update is based on subtracting the...

Full description

Saved in:
Bibliographic Details
Published in:Information and computation Vol. 132; no. 1; pp. 1 - 63
Main Authors: Kivinen, Jyrki, Warmuth, Manfred K.
Format: Journal Article
Language:English
Published: San Diego, CA Elsevier Inc 10.01.1997
Elsevier
Subjects:
ISSN:0890-5401, 1090-2651
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Be the first to leave a comment!
You must be logged in first