adam-optimizer

This is a C implementation of the Adam optimizer, written to better understand the mathematics behind it, based on the article by Cristian Leo.

Initialize (see the C sketch after this list):

  • Initialize the first moment vector: $m_0 = 0$
  • Initialize the second moment vector: $v_0 = 0$
  • Initialize the timestep: $t = 0$
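
A minimal C sketch of this state, assuming a single scalar parameter and the default hyperparameters from the Adam paper; the names and layout here are illustrative, not necessarily what this repository uses:

```c
/* Default hyperparameters from the Adam paper (assumed here). */
#define ALPHA   0.001  /* learning rate */
#define BETA1   0.9    /* first-moment decay rate */
#define BETA2   0.999  /* second-moment decay rate */
#define EPSILON 1e-8   /* numerical stability constant */

/* Optimizer state for one scalar parameter. */
typedef struct {
    double m; /* first moment estimate:  m_0 = 0 */
    double v; /* second moment estimate: v_0 = 0 */
    int    t; /* timestep:               t = 0   */
} AdamState;

static void adam_init(AdamState *s) {
    s->m = 0.0;
    s->v = 0.0;
    s->t = 0;
}
```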

Update Rule (translated to C after this list):

  1. Update the timestep: $$t = t + 1$$
  2. Compute the gradient $g_t$: $$g_t = \nabla_\theta f_t(\theta_{t-1})$$
  3. Update biased first moment estimate: $$m_t = \beta_1 \cdot m_{t-1} + (1 - \beta_1) \cdot g_t$$
  4. Update biased second raw moment estimate: $$v_t = \beta_2 \cdot v_{t-1} + (1 - \beta_2) \cdot g_t^2$$
  5. Compute bias-corrected first moment estimate: $$\hat{m}_t = \frac{m_t}{1 - \beta_1^t}$$
  6. Compute bias-corrected second raw moment estimate: $$\hat{v}_t = \frac{v_t}{1 - \beta_2^t}$$
  7. Update the parameters: $$\theta_t = \theta_{t-1} - \frac{\alpha \cdot \hat{m}_t}{\sqrt{\hat{v}_t} + \epsilon}$$
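
The seven steps above map almost line for line onto C. This is a sketch under the same assumptions as the initialization snippet (scalar parameter, `AdamState`, and the `#define`d hyperparameters); the function name `adam_update` is illustrative:

```c
#include <math.h>  /* for pow() and sqrt() */

/* One Adam step for a scalar parameter *theta, given the gradient g.
 * Step numbers refer to the update rule above. */
static void adam_update(AdamState *s, double *theta, double g) {
    s->t += 1;                                      /* 1. t = t + 1            */
                                                    /* 2. g supplied by caller */
    s->m = BETA1 * s->m + (1.0 - BETA1) * g;        /* 3. biased first moment  */
    s->v = BETA2 * s->v + (1.0 - BETA2) * g * g;    /* 4. biased second moment */

    double m_hat = s->m / (1.0 - pow(BETA1, s->t)); /* 5. bias-corrected m_t   */
    double v_hat = s->v / (1.0 - pow(BETA2, s->t)); /* 6. bias-corrected v_t   */

    *theta -= ALPHA * m_hat / (sqrt(v_hat) + EPSILON); /* 7. parameter update  */
}
```

For example, minimizing $f(\theta) = \theta^2$ (gradient $g = 2\theta$) amounts to calling `adam_update(&s, &theta, 2.0 * theta)` in a loop after `adam_init(&s)`.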
