This repo contains a PyTorch implementation of the EKFAC and K-FAC preconditioners. If you find this software useful, please check the references below and cite accordingly!
We implemented K-FAC and EKFAC as preconditioners. Preconditioners are similar to PyTorch's `Optimizer` class, with the exception that they do not perform the parameter update themselves: they only modify the gradients of those parameters. They can thus be used in combination with your favorite optimizer (we used SGD in our experiments). Note that we only implemented them for `Linear` and `Conv2d` modules, so they will silently skip all the other modules of your network.
Here is a simple example showing how to add K-FAC or EKFAC to your code:
```python
# 0. Import the preconditioner (import path assumed to match this repo's layout).
from ekfac import EKFAC

# 1. Instantiate the preconditioner.
preconditioner = EKFAC(network, 0.1, update_freq=100)

# 2. During the training loop, simply call preconditioner.step() before optimizer.step().
#    The optimizer is usually SGD.
for i, (inputs, targets) in enumerate(train_loader):
    optimizer.zero_grad()
    outputs = network(inputs)
    loss = criterion(outputs, targets)
    loss.backward()
    preconditioner.step()  # Add a step of preconditioner before the optimizer step.
    optimizer.step()
```
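
For completeness, here is a minimal end-to-end sketch showing the preconditioner used together with an SGD optimizer, this time with K-FAC. The names `network`, `train_loader`, and the `KFAC(network, eps, update_freq=...)` call are assumptions for illustration (the `KFAC` constructor is assumed to mirror the `EKFAC` one above); check the class definitions in this repo for the exact signatures.

```python
# Sketch of a full training setup (assumed API, mirroring the EKFAC example above).
import torch
import torch.nn as nn
import torch.optim as optim

from kfac import KFAC  # assumed import path for this repo's K-FAC class

# A small network: only the Conv2d and Linear modules will be preconditioned,
# the other modules are silently skipped.
network = nn.Sequential(
    nn.Conv2d(3, 16, 3, padding=1),
    nn.ReLU(),
    nn.Flatten(),
    nn.Linear(16 * 32 * 32, 10),
)

criterion = nn.CrossEntropyLoss()
optimizer = optim.SGD(network.parameters(), lr=0.01, momentum=0.9)
preconditioner = KFAC(network, 0.1, update_freq=100)  # assumed signature

# train_loader: your usual DataLoader yielding (inputs, targets) batches (not shown).
for epoch in range(10):
    for inputs, targets in train_loader:
        optimizer.zero_grad()
        loss = criterion(network(inputs), targets)
        loss.backward()
        preconditioner.step()  # precondition the gradients in place
        optimizer.step()       # then let SGD apply the update
```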
- Thomas George, César Laurent, Xavier Bouthillier, Nicolas Ballas, Pascal Vincent, Fast Approximate Natural Gradient Descent in a Kronecker-factored Eigenbasis, NIPS 2018
- James Martens, Roger Grosse, Optimizing Neural Networks with Kronecker-factored Approximate Curvature, ICML 2015
- Roger Grosse, James Martens, A Kronecker-factored Approximate Fisher Matrix for Convolution Layers, ICML 2016
- César Laurent, Thomas George, Xavier Bouthillier, Nicolas Ballas, Pascal Vincent, An Evaluation of Fisher Approximations Beyond Kronecker Factorization, ICLR Workshop 2018
- Jimmy Ba, Roger Grosse, James Martens, Distributed Second-order Optimization using Kronecker-Factored Approximations, ICLR 2017
- Jean Lafond, Nicolas Vasilache, Léon Bottou, Diagonal Rescaling For Neural Networks, arXiv 2017