During or after learning new skills, humans retain most of the skills they learned before. This is difficult for artificial neural networks (ANNs): they tend to forget previously learned tasks while and after learning new ones, a phenomenon known as "catastrophic forgetting". To mitigate this effect, Kirkpatrick et al., in their well-known work https://arxiv.org/abs/1612.00796, introduced the Elastic Weight Consolidation (EWC) technique. It measures how important each model parameter is to the prior task via the Fisher Information Matrix, which then forms the basis of a penalty added to the loss when training on the new task.
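To make the penalty concrete, here is a minimal sketch of the quadratic EWC term in TF2. The names `ewc_penalty`, `old_params`, `fisher`, and `lam` are illustrative assumptions, not necessarily the ones used in this repository:

```python
import tensorflow as tf

def ewc_penalty(model, old_params, fisher, lam=100.0):
    """Quadratic EWC term: (lam / 2) * sum_i F_i * (theta_i - theta*_i)^2.

    `old_params` and `fisher` are lists aligned with model.trainable_variables,
    holding the parameters learned on the prior task and their estimated
    diagonal Fisher information, respectively (illustrative names).
    """
    penalty = tf.constant(0.0)
    for theta, theta_star, f in zip(model.trainable_variables, old_params, fisher):
        penalty += tf.reduce_sum(f * tf.square(theta - theta_star))
    return (lam / 2.0) * penalty
```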
The available repositories implement this technique in TF1, Keras, and PyTorch. Here, I implement it in TensorFlow 2 (TF2) with a custom training loop using GradientTape and eager execution.
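As a rough illustration of what such a custom loop can look like, below is a sketch of an empirical diagonal-Fisher estimate and a single eager training step that reuses the `ewc_penalty` sketch above. Names like `estimate_diag_fisher` and `train_step` are illustrative, not necessarily this repository's API:

```python
import tensorflow as tf

loss_fn = tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True)
optimizer = tf.keras.optimizers.Adam()

def estimate_diag_fisher(model, dataset, num_batches=100):
    """Rough empirical estimate of the diagonal Fisher information:
    average of squared gradients of the log-likelihood on prior-task data."""
    fisher = [tf.zeros_like(v) for v in model.trainable_variables]
    n = 0
    for x, y in dataset.take(num_batches):
        with tf.GradientTape() as tape:
            logits = model(x, training=False)
            log_lik = -loss_fn(y, logits)  # mean log-likelihood of the labels
        grads = tape.gradient(log_lik, model.trainable_variables)
        fisher = [f + tf.square(g) for f, g in zip(fisher, grads)]
        n += 1
    return [f / n for f in fisher]

def train_step(model, x, y, old_params, fisher, lam=100.0):
    """One eager training step on the new task with the EWC penalty added."""
    with tf.GradientTape() as tape:
        logits = model(x, training=True)
        loss = loss_fn(y, logits) + ewc_penalty(model, old_params, fisher, lam)
    grads = tape.gradient(loss, model.trainable_variables)
    optimizer.apply_gradients(zip(grads, model.trainable_variables))
    return loss
```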
Below are plots from the appended demo notebook illustrating how to use this code; the example uses the MNIST data set and a permuted version of it (a sketch of how the permuted task can be constructed follows).
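For reference, a permuted-MNIST task is typically built by applying one fixed random pixel permutation to every image while keeping the labels unchanged. A minimal sketch, not necessarily the notebook's exact code:

```python
import numpy as np
import tensorflow as tf

(x_train, y_train), (x_test, y_test) = tf.keras.datasets.mnist.load_data()
x_train = x_train.reshape(-1, 784).astype("float32") / 255.0
x_test = x_test.reshape(-1, 784).astype("float32") / 255.0

# One fixed pixel permutation defines the second task; the labels stay the same.
rng = np.random.default_rng(seed=0)
perm = rng.permutation(784)
x_train_perm = x_train[:, perm]
x_test_perm = x_test[:, perm]
```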
Please reach out if you find any issues in the implementation logic or code, or if you have advice on how to improve things.