Residual networks can be viewed as ensembles of many shallower sub-networks. The idea is to train a residual network so that the knowledge of this ensemble is distilled into its sub-networks within a single training procedure. The advantages of this approach are:
- Improved accuracy of the original ResNet
- Training residual networks of multiple depths in a single, efficient procedure
- A better approach to knowledge distillation than traditional two-stage teacher/student methods
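The core ingredient of such a distillation objective is a loss that pulls a shallower sub-network's predictions toward those of the full network. Below is a minimal NumPy sketch (not the authors' implementation) of the standard temperature-softened KL distillation term, where the full-depth network plays the teacher and a sub-network of reduced depth plays the student; the logit values and the temperature `T=2.0` are illustrative assumptions:

```python
import numpy as np

def softmax(logits, T=1.0):
    """Temperature-softened softmax; higher T gives a softer distribution."""
    z = np.asarray(logits, dtype=float) / T
    z -= z.max()  # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum()

def distill_loss(teacher_logits, student_logits, T=2.0):
    """KL(teacher || student) between temperature-softened distributions."""
    p = softmax(teacher_logits, T)
    q = softmax(student_logits, T)
    return float(np.sum(p * (np.log(p) - np.log(q))))

# Hypothetical logits: full-depth network (teacher) vs. a shallow sub-network.
full_net_logits = [4.0, 1.0, 0.5]
sub_net_logits = [3.5, 1.2, 0.3]

loss = distill_loss(full_net_logits, sub_net_logits)
```

During training, one such term per sub-network depth would be added to the usual cross-entropy loss, so all depths are optimized jointly in the single procedure described above.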