ICLR'24 Self-supervised Representation Learning from Random Data Projectors

The codebase for Self-supervised Representation Learning from Random Data Projectors.

We proposed a novel self-supervised learning framework by learning from random data projectors (LFR) without any data augmentations. The proposed LFR:

Do not require domain-specific knowledge or specific model architecture.
Can be applied on any data modality and application domains.
Outperforms multiple state-of-the-art SSL baselines on a wide range of data modalities(image, sequential, and tabular) and real-world applications(banking, healthcare and natural sciences).

Installation & Usage

Installation

First clone this repository, then navigate to the directory and pip install to install all required packages.

git clone git@github.com:layer6ai-labs/lfr
cd lfr
conda env create -f environment.yml
conda activate lfr
pip install h5py

Usage

python main.py --method lfr --dataset kvasir -a resnet18\
        --num_targets 6  --target_layers 2\
        --init-beta --random-dropout\
        --target_sample_ratio 1 --num_of_classes 8\
        --epochs 400 -b 256 --lr 0.0001\
        --optimizer-type sgd --momentum 0.9 --wd 5e-4\
        --fix_pred_lr --train-predictor-individually\
        --pred_epochs 5 --pred_layers 2\
        --eval_epochs 100 --eval_lr 0.001 --eval_bs 256\
        --dim 2048 --pred_dim 256 --loss barlow-batch\
        --num-of-runs 1

See scripts/ for further details on commandline parameters.

Data Access

datasets

MIMIC-III is private dataset. Data pre-processing can be found in this notebook.

All datasets used in our experiments (raw and pre-processed) can be found at Google drive. Redistribution of these datasets is permitted under their licenses.

To run the code: create data/folder, download and unzip any .zip files into the folder. The original data-preprocessing code for each dataset can be found inside notebooks folder.

Citing

If you use any part of this repository in your research, please cite the associated paper with the following bibtex entry:

Authors: Yi Sui, Tongzi Wu, Jesse C. Cresswell, Ga Wu, George Stein, Xiao Shi Huang, Xiaochen Zhang, Maksims Volkovs

@inproceedings{sui2024selfsupervised,
      title={Self-supervised Representation Learning from Random Data Projectors}, 
      author={Yi Sui and Tongzi Wu and Jesse C. Cresswell and Ga Wu and George Stein and Xiao Shi Huang and Xiaochen Zhang and Maksims Volkovs},
      booktitle={International Conference on Learning Representations},
      year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
logos		logos
notebooks		notebooks
scripts		scripts
ssl_models		ssl_models
utils		utils
README.md		README.md
environment.yml		environment.yml
linear_eval.py		linear_eval.py
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ICLR'24 Self-supervised Representation Learning from Random Data Projectors

Installation & Usage

Installation

Usage

Data Access

datasets

Citing

About

Releases

Packages

Contributors 2

Languages

layer6ai-labs/lfr

Folders and files

Latest commit

History

Repository files navigation

ICLR'24 Self-supervised Representation Learning from Random Data Projectors

Installation & Usage

Installation

Usage

Data Access

datasets

Citing

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages