This software project accompanies the research paper, BatchQuant: Quantized-for-all Architecture Search with Robust Quantizer.
We propose BatchQuant, a novel quantizer formulation that stabilizes single-shot supernet training for joint mixed-precision quantization and architecture search. Our approach discovers quantized architectures with state-of-the-art efficiency in fewer GPU hours than previous methods.
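To make the idea concrete, the following is a minimal, hypothetical sketch of a fake quantizer whose clipping range is estimated from batch statistics (with running averages used at evaluation time), in the spirit of BatchQuant. It is not the exact formulation from the paper; the class name, buffers, and defaults are illustrative only.

```python
import torch
import torch.nn as nn

class BatchStatQuantizer(nn.Module):
    """Illustrative fake quantizer driven by batch statistics (not the exact
    BatchQuant formulation): the clipping range is estimated per batch during
    training and from running averages at evaluation time."""

    def __init__(self, n_bits=4, momentum=0.1):
        super().__init__()
        self.n_bits = n_bits
        self.momentum = momentum
        self.register_buffer('running_min', torch.zeros(1))
        self.register_buffer('running_max', torch.ones(1))

    def forward(self, x):
        if self.training:
            # Estimate the clipping range from the current batch, analogous to
            # how BatchNorm estimates normalization statistics.
            lo, hi = x.detach().min(), x.detach().max()
            self.running_min.mul_(1 - self.momentum).add_(self.momentum * lo)
            self.running_max.mul_(1 - self.momentum).add_(self.momentum * hi)
        else:
            lo, hi = self.running_min, self.running_max

        # Uniform affine fake quantization with a straight-through estimator.
        n_levels = 2 ** self.n_bits - 1
        scale = (hi - lo).clamp(min=1e-8) / n_levels
        x_q = ((x - lo) / scale).round().clamp(0, n_levels) * scale + lo
        return x + (x_q - x).detach()  # quantized forward pass, identity gradient
```

The motivation is that different sampled sub-networks induce different activation distributions, so re-estimating the quantization range from each batch keeps the quantizer aligned with the currently active sub-network.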
import torch

from qfa.elastic_nn.networks import QFAMobileNetV3
# set_activation_statistics is provided by the qfa package; import it from the
# module where it is defined in this repository.

# Build the QFA supernet with elastic kernel size, expand ratio, depth, and bit-width.
model = QFAMobileNetV3(n_classes=1000, bn_param=(0.1, 1e-5), dropout_rate=0.,
                       width_mult_list=[1.2], ks_list=[3, 5, 7], expand_ratio_list=[3, 4, 6],
                       depth_list=[2, 3, 4], bits_list=[2, 3, 4])
set_activation_statistics(model)
model.set_max_net()

# Load the pretrained supernet checkpoint.
model_path = 'b234_ps.pth'
init = torch.load(model_path, map_location='cpu')['state_dict']
model.load_state_dict(init)

# Randomly sample a sub-network from the supernet
model.sample_active_subnet()

# Manually set the sub-network (kernel size, expand ratio, depth, bit-width)
model.set_active_subnet(ks=7, e=6, d=4, b=2)
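After choosing a sub-network, one would typically measure its accuracy on the ImageNet validation set. The snippet below is a minimal evaluation sketch assuming a standard torchvision ImageNet loader; the dataset path, preprocessing, and batch size are illustrative and not prescribed by this repository.

```python
import torch
from torchvision import datasets, transforms

# Illustrative ImageNet validation loader; path and preprocessing are assumptions.
val_transform = transforms.Compose([
    transforms.Resize(256),
    transforms.CenterCrop(224),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])
val_loader = torch.utils.data.DataLoader(
    datasets.ImageFolder('/path/to/imagenet/val', val_transform),
    batch_size=128, shuffle=False, num_workers=8)

model.eval()
correct = total = 0
with torch.no_grad():
    for images, labels in val_loader:
        preds = model(images).argmax(dim=1)
        correct += (preds == labels).sum().item()
        total += labels.size(0)
print(f'Top-1 accuracy: {correct / total:.4f}')
```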
python qfa_eval.py --id [0-180]

The `--id` argument takes a value between 0 and 180; a larger model id corresponds to a sub-network with more FLOPs and higher Top-1 accuracy.
horovodrun -np 32 -H <server1_ip>:8,<server2_ip>:8,<server3_ip>:8,<server4_ip>:8 python main.py
bash collect_master.sh qfa
python nsgaii_search.py
crossover_probability:
  type: CATEGORICAL
  range: [ 0.001, 0.002, 0.003, 0.004, 0.005, 0.006, 0.007, 0.008, 0.009, 0.01, 0.02, 0.03, 0.04, 0.05, 0.06, 0.07, 0.08, 0.09, 0.1, 0.2, 0.3, 0.4, 0.5 ]
mutation_probability:
  type: CATEGORICAL
  range: [ 0.001, 0.002, 0.003, 0.004, 0.005, 0.006, 0.007, 0.008, 0.009, 0.01, 0.02, 0.03, 0.04, 0.05, 0.06, 0.07, 0.08, 0.09, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9 ]
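As a small illustration of how these categorical ranges can be consumed, the snippet below samples one NSGA-II hyperparameter configuration from such a file; the file name `search_space.yaml` and the sampling logic are assumptions for illustration, not part of nsgaii_search.py.

```python
import random
import yaml

# Load the categorical search space shown above (file name is an assumption).
with open('search_space.yaml') as f:
    space = yaml.safe_load(f)

# Draw one NSGA-II hyperparameter configuration uniformly at random.
config = {
    name: random.choice(spec['range'])
    for name, spec in space.items()
    if spec['type'] == 'CATEGORICAL'
}
print(config)  # e.g. {'crossover_probability': 0.05, 'mutation_probability': 0.2}
```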
- Python 3.6+
- PyTorch 1.4.0+
- ImageNet dataset (set the path in qfa/imagenet_codebase/data_providers/imagenet.py#L213)
- Horovod
- Scikit-Learn
- Skorch
@inproceedings{NEURIPS2021_08aee627,
author = {Bai, Haoping and Cao, Meng and Huang, Ping and Shan, Jiulong},
booktitle = {Advances in Neural Information Processing Systems},
editor = {M. Ranzato and A. Beygelzimer and Y. Dauphin and P.S. Liang and J. Wortman Vaughan},
pages = {1074--1085},
publisher = {Curran Associates, Inc.},
title = {BatchQuant: Quantized-for-all Architecture Search with Robust Quantizer},
url = {https://proceedings.neurips.cc/paper/2021/file/08aee6276db142f4b8ac98fb8ee0ed1b-Paper.pdf},
volume = {34},
year = {2021}
}