Check our latest topic modeling toolkit TopMost !
Unzip and move the datasets to ./data
python==3.6
tensorflow-gpu==1.13.1
scipy==1.5.2
scikit-learn==0.23.2
python run.py --data_dir data/{dataset} --output_dir output/{dataset}
-
Topic coherence: topic_interpretability
-
Topic diversity:
python utils/TU.py --data_path {path of topic word file}
-
Classification: use the SVM of scikit-learn.
If you want to use our code, please cite as
@inproceedings{Wu2020,
author = {Wu, Xiaobao and Li, Chunping and Zhu, Yan and Miao, Yishu},
booktitle = {International Conference on Natural Language Processing and Chinese Computing},
title = {{Learning Multilingual Topics with Neural Variational Inference}},
year = {2020}
}