COVID-19-Deep-Learning

tensorflow projects for diagnosis and prognostic estimation of COVID-19

This is a deep-learning framework for identificating high-risk COVID-19 patients and estimating how long the patient can be cured。

requirments

Anaconda python 3.7.3 Win10
Tensorflow 2.0.0 with GPU

network architecture (see achitecture.pptx)

pretrained model and 50 data subjects for evaluation

password for download:8vst

#after download, unzip the checkpoint.zip, then put all directories of weight files as well as the files for normalization (feature_minv.npy and the feature_maxv.npy) to tf_covid19_care/checkpoints

#if you have any problem, please feel free to ask questions via sending email to wjcy19870122@sjtu.edu.cn

Training

(1) prepare your data (see the 50 data subjects for examples).
(2) cd trainers and run the file: run_train.bat.
Note: you may need to modify the configs/cfgs.py file:changing cfg.data_set to the directory of your dataset.

Evaluation

(1) cd tests and run the run_test.bat file.
(2) run the compute_metrics.py file to obtain the results.

Results

#The average error between predicted recovery day and true recovery day. The results demonstrate that treatment schemes have significant impact on the predictions.

#The top 10 features with a significant impact on model prediction. AM: Albumin, HG: Hemoglobin, TP: Total Protein, α-HBDH: Alpha-hydroxybutyrate Dehydrogenase, CRP: C-reactive Protein, EPC: Expectoration, SK: Shock, PA: Poor Appetite, PS: Poor Spirits, CGH: Cough, WK: Weakness, CCBD: Chest Congestion/Breathing Difficulty, ARDS: Acute Respiratory Distress Syndrome, LDH: Lactate Dehydrogenase, DB: Diabetes. These features are significant on both prediction tasks (a and b: severity-level grading; c, d, e and f: recovery-time regression). However, the impact ranking is different among these features. The p-values calculated from multi-variable linear classification/regression analyses demonstrate that some features especially the symptoms (see the red p-values) are non-significant for linear analysis methods.

#Visualization of the predicted probability distribution for four patients. The days need for a patient to be cured can be estimated by the day with the maximum probability (see the vertical dashed lines). Besides, the cumulative incidence function (e.g., P(t≤7)=17.1%) can also be calculated to assess the risk of patients. The top-3 features are shown to explain the decision made by the model. The dead patient #4 can be easily identified by observing the shape of the curve.

TODO

address imbalance cured-days distribution problem
evluation on multi-center data

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
tf_covid19_care		tf_covid19_care
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

COVID-19-Deep-Learning

requirments

network architecture (see achitecture.pptx)

password for download:8vst

Training

Evaluation

Results

TODO

About

Releases

Packages

Languages

License

wangjuncongyu/COVID-19-Deep-Learning

Folders and files

Latest commit

History

Repository files navigation

COVID-19-Deep-Learning

requirments

network architecture (see achitecture.pptx)

password for download:8vst

Training

Evaluation

Results

TODO

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages