📈 Data Challenge Project

🔖 LSHTM 2491 Data Challenge Module
🎓 MSc in Health Data Science 22/23

Client: Sanofi
Supervisor: Dr Naomi Waterlow, Research Fellow, LSHTM

Members:

Jessica Caterson jjcato9
Dzan Ahmed Jesenkovic dzanahmed
Ignacio Leiva Escobar IgnaLeiva
Naomi Medina-Jaudes naomedina

❓ How do hospitalisations as a result of the tri-demic of RSV, influenza and SARS-CoV-2 in the 2022 post-pandemic season compare to the 2016 - 2019 pre-pandemic circulation of RSV and influenza?

❔ Include analyses of different geographical locations (e.g. Northern/Southern hemisphere) and by age group.
❔ Would the assessment be different if burden is disaggregated by hospitalisations vs. laboratory confirmed cases?

📃 Summary

The project idea was conceputalized by Sanofi, to compare hospitalizations during the current post-pandemic season with pre-pandemic seasons between 2016 and 2019.
Six countries were selected based on data availability and source reputability, with priority given to countries specified by Sanofi. Hospitalization data (rates or absolutes) were collected and standardised, and seasonality of hospitalizations were compared using time series analysis methods.
The results showed that hospitalizations due to RSV, influenza, and SARS-CoV-2 were higher than in pre-pandemic seasons, with similar rates of influenza and RSV hospitalizations and the addition of SARS-CoV-2 hospitalizations. The shift in hospitalization peaks led to 2-5 times higher hospitalizations, with adults over 65 and children under 5 having the highest hospitalization rates.
Report recommends reducing hospitalizations in future winter seasons through continued vaccination programs and public engagement in spread-reducing measures.

⚙️ Requirements

R 4.2.2
RStudio
R Packages:
- tidyverse
- cowplot
- patchwork
- lubridate
- ggsankey

🖥️ Running the scripts

You can fork or download the repository as a .zip file and extract it on your computer. It is recommended to load the LSHTM-DC-Sanofi.Rproj into your R environment to ensure reproducibility. Otherwise, scripts can be run by setting the working directory into folder where repository files are located using setwd() command in R console.

All the dependancies can be installed by running the following code:

install.packages("tidyverse")
install.packages("cowplot")
install.packages("patchwork")
install.packages("lubridate")
install.packages("devtools")
devtools::install_github("davidsjoberg/ggsankey")

After these, you can run the scripts in scripts/cleaning for each country.

Check README.MD in scripts folder to understand the workflow.
In case new raw data is added to data/raw_data, running scripts for each country from scripts/cleaning will result in updated processed and premerged datasets.
Finally, merge.R from scripts/cleaning can be run to coalesce the processed data to data/merged_data/merged_data.csv.

📊 Script outputs

After the data has been coalesced into one aggregated dataset, scripts from scripts/analysis can be run to create the figures.
Figure outputs are placed in output folder, and exported as high-res PNG or PDF files.

📝 Final report

Final report was created with collaborative mode in Google Docs in the form of a research paper, exported to PDF and submitted through Moodle.

Name		Name	Last commit message	Last commit date
Latest commit History 415 Commits
data		data
output		output
scripts		scripts
.gitignore		.gitignore
LICENSE.md		LICENSE.md
LSHTM-DC-Sanofi.Rproj		LSHTM-DC-Sanofi.Rproj
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📈 Data Challenge Project

❓ How do hospitalisations as a result of the tri-demic of RSV, influenza and SARS-CoV-2 in the 2022 post-pandemic season compare to the 2016 - 2019 pre-pandemic circulation of RSV and influenza?

📃 Summary

⚙️ Requirements

🖥️ Running the scripts

📊 Script outputs

📝 Final report

About

Releases

Packages

Contributors 3

Languages

License

dzanahmed/lshtm-dc-sanofi

Folders and files

Latest commit

History

Repository files navigation

📈 Data Challenge Project

❓ How do hospitalisations as a result of the tri-demic of RSV, influenza and SARS-CoV-2 in the 2022 post-pandemic season compare to the 2016 - 2019 pre-pandemic circulation of RSV and influenza?

📃 Summary

⚙️ Requirements

🖥️ Running the scripts

📊 Script outputs

📝 Final report

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages