Skip to content

itsleeds/tdsarchive

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

TDS (Transport Data Science)

This is a GitHub Repository (repo for short) that supports teaching of the Transport Data Science module at the University of Leeds. The module can be taken by students on the Data Science and Analytics and the Transport Planning and the Environment MSc courses.

Summary Description Date Location Duration (Hours)
TDS deadline 1 Computer set-up 2024-02-02 Online - Teams 0
TDS Practical 1: intro Introduction to transport data science 2024-02-08 Irene Manton North Cluster (7.96) 3
TDS Practical 2: od Origin-destination data 2024-02-15 Irene Manton North Cluster (7.96) 3
TDS Practical 3: routing Routing 2024-02-22 Irene Manton North Cluster (7.96) 3
TDS seminar 1 Seminar 1: Tom Van Vuren, Amey and ITS 2024-02-22 Institute for Transport Studies 1.11 3
TDS Practical 4: getting Getting transport data 2024-02-29 Irene Manton North Cluster (7.96) 3
TDS seminar 2 Seminar 2 Will Deakin, Network Rail 2024-03-21 Institute for Transport Studies 1.11 3
TDS deadline 2 Draft portfolio 2024-04-22 Online - Teams 0
TDS Practical 5: visualisation Visualising transport data 2024-05-02 Irene Manton North Cluster (7.96) 3
TDS Practical 6: project Project work 2024-05-09 Irene Manton North Cluster (7.96) 3
TDS deadline 3 Deadline: coursework, 2pm 2024-05-17 Online - Teams 0

Prerequisites

Software

Although you are free to use any software for the course, the emphasis on reproducibility means that popular data science languages such as R and Python are highly recommended.

The teaching will be delivered in R. For this module you therefore need to have up-to-date versions of R and RStudio installed on a computer you have access to:

  • R from cran.r-project.org
  • RStudio from rstudio.com
  • R packages, which can be installed by opening RStudio and typing install.packages("stats19") in the R console, for example.

You should have the latest stable release of R (4.3.0 or above) and be comfortable setting-up any addition software tools you need for your work. Should have access to a computer with decent resources (e.g. a laptop with 8 GB of more RAM).

See Section 1.5 of the online guide Reproducible Road Safety Research with R for instructions on how to install key packages we will use in the module.1

It is also recommended that you have installed and have experience with GitHub Desktop (or command line git on Linux and Mac), Docker, Python, QGIS and transport modelling tools such as SUMO and A/B Street. These software packages will help with the course but are not essential.

Data science experience

Attending the Introduction to R one-off 3 hour workshop (semester 1 Computer Skills workshop) and experience of using R (e.g. having used it for work, in previous degrees or having completed an online course) is essential. Students can demonstrate this by showing evidence that they have worked with R before, have completed an online course such as the first 4 sessions in the RStudio Primers series https://rstudio.cloud/learn/primers or DataCamp’s Free Introduction to R course: https://www.datacamp.com/courses/free-introduction-to-r. This is an advanced and research-led module. Evidence of substantial programming and data science experience in previous professional or academic work, in languages such as R or Python, also constitutes sufficient pre-requisite knowledge for the course.

Course reading

See the handbook.

Assessment (for those doing this as credit-bearing)

  • You will build-up a portfolio of work
  • 100% coursework assessed, you will submit by Friday 17th May:
    • a pdf document up to 10 pages long with figures, tables, references explaining how you used data science to research a transport problem
    • reproducible code contained in an RMarkdown (.Rmd) document that produced the report
  • Written in RMarkdown - will be graded for reproducibility
  • Code chunks and figures are encouraged
  • You will submit a non-assessed 2 page pdf + Rmd report by Friday 23rd February

Issues and contributing

Any feedback or contributions to this repo are welcome. If you have a question please open an issue here (you’ll need a GitHub account): https://github.com/ITSLeeds/TDS/issues

References

Footnotes

  1. For further guidance on setting-up your computer to run R and RStudio for spatial data, see these links, we recommend Chapter 2 of Geocomputation with R (the Prerequisites section contains links for installing spatial software on Mac, Linux and Windows): https://geocompr.robinlovelace.net/spatial-class.html and Chapter 2 of the online book Efficient R Programming, particularly sections 2.3 and 2.5, for details on R installation and set-up and the project management section.