Machine Learning with Python

Getting started with scikit-learn

Following on from the Introduction to Machine Learning course, this series of hands-on workshops will get you started with applying supervised and unsupervised machine learning methods in Python, using the popular scikit-learn package.

Intended Learning Outcomes

After completing this workshop, you will be better able to:

Prepare a dataset for machine learning in Python
Select a scikit-learn method appropriate for a particular learning task
Construct your own workflows for model training and testing
Evaluate the performance of a model

Setup

We will be working with python using jupyter notebooks. The easiest way to access jupyter is via the Anaconda platform.

Please install Anaconda from https://www.anaconda.com in advance of the first session.

Please ensure that you have an up-to-date scikit-learn package installed prior to starting the first session. General installation instructions are available here: https://scikit-learn.org/stable/install.html#installation-instructions

scikit-learn is part of the default installation of Anaconda, so you may already have everything you need.

Getting Started

Download this repository to your computer as a ZIP file and unpack it.

Open JupyterLab (within Anaconda) and navigate to the unpacked directory to work with the .ipynb notebooks.

Alternatively, you can run the notebooks online using Binder:

Data sets

We will be working with a variety of real and synthetic data sets to illustrate various methods. For your own work between classes, you will be asked to identify a suitable data set from your own research or from other work within your field.

You can start thinking about this before the course, but the main requirements for a machine learning data set will be discussed more during the first session.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
binder		binder
1_1_Preliminaries.ipynb		1_1_Preliminaries.ipynb
1_2_Data.ipynb		1_2_Data.ipynb
1_3_Dimensionality_Reduction.ipynb		1_3_Dimensionality_Reduction.ipynb
1_4_Clustering.ipynb		1_4_Clustering.ipynb
1_5_Homework.ipynb		1_5_Homework.ipynb
2_1_Classification.ipynb		2_1_Classification.ipynb
2_2_Regression.ipynb		2_2_Regression.ipynb
2_3_Evaluation.ipynb		2_3_Evaluation.ipynb
2_4_Homework.ipynb		2_4_Homework.ipynb
3_1_Text_Data.ipynb		3_1_Text_Data.ipynb
3_2_Sentiment_Analysis.ipynb		3_2_Sentiment_Analysis.ipynb
3_3_Unsupervised_Approaches.ipynb		3_3_Unsupervised_Approaches.ipynb
3_4_Further_Learning.ipynb		3_4_Further_Learning.ipynb
README.md		README.md
codon_usage.csv		codon_usage.csv
imdb.zip		imdb.zip
reviews.json		reviews.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Machine Learning with Python

Intended Learning Outcomes

Setup

Getting Started

Data sets

About

Releases

Packages

Languages

juliapurrinos/RCDS-machine-learning-with-python

Folders and files

Latest commit

History

Repository files navigation

Machine Learning with Python

Intended Learning Outcomes

Setup

Getting Started

Data sets

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages