A repository for Network Analysis Project 2018

Problem Statement:

** Improving student experience of the course catalog in a time constrained, high stake environment **

Same word different context.
Gaps in description.
Relatively new field.
Same soft skill, different unfamiliar contexts.
Course description domain specific or school specific but not student specific.

Data Source:

The data was gathered for the Spring 2018 courses offered in the iSchool. The [data]: https://github.com/pratikshrivastava/NA_PROJECT_SP2018/blob/master/data/data.csv file was created manually using the ischool course pages for the spring 2018 semester. The file contains Id, Name, Description and Instructor of the courses.

Approaches taken.

Entity Based: We used a Google NLP api, for retrieveing the specific noun, pro-noun etc from the course description which helped us to define the entities in them. Once we got the entities, we tried manually classify those entities into categories.

NetWork Graph:

Communities Formed:

Cosine similarity: We used the gensim library of python for creating the corpus, generating the simliarity scores between the different courses. Once, we had the cosine similiarity scores we used networkx for creating the graphs between the nodes whose scores where greater than the set threshold.

This approach had a drawback, as the course description are designed to be different from any of the other courses. Due to this the similarity scores between the documents were very low. This can be observed from the below plot.

histogram plot of similarity score between different courses.

Network Plot generated using similarity score.

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
.ipynb_checkpoints		.ipynb_checkpoints
Entity_Approach		Entity_Approach
Gephi_Analysis		Gephi_Analysis
Sim_Approach		Sim_Approach
__pycache__		__pycache__
data		data
images		images
reports		reports
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
Similarity_Matrix_Course.ipynb		Similarity_Matrix_Course.ipynb
sim_matrix.csv		sim_matrix.csv
utilities.py		utilities.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A repository for Network Analysis Project 2018

Problem Statement:

Data Source:

Approaches taken.

Analysis:

Comparing the suggested electives with the communitites.

About

Releases

Packages

Contributors 2

Languages

pratikshrivastava/NA_PROJECT_SP2018

Folders and files

Latest commit

History

Repository files navigation

A repository for Network Analysis Project 2018

Problem Statement:

Data Source:

Approaches taken.

Analysis:

Comparing the suggested electives with the communitites.

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages