Automates the entire process: given an Excel file containing 100 article URLs, the script scrapes each article's content from the web, performs sentiment analysis, calculates readability scores, and saves the output to an Excel file.
- Uses 7 text files of stop words along with a master dictionary of positive and negative words
- Uses BeautifulSoup extensively for HTML parsing and TextBlob for tokenization
- Sentiment analysis: positive score, negative score, polarity and subjectivity score
- Readability analysis: Fog index, complex word count, average word length, etc.
- Create a virtual environment
- Clone the repository
- Install the dependencies: `pip install -r requirements.txt`
- Run the analysis: `python data_analysis.py`
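
The sentiment and readability scores listed above can be sketched roughly as follows. This is a minimal illustration, not the code in `data_analysis.py`: the function names are hypothetical, a plain regex stands in for TextBlob tokenization so the snippet runs with the standard library only, syllables are estimated by counting vowel groups, and the formulas (polarity and subjectivity ratios, the standard Gunning fog index, "complex" meaning more than two syllables) are assumptions that may differ from what the script actually computes.

```python
import re


def tokenize(text):
    """Lowercase word tokens (regex stand-in for TextBlob tokenization)."""
    return re.findall(r"[a-z]+", text.lower())


def count_syllables(word):
    """Rough syllable estimate: number of consecutive-vowel groups (assumption)."""
    return max(1, len(re.findall(r"[aeiouy]+", word)))


def sentiment_scores(text, positive, negative, stop_words):
    """Dictionary-based sentiment scores; formulas are assumed, not from the repo."""
    words = [w for w in tokenize(text) if w not in stop_words]
    pos = sum(w in positive for w in words)
    neg = sum(w in negative for w in words)
    eps = 1e-6  # avoids division by zero when no words match
    return {
        "positive_score": pos,
        "negative_score": neg,
        "polarity": (pos - neg) / (pos + neg + eps),
        "subjectivity": (pos + neg) / (len(words) + eps),
    }


def readability_scores(text):
    """Gunning fog index, complex word count, and average word length."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = tokenize(text)
    if not words or not sentences:
        return {"fog_index": 0.0, "complex_word_count": 0, "avg_word_length": 0.0}
    complex_count = sum(count_syllables(w) > 2 for w in words)
    avg_sentence_length = len(words) / len(sentences)
    pct_complex = 100 * complex_count / len(words)
    return {
        "fog_index": 0.4 * (avg_sentence_length + pct_complex),
        "complex_word_count": complex_count,
        "avg_word_length": sum(len(w) for w in words) / len(words),
    }


if __name__ == "__main__":
    # Tiny made-up word lists; the project loads these from its text files.
    sample = "The product is good. The delivery was terrible and unacceptable."
    print(sentiment_scores(sample, {"good"}, {"terrible", "unacceptable"},
                           {"the", "is", "was", "and"}))
    print(readability_scores(sample))
```

In the real pipeline the word sets would come from the stop-word files and the master dictionary, and each article's scores would be written as one row of the output Excel file.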