Detection of Hate Speech Detection in Videos Using Machine Learning

[Master's Project]

To detect hate speech in videos based on the spoken content of the videos using machine learning.

Extract audio from video using FFmpeg API
Convert audio to text transcript using Google Cloud Speech-to-Text API
Extract features to get word counts, frequency of word counts, uni-grams and bi-grams
Train Naive Bayes, Linear SVM, Random Forrest and RNN models
Compute Accuracy, Precision score, Recall score, F1 score

YouTube videos are searched and downloaded using a YouTube crawler
Videos are searched using the 'search by keyword' function provided by YouTube Data API
Searched videos are downloaded using Pytube library
Each video is labeled as Normal or Hateful (Racist, Sexist)

Note: You can view .ipynb files using nbviewer - https://nbviewer.jupyter.org/

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
DetectionModule		DetectionModule
ProcessingModule		ProcessingModule
README.md		README.md

Provide feedback