TUT888/VietnameseAccentRecognition

Vietnamese Accent Recognition

Project: Vietnamese Accent Recognition using Deep Learning

Project Description

The project runs on Google Colab and includes the following sections:

  • Data preprocessing and feature extraction using MFCCs (Mel-frequency cepstral coefficients)
  • Training and prediction with two types of models: a CNN (convolutional neural network) and an RNN (recurrent neural network)
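
As a rough illustration of the feature extraction step, the sketch below computes MFCCs with NumPy only: it frames the signal, takes the power spectrum, applies a triangular mel filterbank, takes the log, and applies a DCT-II. It is not taken from the notebook, which may well rely on an audio library instead. The function names and all parameter values (16 kHz sample rate, 13 coefficients, 40 mel bands, 512-sample frames, 160-sample hop) are assumptions chosen for the example.

```python
import numpy as np


def hz_to_mel(hz):
    """Convert frequency in Hz to the mel scale (HTK-style formula)."""
    return 2595.0 * np.log10(1.0 + np.asarray(hz, dtype=float) / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (np.asarray(mel, dtype=float) / 2595.0) - 1.0)


def mel_filterbank(n_mels, n_fft, sample_rate):
    """Triangular filters evenly spaced on the mel scale.

    Returns an array of shape (n_mels, n_fft // 2 + 1).
    """
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0), n_mels + 2)
    hz_points = mel_to_hz(mel_points)
    fft_freqs = np.linspace(0.0, sample_rate / 2.0, n_fft // 2 + 1)
    bank = np.zeros((n_mels, fft_freqs.size))
    for i in range(n_mels):
        left, center, right = hz_points[i], hz_points[i + 1], hz_points[i + 2]
        rising = (fft_freqs - left) / (center - left)
        falling = (right - fft_freqs) / (right - center)
        bank[i] = np.maximum(0.0, np.minimum(rising, falling))
    return bank


def mfcc(signal, sample_rate=16000, n_mfcc=13, n_mels=40, n_fft=512, hop=160):
    """Return an (n_frames, n_mfcc) array of MFCCs for a mono signal.

    All default values are assumptions made for this example.
    """
    signal = np.asarray(signal, dtype=float)
    if signal.size < n_fft:
        # Zero-pad very short clips so that at least one frame exists.
        signal = np.pad(signal, (0, n_fft - signal.size))
    n_frames = 1 + (signal.size - n_fft) // hop
    # Index matrix that slices the signal into overlapping frames.
    idx = np.arange(n_fft)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hamming(n_fft)
    power = np.abs(np.fft.rfft(frames, n=n_fft, axis=1)) ** 2 / n_fft
    mel_energy = power @ mel_filterbank(n_mels, n_fft, sample_rate).T
    log_mel = np.log(mel_energy + 1e-10)  # small offset avoids log(0)
    # Unnormalized DCT-II basis, keeping the first n_mfcc coefficients.
    k = np.arange(n_mfcc)[:, None]
    n = np.arange(n_mels)[None, :]
    dct_basis = np.cos(np.pi * k * (2 * n + 1) / (2 * n_mels))
    return log_mel @ dct_basis.T


# Usage: one second of a synthetic 440 Hz tone stands in for a recording.
t = np.arange(16000) / 16000.0
features = mfcc(np.sin(2 * np.pi * 440.0 * t))
print(features.shape)  # (97, 13)
```

The resulting matrix has one row per frame and one column per coefficient, so it can be treated as a 2D "image" for a CNN or as a sequence of frame vectors for an RNN.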

Dataset

  • The dataset is the Vietnamese portion of Common Voice, provided by Mozilla. Mozilla started Common Voice to build a free database that developers can use to create voice recognition software. At the time this project was completed, Common Voice offered datasets in multiple languages, including Vietnamese. More details can be found at: Mozilla Common Voice.
  • This project uses a portion of the Common Voice Corpus 9.0 dataset (updated on 27/04/2022). The downloaded data includes voice recordings from speakers of different ages and regions, along with an Excel file containing the sentence that corresponds to each recording.

Files in repository

  • vietnamese_accent_recognition.ipynb: The main Jupyter Notebook file of the project.
  • audio_record_details.xlsx: Excel file containing the audio recording information.
  • audio_records: Folder containing the audio recordings.

Authors

The project was completed by a group of two members:
