Skip to content

Convolutional neural network to classify audio files (Python, Keras, Tensorflow) and its GUI (C#).

Notifications You must be signed in to change notification settings

musevarg/AI-Neural-Network-Classifying-Guitar-Distortions

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 

Repository files navigation

AI Neural Network to classify Guitar Distortions

This project aimed to find out whether a Convolutional Neural Network (CNN) could be trained to classify audio files, in this case guitar distortions.

1440 audio files were turned into mel-spectrograms (visual representations of an audio signal, with time on the x-axis, frequencies on the y-axis and the intensifty of the frequency in dB represented by the color/z-axis).

The CNNs were implemented in Python using the keras API and tensorflow as backend, and the models produced were all wrapped in a C# GUI to allow for a nice and seamless experience.

Two apporaches were taken:

  • One multi-class model (with 12 classes)
  • Multiple binary models (2 classes each)

It has been found that multiple binary models performed better, with an average accuracy of 98% while the multi-class model obtained 81% accuracy. This, however, is hindered by the rather small collection of audio files that I had to produce myself. I believe that in the future, a multi-class algorithm should perform better if more data becomes available and would also allow a developer to work with more classes.

For more info, please read the report here.

short-demo.mp4

Releases

No releases published

Packages

No packages published