This project extracts emotion from speech audio by combining the meaning of the spoken sentence with the sound's acoustic features. The model can classify the emotions fear, anger, disgust, joy, surprise, sadness and neutral.
MELD (Multimodal EmotionLines Dataset) is used as the dataset. We chose this dataset specifically because we need different emotions paired with meaningful sentences for the sentiment analysis to work properly.
WhisperAI is used to transcribe text from the audio for sentiment analysis.
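A minimal sketch of the transcription step with the `openai-whisper` package (the model size and file path are placeholders, not necessarily what the project uses):

```python
import whisper

# Load a Whisper model; "base" is an assumed size, the project may use a different one.
whisper_model = whisper.load_model("base")

# Transcribe a single sound file and keep the recognized text for sentiment analysis.
result = whisper_model.transcribe("example_utterance.wav")
transcript = result["text"].strip()
print(transcript)
```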
An NLTK model is used to run sentiment analysis on the transcribed text, which yields a result of positive, negative or neutral.
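Assuming the NLTK model in question is the VADER sentiment analyzer (a common choice for positive/negative/neutral labels), the step could look like this sketch; the thresholds on the compound score follow VADER's usual convention:

```python
import nltk
from nltk.sentiment import SentimentIntensityAnalyzer

nltk.download("vader_lexicon")  # one-time download of the VADER lexicon

sia = SentimentIntensityAnalyzer()
scores = sia.polarity_scores(transcript)  # {'neg': ..., 'neu': ..., 'pos': ..., 'compound': ...}

# Map the compound score to the three labels used in the project.
if scores["compound"] >= 0.05:
    sentiment = "positive"
elif scores["compound"] <= -0.05:
    sentiment = "negative"
else:
    sentiment = "neutral"
```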
An LSTM model is used for the emotion analysis. The model uses the sentiment result together with the sound file's features to determine which emotion the sound file belongs to.
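The exact feature set and architecture are not described here, so the following is only an illustrative sketch: MFCC frames are used as the acoustic features, the sentence-level sentiment score is appended to every frame, and a small Keras LSTM maps the sequence to the seven emotion classes.

```python
import numpy as np
import librosa
import tensorflow as tf

def build_sequence(audio_path, sentiment_score, n_mfcc=40):
    """Build one input sequence: MFCC frames plus a repeated sentiment value (an assumption)."""
    y, sr = librosa.load(audio_path, sr=None)
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc).T          # (frames, n_mfcc)
    sentiment = np.full((mfcc.shape[0], 1), sentiment_score)          # (frames, 1)
    return np.concatenate([mfcc, sentiment], axis=1)                  # (frames, n_mfcc + 1)

# Hypothetical architecture; the real hyperparameters come from the optimization step.
model = tf.keras.Sequential([
    tf.keras.layers.Masking(mask_value=0.0, input_shape=(None, 41)),  # variable-length sequences
    tf.keras.layers.LSTM(128),
    tf.keras.layers.Dense(64, activation="relu"),
    tf.keras.layers.Dense(7, activation="softmax"),                   # 7 emotion classes
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
```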
First, the dataset must be downloaded for the project to run correctly. Since the dataset is too big to upload to GitHub, it is hosted on Google Drive.
Here's the link: https://drive.google.com/file/d/1547d1dz2_kgBUKx-AHKC110-18PeNbVl/view?usp=drive_link
After downloading the zip file from the Drive link, the dataset folder must be extracted into the project folder.
Second, WhisperAI does not support Python versions newer than 3.9.9, so we recommend using version 3.9.9. Otherwise, the project cannot transcribe text from the audio.
The project has 3 different Python files.
"delete_short_files.py" is a Python file that deletes sound files which uses only one word to form a sentence. This file clears dataset from sound file's with less meaning.
"model_optimization" uses TensorBoard to find the optimal hyperparameters for LSTM model. Results are saved as log files, so they can be examined if wanted.
"main_file.py" used to run the main project. It has a simple application UI. User can upload files and record its sound for the emotion extraction process. After the extraction process, resulting graphs will be seen on the application screen. Resulting graphs is saved in the Graph folder of the project. Also, user can see the performance of the model by running training function.
Accuracy results of all of the models in the optimization process:
From all of the models, the two most accurate ones are chosen. Accuracy of these two models:
Loss of these two models:
Accuracy results of the trained model:
Loss of the trained model:
Confusion matrix of the test set:
Scores of the model:
The user uploads more than one file:
Real values of these files are:
As the comparison of these results and the confusion matrix shows, anger and sadness, as well as surprise and joy, can be mixed up.
If there is only one sound file to extract emotion from, the graph changes: it shows each emotion's percentage for that sound file. Here are some of the results:
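A minimal sketch of how such a per-emotion percentage graph could be drawn with matplotlib; the probability values below are placeholders, while in the real application they come from the LSTM's softmax output for the uploaded file:

```python
import matplotlib.pyplot as plt

emotions = ["fear", "anger", "disgust", "joy", "surprise", "sadness", "neutral"]
probabilities = [0.05, 0.10, 0.05, 0.40, 0.15, 0.15, 0.10]  # placeholder softmax output

plt.bar(emotions, [p * 100 for p in probabilities])
plt.ylabel("Percentage (%)")
plt.title("Emotion distribution of the uploaded sound file")
plt.tight_layout()
plt.savefig("single_file_result.png")  # the project saves its graphs to the Graph folder
plt.show()
```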
Here are some images showing the GUI:
Accuracy of other projects that use the same dataset:
Accuracy of our project:
As the comparison above shows, the other projects' accuracy results are below 70%, while this project's accuracy ranges between 78% and 80%. By adding sentiment analysis to the emotion extraction, we built a better model.