This is the demo implementation of the paper "An End-to-End Visual-Audio Attention Network for Emotion Recognition in User-Generated Videos". [NOT OFFICIAL!!]
Original Paper Project Page | Paper
- PyTorch (ver. 0.4+ required)
- FFmpeg
- Python3
- PyQt5
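A quick way to confirm the environment matches the requirements above (a minimal sketch; it only checks that the listed dependencies are importable and that FFmpeg is on PATH):

```python
# Minimal environment check for the requirements listed above.
import shutil
import sys

import torch               # PyTorch (0.4+ expected)
from PyQt5 import QtCore   # PyQt5 (listed in the requirements)

assert sys.version_info[0] == 3, "Python 3 is required"
assert shutil.which("ffmpeg"), "FFmpeg must be installed and on PATH"
print("Python :", sys.version.split()[0])
print("PyTorch:", torch.__version__)
print("Qt     :", QtCore.QT_VERSION_STR)
```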
If you just want to use the demo, this step is not necessary; downloading the pre-trained and trained models is enough.
- Download the videos here. (official)
- Video pre-processing with `/tools/processing.py` (mp4 to jpg, add n_frames information, generate the annotation file in JSON format, and mp4 to mp3); an illustrative sketch of these steps appears after the download links below.
- We also provide the processed dataset, including VideoEmotion8-imgs (split by FFmpeg) and VideoEmotion8-mp3, so that you can train your own model more easily:
VideoEmotion8-imgs: here (extraction code: fhom)
VideoEmotion8-mp3: here (extraction code: 7tn3)
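If you prefer to pre-process the videos yourself, the sketch below illustrates the four steps listed above (frame extraction, n_frames, JSON annotation, mp3 extraction). It is not the actual `/tools/processing.py`; the directory names, the n_frames file convention, and the annotation layout are assumptions, so treat the official script as authoritative.

```python
# Illustrative sketch of the pre-processing steps; not the official /tools/processing.py.
# Paths, the n_frames convention, and the annotation layout here are assumptions.
import json
import subprocess
from pathlib import Path

def process_video(mp4_path: Path, img_root: Path, mp3_root: Path) -> dict:
    img_dir = img_root / mp4_path.stem
    img_dir.mkdir(parents=True, exist_ok=True)
    # 1) mp4 -> jpg frames
    subprocess.run(["ffmpeg", "-i", str(mp4_path), str(img_dir / "image_%05d.jpg")],
                   check=True)
    # 2) add n_frames information (count of extracted frames)
    n_frames = len(list(img_dir.glob("*.jpg")))
    (img_dir / "n_frames").write_text(str(n_frames))
    # 3) mp4 -> mp3 (audio track only)
    mp3_root.mkdir(parents=True, exist_ok=True)
    subprocess.run(["ffmpeg", "-i", str(mp4_path), "-vn",
                    str(mp3_root / (mp4_path.stem + ".mp3"))], check=True)
    return {"video": mp4_path.stem, "n_frames": n_frames}

# 4) generate an annotation file in JSON format (structure is only a placeholder)
entries = [process_video(p, Path("images"), Path("mp3"))
           for p in Path("videos").glob("*.mp4")]
Path("annotation.json").write_text(json.dumps(entries, indent=2))
```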
- resnet-101-kinetics.pth: pre-trained model download here (extraction code:0bi8)
- save_30.pth: trained model download here (extraction code:uq82)
- ve8_01.json: download here (extraction code:s567)
Assume the structure of the data directories is the following:
```
~/
├── data/
│   └── Joy/
│       └── .../ (video name)
│           ├── images/ (jpg files)
│           └── mp3/
│               └── ... (mp3 file)
├── results/
├── resnet-101-kinetics.pth
├── save_30.pth
└── ve8_01.json
```
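Before launching the demo, you can sanity-check that these files are in place (a small sketch; the file names come from the layout above, adjust `root` if your data lives elsewhere):

```python
# Check that the paths from the directory layout above exist.
from pathlib import Path

root = Path.home()  # adjust to wherever "~/" points in your setup
for name in ["data", "results", "resnet-101-kinetics.pth", "save_30.pth", "ve8_01.json"]:
    path = root / name
    print(f"{'ok' if path.exists() else 'MISSING':8s}{path}")
```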
Confirm all options in `~/opts.py`, then run:

```
python Emotion.py
```
See the next section for details.
To see another branch, click here: Tutorial (Chinese version).