CSL-video-caption

We aim to generate better video captions by feeding sound and sign language into the LSTM as additional inputs.

This project builds on previous work that added sound to video captioning.
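
As a rough illustration of feeding several modalities into an LSTM, the sketch below concatenates per-frame visual, audio, and sign-language feature vectors (early fusion) and runs the fused sequence through a tiny pure-Python LSTM cell. This is not the project's actual code: the fusion strategy, the feature sizes, and the names `fuse_features`, `TinyLSTMCell`, and `encode` are all assumptions made for the example, and the weights are random rather than trained.

```python
import math
import random


def fuse_features(visual, audio, sign):
    """Early fusion (an assumption): concatenate the per-frame vectors of each modality."""
    if not (len(visual) == len(audio) == len(sign)):
        raise ValueError("all modalities must have the same number of time steps")
    return [v + a + s for v, a, s in zip(visual, audio, sign)]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class TinyLSTMCell:
    """Hypothetical minimal LSTM cell with random, untrained weights."""

    def __init__(self, input_size, hidden_size, seed=0):
        rng = random.Random(seed)
        self.hidden_size = hidden_size
        n = input_size + hidden_size
        # One weight matrix and bias per gate: input (i), forget (f),
        # candidate (g), and output (o).
        self.W = {
            gate: [[rng.uniform(-0.1, 0.1) for _ in range(n)] for _ in range(hidden_size)]
            for gate in "ifgo"
        }
        self.b = {gate: [0.0] * hidden_size for gate in "ifgo"}

    def _affine(self, gate, z):
        return [
            sum(w * x for w, x in zip(row, z)) + bias
            for row, bias in zip(self.W[gate], self.b[gate])
        ]

    def step(self, x, h, c):
        z = x + h  # concatenate the current input with the previous hidden state
        i = [sigmoid(v) for v in self._affine("i", z)]
        f = [sigmoid(v) for v in self._affine("f", z)]
        g = [math.tanh(v) for v in self._affine("g", z)]
        o = [sigmoid(v) for v in self._affine("o", z)]
        c_new = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c, i, g)]
        h_new = [oj * math.tanh(cj) for oj, cj in zip(o, c_new)]
        return h_new, c_new


def encode(sequence, cell):
    """Run the fused sequence through the cell and return the final hidden state."""
    h = [0.0] * cell.hidden_size
    c = [0.0] * cell.hidden_size
    for x in sequence:
        h, c = cell.step(x, h, c)
    return h


# Toy features for two frames; the sizes (2, 1, 3) are made up for the example.
visual = [[0.1, 0.2], [0.3, 0.4]]
audio = [[0.5], [0.6]]
sign = [[0.7, 0.8, 0.9], [1.0, 1.1, 1.2]]

fused = fuse_features(visual, audio, sign)
cell = TinyLSTMCell(input_size=6, hidden_size=4)
state = encode(fused, cell)
```

In a real captioning model the final state (or the full sequence of hidden states) would be passed to a decoder that emits the caption words. Other fusion schemes, such as a separate encoder per modality, are equally plausible.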