We want to fetch a better caption via inputting sound and sign language into the LSTM.
Based on the previous work, which added sound to video caption.
We want to fetch a better caption via inputting sound and sign language into the LSTM.
Based on the previous work, which added sound to video caption.