Tracking the progress in end-to-end speech translation
Updated Oct 25, 2023
A PyPI package for fast word/character error rate (WER/CER) calculation
Code for the paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)
Praaline is an open-source system to manage, annotate, visualise and analyse spoken language corpora
A recognition system for the ten spoken digits, based on DTW, HMM and GMM
An implementation of the paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech 2021)
Software that analyzes speech utterances
This project demonstrates the use of generic bi-directional LSTM models for predicting the importance of words in a spoken dialogue for understanding its meaning. The model is trained and evaluated on a human-annotated corpus of word importance. The corpus can be downloaded from: http://latlab.ist.rit.edu/lrec2018
Example code for my PhD work on recognizing dimensional emotions in spoken dialogue
All NLP-related courses on DataCamp
Code for the paper "Learning English with Peppa Pig" https://doi.org/10.48550/arXiv.2202.12917
Speech subtask of the 2017 NLI Shared Task
Convex combination of phonotactics for large-scale spoken language identification
The Ruby Programming Language
🚧 The Internet+ project YiLuYuBan. The project is too messy and has moved to https://github.com/wanghao15536870732/ChatWithChinese
Repository of the paper: "Spoken Language Intelligence of Large Language Models for Language Learning"
RNN for Spoken Language Understanding
A guide to spoken language processing