Is it possible to retrain on mistakes? #44
Comments
That sounds like a great idea. It'll be really difficult, though, because we can't ensure that the audio is split correctly, and the time offsets have to be perfect. It's also not practical to fine-tune on a single sample. Do you have any approaches in mind?
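To make the time-offset concern concrete: a corrected subtitle file already carries a start and end time for each cue, so each cue could in principle be paired with the matching audio span as a training sample. The following is only a minimal sketch, assuming the corrections arrive as standard SRT text. `Cue` and `parse_srt` are hypothetical names for illustration and are not part of AutoSub.

```python
import re
from typing import List, NamedTuple


class Cue(NamedTuple):
    start: float  # cue start, in seconds
    end: float    # cue end, in seconds
    text: str     # (corrected) transcript for this span


# SRT timestamps look like 00:01:02,250 (some tools write a dot instead of a comma).
_TIME = re.compile(r"(\d+):(\d{2}):(\d{2})[,.](\d{3})")


def _to_seconds(stamp: str) -> float:
    match = _TIME.fullmatch(stamp.strip())
    if match is None:
        raise ValueError(f"bad timestamp: {stamp!r}")
    hours, minutes, seconds, millis = (int(g) for g in match.groups())
    return hours * 3600 + minutes * 60 + seconds + millis / 1000.0


def parse_srt(srt_text: str) -> List[Cue]:
    """Parse SRT text into a list of (start, end, text) cues."""
    cues = []
    # Cues are separated by one or more blank lines.
    for block in re.split(r"\n\s*\n", srt_text.strip()):
        lines = [ln for ln in block.splitlines() if ln.strip()]
        # The timing line is the one containing "-->"; skip malformed blocks.
        idx = next((i for i, ln in enumerate(lines) if "-->" in ln), None)
        if idx is None:
            continue
        start_s, end_s = lines[idx].split("-->")
        text = " ".join(ln.strip() for ln in lines[idx + 1:])
        cues.append(Cue(_to_seconds(start_s), _to_seconds(end_s), text))
    return cues
```

Each resulting cue gives the offsets needed to cut the audio, but the sketch does nothing to check that those offsets actually line up with the speech, which is the hard part described above.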
Hmm, maybe make it more transparent how the audio is split? Also, how does the audio splitting work? Is it also an AI?
I segment on the silent parts of the audio by adapting some code from this project. It's not an AI. We can fine-tune the params while splitting, but it's not a one-size-fits-all solution.
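For readers wondering what non-AI silence segmentation can look like, here is a rough sketch of a generic energy-threshold splitter. It is not the code AutoSub uses. The function name `split_on_silence` and the parameters `frame_ms`, `threshold`, and `min_silence_ms` are assumptions chosen for illustration, and they stand in for the kind of params that would need tuning per recording.

```python
from typing import List, Sequence, Tuple


def split_on_silence(samples: Sequence[float], sample_rate: int,
                     frame_ms: int = 20, threshold: float = 0.02,
                     min_silence_ms: int = 300) -> List[Tuple[float, float]]:
    """Return (start_sec, end_sec) spans of non-silent audio.

    samples are assumed to be mono amplitudes scaled to [-1.0, 1.0].
    """
    frame_len = max(1, sample_rate * frame_ms // 1000)
    min_silent_frames = max(1, min_silence_ms // frame_ms)

    # Mark each frame as voiced when its RMS energy reaches the threshold.
    voiced = []
    for i in range(0, len(samples), frame_len):
        frame = samples[i:i + frame_len]
        rms = (sum(x * x for x in frame) / len(frame)) ** 0.5
        voiced.append(rms >= threshold)

    segments = []
    start = None     # frame index where the current segment began
    silent_run = 0   # consecutive silent frames seen inside a segment
    for idx, is_voiced in enumerate(voiced):
        if is_voiced:
            if start is None:
                start = idx
            silent_run = 0
        elif start is not None:
            silent_run += 1
            # Close the segment only once the pause is long enough.
            if silent_run >= min_silent_frames:
                end = idx - silent_run + 1  # index of the first silent frame
                segments.append((start * frame_len / sample_rate,
                                 end * frame_len / sample_rate))
                start, silent_run = None, 0

    # Flush a segment that runs to the end of the audio.
    if start is not None:
        end = len(voiced) - silent_run
        segments.append((start * frame_len / sample_rate,
                         min(end * frame_len, len(samples)) / sample_rate))
    return segments
```

A fixed threshold like this works on clean recordings but tends to fail with background music or noise, which is one reason a single set of params is not a one-size-fits-all solution.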
I am very happy to see your work. It really took a lot of effort.
Hi
With Google's speech recognition, you can check that the model outputs timing information for the detected speech segments.
I want to process long video files, but it seems that the program can only process WAV files of up to 15 seconds. I also want to translate the subtitles into several other languages, and the same limitation applies there. I look forward to seeing open-source programs for these functions from you.
We really have no choice but to use offline speech recognition and translation programs. In the face of deliberate misleading and harm, we can only use software and platforms that are out of their control. Now I use Gettr.
Thank you very much for providing open-source software that goes all the way from video and audio files to subtitle files. I installed and used it, but I don't know which language it is set to recognize. I use English video files, but the results seem poor. Can you tell me? Thank you so much.
Or can you tell me how to change the translation module to support other languages? Thanks a lot.
I plan on implementing either one of Wav2Vec or NeMo, but will need some time. Also, AutoSub can process large video files too. It automatically segments the audio into smaller chunks.
Thank you very much for your guidance and hope to see the new program written by you soon.
Following your suggestion, I downloaded the corresponding model and tried to run it, but the following error occurs. If it's convenient, please test it to see how to get the model to work. Thank you so much.
(sub) (base) gettr@gettr:~/AutoSub$ python3 autosub/main.py --model deepspeech-0.9.3-models-zh-CN.pbmm --scorer deepspeech-0.9.3-models-zh-CN.scorer --file ~/3-720.mp4
Running inference:
———————————————————————————————————————————— |
Given an incorrect subtitle, would it be possible to supply a corrected one and retrain on it?