Issues: mozilla/translations

Issues list

dataset-hplt-mono_v1_2-zh failed bug Something is broken or not correct
#944 opened Nov 26, 2024 by eu9ene
Japanese is missing in OpusCleaner
#943 opened Nov 25, 2024 by eu9ene
MTData fails to unpack some datasets bug Something is broken or not correct
#942 opened Nov 25, 2024 by eu9ene
Autogenerated config doesn't work blocker Very important issue that blocks training
#941 opened Nov 25, 2024 by eu9ene
Corpora exclusion rules language-coverage Issues related to covering specific languages
#940 opened Nov 25, 2024 by ZJaume
Check for float16 precision support when running translate-* tasks on-prem Running the pipeline on-premises machines
#936 opened Nov 20, 2024 by gregtatum
Linters needs to ignore node_modules bug Something is broken or not correct inference
#932 opened Nov 15, 2024 by gregtatum
Experiment with distillation data inference experiment A training experiment with hypothesis and results
#931 opened Nov 15, 2024 by gregtatum
Use PyMarian for COMET evaluations cost & perf Speeding up and lowering cost for the pipeline
#929 opened Nov 13, 2024 by marco-c
Single-side deduplication quality Improving robustness and translation quality
#928 opened Nov 13, 2024 by ZJaume
Create an analyze-datasets step in the pipeline quality Improving robustness and translation quality
#924 opened Nov 6, 2024 by gregtatum
Investigate merging document sentences in HPLT quality Improving robustness and translation quality
#923 opened Nov 6, 2024 by eu9ene
Reduce monolingual data for en-lt to investigate distillation performance experiment A training experiment with hypothesis and results
#915 opened Oct 31, 2024 by gregtatum
Allow for split vocabs language-coverage Issues related to covering specific languages quality Improving robustness and translation quality
#913 opened Oct 30, 2024 by gregtatum
[meta] Kick off a 2024-H2 training run meta A collection of sub-issues that uses a tasklist
#912 opened Oct 30, 2024 by gregtatum
More corpora specific fixes quality Improving robustness and translation quality
#910 opened Oct 30, 2024 by ZJaume
Limit the amount of data used for distillation cost & perf Speeding up and lowering cost for the pipeline
#905 opened Oct 29, 2024 by gregtatum
Check if issues with short sentences were caused by bicleaner hard rules quality Improving robustness and translation quality
#903 opened Oct 24, 2024 by eu9ene
Investigate word-based filtering for CJK language-coverage Issues related to covering specific languages
#899 opened Oct 23, 2024 by eu9ene
Add support for Chinese Traditional language-coverage Issues related to covering specific languages
#896 opened Oct 22, 2024 by eu9ene
Experiment with student model parameters experiment A training experiment with hypothesis and results quality Improving robustness and translation quality
#894 opened Oct 22, 2024 by gregtatum