A Large-scale Multilingual Benchmark Dataset for Automated Translation of Bangla Regional Dialects to Bangla Language
Updated Feb 4, 2024 - Jupyter Notebook
This study addresses the gap in translating Bangla regional dialects into standard Bangla by creating a large-scale multilingual benchmark dataset of 32,500 sentences in Bangla, Banglish, and English, covering five regional Bangla dialects: Sylheti, Chittagong, Mymensingh, Noakhali, and Barishal.
Bengali News Article Summarization