Inquiry Regarding Dataset and Training Details for Ganji's Voice in the Pertts-Streamlit Project #9
Replies: 1 comment
-
Hello and greetings, and thank you for your kind words about this project.
Because the Ganji model was trained on top of the earlier "Amir" model, it reached the result I wanted after roughly 2000 epochs. In practice, you should check the loss on your training data rather than rely on a fixed epoch count.
Unfortunately, no. I was not the only contributor to this dataset, and based on discussions with my teammates, I do not have permission to publish it.
I trained with the vits method from piper. I also tried other approaches, such as coqui, but I, at least, did not get good results with them (though coqui does offer a wider range of methods).
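As a rough illustration of the advice above about checking the training loss instead of a fixed epoch count, here is a minimal sketch in Python. It is not part of piper or coqui: the function name `has_plateaued`, the `window` size, and the `min_rel_improvement` threshold are all hypothetical, and it assumes you have already collected one loss value per epoch into a plain list.

```python
from statistics import mean


def has_plateaued(losses, window=50, min_rel_improvement=0.01):
    """Return True when the average loss has stopped improving.

    Hypothetical helper: compares the mean loss over the most recent
    `window` epochs with the mean over the `window` epochs before that.
    """
    # Not enough history to compare two full windows yet.
    if len(losses) < 2 * window:
        return False

    previous = mean(losses[-2 * window:-window])
    recent = mean(losses[-window:])

    # Guard against division by zero when the previous mean is ~0.
    denom = max(abs(previous), 1e-12)
    relative_improvement = (previous - recent) / denom
    return relative_improvement < min_rel_improvement


if __name__ == "__main__":
    # Made-up loss curve for demonstration only.
    example_losses = [1.0 / (epoch + 1) for epoch in range(200)]
    print(has_plateaued(example_losses))
```

A check like this only says that the numbers have flattened out. Listening to synthesized samples from the saved checkpoints is still the more reliable way to judge whether the voice is good enough.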
-
Hello,
First and foremost, I would like to express my sincere appreciation to the Datacula team and dear Sadegh for all their hard work.
Upon reviewing your project, I came across a few questions that I hope will help me contribute more effectively to the development of the pertts-streamlit project on Git, as well as progress with my own project.
Based on my test of https://tts.datacula.com/, Mr. Ganji’s voice appears to be of higher quality than Amir’s, with most sentences delivered more clearly. With that in mind, I reviewed the dataset you have made available for the project and noticed that Amir’s dataset is included but Ganji’s is not.
From my examination of Amir’s dataset, it seems to contain around 10 hours of audio, with the texts seemingly having been corrected using a tool (the texts differed from the transcriptions), which raised a few questions. I would greatly appreciate it if you could kindly provide some clarification on the following:
Thank you very much in advance for your time and assistance. I look forward to your response.