[WIP] VoiceSmith makes training text to speech models easy.
-
Updated
Oct 10, 2022 - Python
[WIP] VoiceSmith makes training text to speech models easy.
Python library for handling audio datasets.
Tool for reading and writing datasets of tensors in a Lightning Memory-Mapped Database (LMDB). Designed to manage machine learning datasets with fast reading speeds.
Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework
A tool for downloading from public image boards (which allow scraping) / preview your images & tags / edit your images & tags. Additional tabs for downloading other desired code repositories as well as S.O.T.A. diffusion and auto-tag/caption models for your purposes. Custom datasets can be added!
An open source tool for large-scale EEG datasets processing
Data preparation code for building Kaldi ASR system
A single library to (down)load all existing sign language handshape datasets.
Access to data for workshops and extended tests of MDAnalysis.
Machine learning library for classification tasks
Scripts to automatize and standardize dataset handling
Automate ML dataset labelling
A tool to download and format PASCAL VOC 2007 dataset for multilabel classification
Machine learning library for classification tasks
A single library to (down)load all existing sign language video datasets.
Extraction tool to parse MS Celeb dataset
🚀 Whenever you need to look through huge pile of images and cannot use force of file explorer, or you just work on a remote headless machine, you can use this tool. It also allows to move files from one folder to another, creating destination if it does not exist. Work in progress.
A tool to download and format NUS-WIDE dataset for multilabel classification
A tool to download and format MS COCO dataset for multilabel classification
Add a description, image, and links to the dataset-manager topic page so that developers can more easily learn about it.
To associate your repository with the dataset-manager topic, visit your repo's landing page and select "manage topics."