Telegram-ASR-Bot

A telegram ASR (Speech Recognition, or Speech-to-Text) bot based on python-telegram-bot and Google Cloud Speech.

Usage

/start to get a smoke test.
/clear to clear local voice cache.
Feel free to send voices, the bot will watch these messages and process them automatically.

Prerequisite

External

mediainfo¹

Python Packages

[tool.poetry.dependencies]
python = "^3.9"
python-telegram-bot = "^13.11"
google-cloud-speech = "^2.13.1"
environs = "^9.5.0"
pymediainfo = "^5.1.0"
google-cloud-storage = "^2.1.0"

Google Cloud Configurations

Speech-to-Text

A Service Account with Cloud Speech Client and Storage Object Viewer roles granted.

Cloud Storage

This is optional if you do not need long speech recognition (voice duration > 60s).²

A Service Account with Storage Object Admin role granted.
- Can be narrowed to Storage Object Creator if set DELETE_BUCKET_VOICE=False.
A valid storage bucket under control.

Telegram BotFather

/setprivacy Turn disabled.
/setcommands

start - Check bot
clear - Clear local cache

Whitelist Group ID Lookup

Please refer to https://github.com/GabrielRF/telegram-id .

.env Example

# API Key
TG_API_KEY=""
GCLOUD_SPEECH_CREDENTIALS="/Users/asr-speech.json"
GCLOUD_BUCKET_CREDENTIALS="/Users/asr-storage.json"

# Bot Options
ALLOW_PRIVATE=False
ALLOW_GROUP=True
ENABLE_GROUP_WHITELIST=True
GROUP_WHITELIST="-1001145141919,-1001234567890"
ACCEPT_BOT_VOICE=False
ACCEPT_FORWARD_VOICE=False
DELETE_LOCAL_VOICE=True

# Voice Options
PREFER_LANGUAGE="en-US"
MULTIPLE_LANGUAGE_DETECT=True
OPTIONAL_LANGUAGE="zh,ja-JP,yue-Hant-HK"
ENABLE_WORD_CONFIDENCE=False
ENABLE_PUNCTUATION=True

# Storage Bucket Options
ENABLE_BUCKET=True
BUCKET_NAME="my-asr-bucket"
DELETE_BUCKET_VOICE=True

# Message
START_MSG="Start the bot."
NOT_ALLOWED_MSG="This chat type is not allowed."
NOT_IN_WHITELIST_MSG="This group is not in the whitelist."
DENY_BOT_MSG="Voice from another bot is rejected to process."
DENY_FORWARD_MSG="Voice from forward message is rejected to process."
PLACEHOLDER_MSG="Processing..."
CLEAR_MSG="The local voice cache has been successfully deleted."
EMPTY_RESULT_MSG="I can't hear you clearly."
DENY_LONG_VOICE_MSG="Voices that greater than 60 secs needs a Google Cloud Storage service."

pymediainfo requires libmediainfo-dev to detect the sample rate of voice messages. ↩
https://cloud.google.com/speech-to-text/docs/async-recognize ↩

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.env.sample		.env.sample
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
bot.py		bot.py
exception.py		exception.py
main.py		main.py
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
speech.py		speech.py
storage.py		storage.py
voice_manager.py		voice_manager.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Telegram-ASR-Bot

Usage

Prerequisite

External

Python Packages

Google Cloud Configurations

Speech-to-Text

Cloud Storage

Telegram BotFather

Whitelist Group ID Lookup

.env Example

About

Releases

Packages

Contributors 2

Languages

License

FrozenYogurtPuff/Telegram-ASR-Bot

Folders and files

Latest commit

History

Repository files navigation

Telegram-ASR-Bot

Usage

Prerequisite

External

Python Packages

Google Cloud Configurations

Speech-to-Text

Cloud Storage

Telegram BotFather

Whitelist Group ID Lookup

.env Example

Footnotes

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages