# streaming-asr

Here are 23 public repositories matching this topic...

modelscope / FunASR

Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.

Updated Jun 29, 2026
Python

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Updated Jun 21, 2026
Python

yeyupiaoling / PPASR

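End-to-end Chinese speech recognition built on PaddlePaddle, from first steps to real-world use, with very simple starter examples and practical enterprise projects. Supports the currently popular DeepSpeech2, Conformer, and Squeezeformer models.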

deep-learning speech speech-recognition chinese speech-to-text asr paddlepaddle conformer deepspeech2 streaming-asr squeezeformer

Updated Dec 17, 2025
Python

soniqo / speech-core

On-device VAD / streaming STT / TTS / diarization in C++17 (ONNX + LiteRT) with a voice-agent pipeline. Linux, Windows, Android.

Updated Jun 29, 2026
C++

VoxRT / voxrt-asr-android

Streaming on-device speech recognition for Android — NEON-accelerated, encrypted FastConformer (32M params), ~150 ms latency, no cloud. Powered by the VoxRT runtime.

android kotlin neon speech-recognition jni speech-to-text arm64 nemo asr ctc on-device offline-speech-recognition rnn-t streaming-asr fastconformer voxrt

Updated Jun 29, 2026
Kotlin

nguyentuansi / nemotron-asr-realtime-translate

Nemotron ASR streaming + real-time translation. 19 source × 200 target languages, runs on a MacBook CPU. Vi↔En foundation, any pair via flags. No cloud.

real-time translation offline vietnamese speech-recognition nemo asr onnxruntime streaming-asr ctranslate2 nllb

Updated Jun 25, 2026
Python

VoxRT / voxrt-asr-ios

Streaming on-device speech recognition for iOS — NEON-accelerated, encrypted FastConformer (32M params), RTF 0.08–0.10 on iPhone 13 Pro Max. Built on the VoxRT custom Rust inference runtime. SwiftPM distribution.

swift ios real-time swift-package-manager speech-recognition spm nemo ctc on-device offline-speech-recognition rnn-t streaming-asr fastconformer voxrt

Updated Jun 29, 2026
Swift

mllpresearch / Europarl-ASR

A 1300-hour English speech and text corpus of parliamentary debates for streaming ASR training and benchmarking, speech data filtering and speech data verbatimization.

automatic-speech-recognition speech-corpus streaming-asr speech-data-filtering speech-data-verbatimization

Updated Mar 30, 2024

VoxRT / voxrt-asr-models

Pre-compiled ASR model weights for the VoxRT on-device runtime. Encrypted .vxrt v2 format. streaming-medium-pc: FastConformer 32M, CTC + RNN-T, CC-BY-4.0 (NVIDIA NeMo).

android ios speech-recognition nemo on-device model-weights asr-model streaming-asr voxrt encrypted-weights

Updated May 28, 2026

nirnaim / faster-whisper-server

Faster-Whisper Transcription Server & API is a production-ready speech-to-text micro-service stack that wraps faster-whisper with a streaming FastAPI server, a Celery/Redis background queue, and optional Docker deployment—delivering real-time or batch audio transcription with minimal latency and simple web-hook integration.

python docker machine-learning ai speech-recognition celery server-sent-events speech-to-text whisper fastapi streaming-asr

Updated Nov 1, 2025
Python

gydx6 / doubao-asr2-openai-proxy

OpenAI-compatible proxy bridging the Doubao/Volcengine ASR 2.0 (Seed-ASR) WebSocket protocol to /v1/audio/transcriptions; works with Spokenly and other OpenAI-compatible clients.

proxy speech-to-text whisper asr bytedance streaming-asr volcengine doubao openai-compatible spokenly seed-asr

Updated Mar 1, 2026
JavaScript

masasibata / t-one-rest-api

Production-ready REST API for Russian speech recognition using T-one model. FastAPI-based service with offline and streaming transcription support.

python rest-api speech-recognition speech-to-text asr fastapi streaming-asr t-one russian-speech-recognition

Updated Feb 6, 2026
Python

gonsalet / ASR_and_MT_for_educational_parliamentary_and_broadcast_media

PhD Thesis: "Automatic speech recognition and machine translation with deep neural networks for open educational resources, parliamentary contents and broadcast media" (2024)

automatic-speech-recognition neural-machine-translation open-educational-resources streaming-asr speech-data-filtering speech-data-verbatimization parliamentary-contents live-broadcast-media-subtitling europarl-asr-speech-corpus

Updated Dec 5, 2024

wenquan222 / DoubaoVoice

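Windows desktop voice input tool for Doubao: global-hotkey recording → Volcengine streaming ASR → automatic paste at the cursor. Natively supports Doubao platform hotword table IDs.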

desktop-app python windows speech-to-text asr global-hotkey voice-input streaming-asr volcengine doubao

Updated Apr 25, 2026
Python

APushingBoy / ReverieFlow

A streaming ASR speech recognition tool.

asr streaming-asr

Updated May 10, 2026
Python

Gern713 / voice_input_tool

Lightweight Windows voice input tool with offline streaming ASR, hotwords, and AI text correction

speech-recognition windows-desktop glm hotwords voice-input pyside6 streaming-asr funasr chinese-asr

Updated Jun 9, 2026
Python

FUYOH666 / ai-agent-tts

Low-latency voice AI agent platform with streaming ASR/TTS, FSM-based dialog management, and microservices architecture. Built with FastAPI, LangGraph, vLLM, and F5-TTS.

Updated Dec 9, 2025
Python

mweinbach / NemotronCoreAI

SwiftPM streaming ASR runtime for NVIDIA Nemotron 3.5 on Apple CoreAI

swift speech-recognition apple-silicon streaming-asr coreai

Updated Jun 21, 2026
Swift

0xPD33 / speechcore

Reusable Rust speech-to-text runtime with audio capture, VAD, backend selection, model provisioning, and transcript streaming.

rust machine-learning speech-recognition vad speech-to-text transcription whisper asr moonshine voice-activity-detection onnx parakeet streaming-asr silero-vad ctranslate2 nemotron

Updated Jun 22, 2026
Rust

Orange-OpenSource / sens-asr

Injecting semantic in Streaming Automatic Speech Recognition models

neural-transducer streaming-asr semantic-representation

Updated Jun 11, 2026
Python
