Real-time transcription using faster-whisper
-
Updated
Jul 23, 2024 - HTML
Real-time transcription using faster-whisper
A web ui application that utilizes the stream-translator-gpt
VAD->STT web client based on guillaumekln/faster-whisper & ricky0123/vad
A multilingual automatic speech recognition and video captioning tool using faster whisper. Supports real-time translation to english. Runs on consumer grade cpu.
🎙️ 100% Local AI Transcription with Speaker Diarization — No API key, no cloud, no cost. Supports 99+ languages, dual engine (CPU + Apple Silicon GPU), exports to SRT/TXT/DOCX.
Ultra-low latency, voice-to-voice translation and playback server running entirely on consumer GPUs using FastAPI, faster-whisper, Ollama (llama3.2), and kokoro-onnx
real-time speech captions in the browser via WebSocket + faster-whisper
YT-AI-NOTES - Fastapi based web application that converts YouTube videos into professional, well-structured study notes or articles using AI. It extracts audio from a given YouTube URL, transcribes the speech to text, and leverages advanced AI models to generate well-structured, Notes—making video-to-text publishing efficient and automated.
Open source voice assistant inspired by Jarvis. Powered by Claude with real-time STT, streaming TTS, wake-word detection, and a live web dashboard. Turkish-first, self-hosted.
A self-hosted, Flask-powered web application to automatically generate subtitles for any video (from upload or URL) using faster-whisper and stream it via HLS.
Real-time speech-to-text using Faster-Whisper and a web interface.
Fully-offline desktop app that turns a voice recording + script into a finished vertical short — local Whisper alignment & libass subtitle burn-in. Electron + Python.
Phone voice dictation + remote control for Claude Code — speak, edit, send, switch parallel agent sessions & see which agent needs you, from your pocket. 100% local (faster-whisper), macOS, no cloud.
Free open-source Opus Clip alternative. Auto-caption & auto-edit raw clips into vertical 9:16 reels with faster-whisper + ffmpeg, via one Claude Code command. Burned-in word captions, hook card, local, no watermark, no subscription. For creators & indie hackers.
An AI-powered desktop companion for video editors that automates transcription, story segmentation, B-Roll recommendations, and YouTube chapter generation using both local and cloud-based LLMs.
Windows local dictation tool — push-to-talk F6, optimized for Québec French / franglais. Whisper local, no subscription, no Dragon required.
Add a description, image, and links to the faster-whisper topic page so that developers can more easily learn about it.
To associate your repository with the faster-whisper topic, visit your repo's landing page and select "manage topics."