Speech To Text

Speech To Text skills and workflows surfaced by the site skill importer.

5 skills
transcribe

by openai

transcribe turns audio or video into text, with optional diarization and known-speaker hints. It is well suited to Technical Writing, meeting notes, interviews, lectures, and content ops when you need repeatable transcription with clear output formats and less guesswork than a generic prompt.

Technical Writing
Favorites 0 | GitHub 18.8k
azure-speech-to-text-rest-py

by microsoft

azure-speech-to-text-rest-py is a Python skill for short-audio transcription through the Azure Speech REST API, without the Speech SDK. Use it for backend development when you need direct HTTP control, fast setup, and support for audio files up to 60 seconds. The guide covers installation, authentication, and audio formatting, and explains when to avoid this approach: long audio, streaming, or batch transcription.

Backend Development
Favorites 0 | GitHub 2.3k
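
To illustrate what "direct HTTP control" can look like for the entry above, here is a minimal sketch using only the Python standard library. It is not taken from the skill itself. The endpoint path and header names follow the Azure Speech short-audio REST API as commonly documented, but the region, key, and file name are placeholders, and the helper names `wav_duration_seconds` and `build_request` are hypothetical. The 60-second check reflects the limit stated in the description.

```python
import urllib.parse
import urllib.request
import wave

MAX_SECONDS = 60  # limit stated in the skill description


def wav_duration_seconds(path):
    """Return the duration of a WAV file in seconds (hypothetical helper)."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def build_request(region, key, audio_bytes, language="en-US"):
    """Build (but do not send) a POST request for short-audio recognition.

    Assumes 16 kHz PCM WAV input; adjust Content-Type for other formats.
    """
    query = urllib.parse.urlencode({"language": language, "format": "simple"})
    url = (
        f"https://{region}.stt.speech.microsoft.com"
        f"/speech/recognition/conversation/cognitiveservices/v1?{query}"
    )
    headers = {
        "Ocp-Apim-Subscription-Key": key,
        "Content-Type": "audio/wav; codecs=audio/pcm; samplerate=16000",
        "Accept": "application/json",
    }
    return urllib.request.Request(
        url, data=audio_bytes, headers=headers, method="POST"
    )


# Usage sketch (placeholders; requires a real key and a short WAV file):
#
# if wav_duration_seconds("clip.wav") > MAX_SECONDS:
#     raise ValueError("Audio too long for the short-audio endpoint")
# with open("clip.wav", "rb") as f:
#     req = build_request("westus", "<your-key>", f.read())
# with urllib.request.urlopen(req) as resp:
#     print(resp.read().decode("utf-8"))
```

Building the request separately from sending it keeps the duration check and the header setup easy to test without network access.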
speech-to-text

by NoizAI

The speech-to-text skill transcribes supported audio files into plain text, with options for timestamps, speaker labels, and JSON output. It is designed for repeatable workflows such as interviews, meetings, podcasts, lectures, and automation tasks where consistent transcription matters.

Workflow Automation
Favorites 0 | GitHub 498
tts

by NoizAI

The tts skill turns text into speech audio for narration, dubbing, voiceover, and timeline-aligned playback. Use it to generate a voice file from plain text, convert articles or text files to speech, or render SRT-driven audio with timing control. It supports simple and timeline modes, plus backend-aware workflows for repeatable use.

Voice Generation
Favorites 0 | GitHub 498
detecting-deepfake-audio-in-vishing-attacks

by mukul975

detecting-deepfake-audio-in-vishing-attacks helps security teams analyze audio for AI-generated speech in vishing, fraud, and impersonation cases. It extracts spectral and MFCC-based features, scores suspicious samples, and produces a forensic-style report for review. It is best suited to Security Audit and incident response workflows.

Security Audit
Favorites 0 | GitHub 0