Speech To Text

Speech To Text skills and workflows surfaced by the site skill importer.

3 skills
transcribe

by openai

transcribe turns audio or video into text, with optional diarization and known-speaker hints. It suits technical writing, meeting notes, interviews, lectures, and content operations when you need a repeatable transcription workflow with clear output formats and less guesswork than a generic prompt.

Technical Writing
Favorites 0 | GitHub 18.8k
azure-speech-to-text-rest-py

by microsoft

azure-speech-to-text-rest-py is a Python skill for transcribing short audio through the Azure Speech REST API, without the Speech SDK. Use it in backend development when you need direct HTTP control, fast setup, and support for audio files up to 60 seconds. The guide covers installation, authentication, and audio formatting, and notes the cases this approach does not fit: long audio, streaming, and batch transcription.

Backend Development
Favorites 0 | GitHub 2.3k
detecting-deepfake-audio-in-vishing-attacks

by mukul975

detecting-deepfake-audio-in-vishing-attacks helps security teams analyze audio for AI-generated speech in vishing, fraud, and impersonation cases. It extracts spectral and MFCC-based features, scores suspicious samples, and produces a forensic-style report for review. It is suited to security audit and incident response workflows.

Security Audit
Favorites 0 | GitHub 0