video-translation
by NoizAI

The video-translation skill translates spoken content in a video into another language, generates TTS dubbing, and replaces or mixes the audio while keeping the video intact. It is best suited for practical use when you have a source video, subtitles, and a target language.
This skill scores 74/100, which means it is list-worthy but best presented with clear caveats. Directory users get a real, non-placeholder workflow for translating and dubbing videos, with explicit triggers and supporting scripts, but they should expect to do some integration work because the repo does not fully spell out an end-to-end install/run path.
- Explicit trigger phrases and use cases make it easy for an agent to recognize when to use the skill.
- The SKILL.md provides a concrete workflow for downloading subtitles, translating them sentence by sentence, and replacing the audio track.
- Supporting scripts for audio replacement and SRT ducking show real operational intent beyond a generic prompt.
- The workflow depends on another skill (`youtube-downloader`) and external tooling like ffmpeg, so installation and execution may require extra setup.
- There is no install command and the excerpted workflow is partially truncated, which reduces immediate out-of-the-box clarity for directory users.
Overview of video-translation skill
What video-translation does
The video-translation skill translates spoken content in a video into another language, generates dubbed audio with TTS, and replaces the original audio while keeping the video itself intact. It is best for users who have a specific video, a target language, and a desire to make the audio watchable rather than just machine-translated on screen.
Who should use it
This video-translation skill fits people localizing YouTube-style content, internal training clips, explainers, or any short-to-medium video where subtitle timing is available or can be extracted. It is less useful if you only need captions, if the source audio is too noisy for subtitle alignment, or if you want human-grade lip sync rather than a practical dubbed version.
What matters before install
The main decision point is workflow fit: video-translation assumes you can obtain the source video plus subtitles, translate subtitle text carefully, produce TTS audio, then mux the result back into the video. If your stack already includes video download, subtitle handling, and ffmpeg-based editing, the skill is a good fit; if not, expect extra setup around those dependencies.
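The final mux step described above can be sketched in a few lines. This is a minimal illustration, not the repo's actual implementation: the file names are placeholders, and only the ffmpeg flags themselves are standard. It assumes ffmpeg is on your PATH when actually executed.

```python
# Sketch of the final mux step: swap in the dubbed audio track while
# copying the video stream untouched. File names are hypothetical.
def build_mux_command(video: str, dubbed_audio: str, output: str) -> list[str]:
    return [
        "ffmpeg", "-y",
        "-i", video,          # input 0: original video
        "-i", dubbed_audio,   # input 1: TTS dub
        "-map", "0:v:0",      # keep the video stream from input 0
        "-map", "1:a:0",      # take the audio stream from input 1
        "-c:v", "copy",       # no re-encode: video stays intact
        "-shortest",          # stop at the shorter of the two inputs
        output,
    ]

cmd = build_mux_command("talk.mp4", "dub_en.wav", "talk_en.mp4")
print(" ".join(cmd))
```

Because `-c:v copy` avoids re-encoding, this step is fast and lossless for the visual track; only the audio changes.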
How to Use video-translation skill
Install and inspect the skill
Use video-translation install in the directory toolchain, or install from the repo path with npx skills add NoizAI/skills --skill video-translation. After install, read SKILL.md first, then check scripts/replace_audio.sh and scripts/srt_to_duck.py so you understand how the audio replacement and subtitle-driven ducking actually work.
Turn a rough request into a usable prompt
For best video-translation usage, provide the video URL or file path, source language, target language, and whether you want full dub replacement or mixed audio. A weak prompt is “translate this video”; a stronger one is: “Translate this Spanish YouTube video to English, generate natural-sounding English TTS, and replace the original audio while preserving subtitle timing and silence gaps.”
Practical workflow that matches the repo
The repo’s logic is: download the video and subtitles, translate the SRT sentence by sentence, generate dubbed audio, then replace or mix audio with ffmpeg. If subtitles exist, the helper script can duck the original audio during spoken segments, which usually sounds better than a hard cut. If subtitles are missing or misaligned, expect lower output quality because the timing layer is part of the value.
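Subtitle-driven ducking can be sketched as follows. This is a hypothetical illustration of the general idea, loosely mirroring what a helper like `srt_to_duck.py` might do; the 0.2 gain and the exact filter shape are assumptions, not the repo's values.

```python
# Turn SRT cue timestamps into an ffmpeg volume filter that lowers the
# original audio only while dubbed speech plays. Gain value is assumed.
def srt_time_to_seconds(ts: str) -> float:
    # "00:00:01,500" -> 1.5
    h, m, rest = ts.split(":")
    s, ms = rest.split(",")
    return int(h) * 3600 + int(m) * 60 + int(s) + int(ms) / 1000.0

def duck_filter(cues: list[tuple[str, str]], gain: float = 0.2) -> str:
    # One enable-window per spoken segment; outside them, volume stays 1.0.
    windows = "+".join(
        f"between(t,{srt_time_to_seconds(a):.3f},{srt_time_to_seconds(b):.3f})"
        for a, b in cues
    )
    return f"volume=enable='{windows}':volume={gain}"

f = duck_filter([("00:00:01,500", "00:00:04,000")])
print(f)  # volume=enable='between(t,1.500,4.000)':volume=0.2
```

The resulting string would be passed to ffmpeg via `-af`, which is why accurate subtitle timing matters so much: the ducking windows are only as good as the cues that generate them.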
What to check first in the repo
Start with `SKILL.md` for trigger intent, workflow order, and the translation prompt shape. Then open `scripts/replace_audio.sh` to see required flags like `--video`, `--audio`, `--output`, and optional `--srt`, and inspect `scripts/srt_to_duck.py` if you need to understand how subtitle timestamps are converted into ducking commands. Those two scripts tell you more about real usage than the high-level description alone.
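Based on the flags described above, an invocation of the replacement script might look like the sketch below. This is an assumption about the interface, not a confirmed API; the paths are placeholders, and you should verify the real script's flags before relying on it.

```python
import subprocess

# Hedged sketch: calling scripts/replace_audio.sh with the flags it is
# described as taking (--video, --audio, --output, optional --srt).
def replace_audio(video, audio, output, srt=None, dry_run=True):
    cmd = ["bash", "scripts/replace_audio.sh",
           "--video", video, "--audio", audio, "--output", output]
    if srt:
        cmd += ["--srt", srt]   # enables subtitle-driven ducking
    if dry_run:
        return cmd              # inspect the command before running it
    subprocess.run(cmd, check=True)
    return cmd

cmd = replace_audio("talk.mp4", "dub_en.wav", "talk_en.mp4", srt="talk.srt")
```

The `dry_run` default is a deliberate safety choice for a first pass: print and review the command before letting it touch your files.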
video-translation skill FAQ
Is video-translation just a prompt template?
No. The video-translation skill is a workflow-oriented setup, not just a wording hint. It depends on subtitle extraction, translation with stable SRT formatting, TTS generation, and audio replacement, so it is more operational than a generic “translate this video” prompt.
When is video-translation a good fit?
Use video-translation when the goal is dubbed playback in another language and the source video can be processed locally or through your existing tools. It is especially useful for educational videos, interviews, and narrated content where preserving the visual track matters more than perfect speech cloning.
What are the main limits?
The biggest limits are subtitle quality, audio quality, and timing alignment. If the source transcript is wrong, the translated dub will inherit those errors; if the TTS voice is unnatural, the result will still sound dubbed; and if the video has overlapping speakers, the ducking-based mix may not be clean.
Do beginners need extra tooling?
Yes, usually. video-translation assumes comfort with files, subtitles, and command-line video tools. If you are new, the skill can still help, but expect to review helper scripts and verify ffmpeg, subtitle, and TTS steps before trusting the first output.
How to Improve video-translation skill
Give better input, not just more input
The strongest video-translation guide starts with clear source and target languages, the exact video file or URL, and the intended audience. Say whether you want formal or colloquial speech, whether names and technical terms should stay untranslated, and whether the final output should preserve pauses for natural timing.
Reduce the common failure modes
Most weak results come from bad subtitles, untranslated proper nouns, or TTS that ignores punctuation and sentence boundaries. To improve translation quality, verify the SRT before dubbing, keep index and timestamp formatting unchanged, and split long subtitle lines into natural speech units before generating audio.
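A quick structural check on the translated SRT catches most formatting drift before it reaches the TTS step. The sketch below assumes standard SRT formatting (sequential integer indices, `HH:MM:SS,mmm --> HH:MM:SS,mmm` timing lines) and only validates structure, not the translation itself.

```python
import re

# Minimal pre-dub sanity check: sequential cue indices and well-formed
# timing lines. Does not judge the translated text.
TIMING = re.compile(
    r"^\d{2}:\d{2}:\d{2},\d{3} --> \d{2}:\d{2}:\d{2},\d{3}$"
)

def validate_srt(text: str) -> list[str]:
    errors = []
    blocks = [b for b in text.strip().split("\n\n") if b.strip()]
    for expected, block in enumerate(blocks, start=1):
        lines = block.splitlines()
        if not lines[0].strip().isdigit() or int(lines[0]) != expected:
            errors.append(f"cue {expected}: bad or out-of-order index")
        if len(lines) < 2 or not TIMING.match(lines[1].strip()):
            errors.append(f"cue {expected}: malformed timing line")
    return errors

sample = ("1\n00:00:01,000 --> 00:00:03,000\nHola\n\n"
          "2\n00:00:03,500 --> 00:00:05,000\nMundo")
print(validate_srt(sample))  # []
```

Running this after every translation pass makes it easy to reject an LLM output that renumbered cues or reformatted timestamps, which would otherwise silently break the dubbing timeline.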
Iterate after the first render
Treat the first pass as a timing test, not the final deliverable. If the dub sounds rushed, lengthen pauses in the source text or adjust sentence segmentation; if the mix is too aggressive, revisit the SRT-driven ducking behavior; if the wording feels literal, rewrite the subtitle translation prompt to demand colloquial, spoken-language output.
Use the scripts to sharpen quality
The repo’s helper scripts are a clue to what matters: timing, replacement, and stable audio switching. If you are improving the video-translation skill for repeated use, build a small checklist around subtitle accuracy, TTS voice choice, and final mux verification so the same errors do not recur on every video.
