Monthly Plan
Pay monthly, with the flexibility to scale up or down.
- Use Pro tools/templates
- Unlimited video synthesis
- Watermark-free video export
- Custom text/image watermark
- Local plugin API service
Zero server costs and zero data-breach risk. Generate professional-grade subtitles, podcasts, and short videos at lightning speed.
Choose the Pro plan that suits you best to unlock more resources.
Pay monthly, with the flexibility to scale up or down.
Annual discount for greater long-term savings.
One-time purchase, lifetime use.
Vidpai makes content creation simple!
Using local Whisper, FunASR, or Parakeet models, Vidpai automatically recognizes speech and generates precise subtitles. Quickly apply a variety of striking subtitle effects.
Convert text into natural, fluent English podcast videos. Multi-character dialogue is supported for professional-grade podcast content.
Working from a video's subtitles, the AI understands the video's content and answers natural-language questions about it. Quickly locate and extract key information, so that searching a long video is as simple as having a conversation.
AI automatically identifies multiple highlight segments and converts long horizontal videos into vertical short videos in one click. Create short videos quickly and boost your content creation efficiency.
Integrates multiple top-tier ASR models such as Whisper, FunASR, and Parakeet to convert speech into high-quality text. High accuracy, fast processing, and your data never leaves your local device.
Integrates top-tier TTS engines such as Kokoro, Chatterbox Turbo, and Qwen3 TTS to convert text into natural, fluent speech. Multiple voices and emotion adjustment help you create professional-grade voice-overs.




Deploy the industrial-grade capabilities of open-source AI directly on your local device.
A plug-and-play plugin architecture seamlessly integrates top-tier tools such as Whisper, FunASR, Kokoro, and FFmpeg. Combine them as needed to keep expanding your creative capabilities.
All AI processing runs on your local CPU or GPU, so your data never leaves your device. Say goodbye to cloud API fees and enjoy enterprise-grade privacy protection.
Rendering technology with frame-level precision, supporting a rich set of templates for subtitles, podcasts, dialogue videos, and more.
Integrates industry-leading models such as Whisper, FunASR, and Kokoro for high-precision speech recognition and natural, fluent speech synthesis across multiple languages.
A built-in FFmpeg video processing engine and an all-in-one download tool handle transcoding, editing, downloading, and other complex media tasks in one place, with no additional configuration required.
A standard HTTP API exposes all plugin capabilities, so you can easily integrate with automation platforms such as n8n, Dify, and Zapier and build enterprise-grade video processing pipelines.
Process files locally in your browser—no upload, 100% private.
AI removes unwanted objects
Remove backgrounds instantly
Make images clearer
AI voice synthesis