Your Complete AI-Powered Audio & Video Studio
Voice cloning, transcription, batch processing, video automation, and creative AI tools -- all running natively on your Windows machine. Free, offline, and private.
What is SoundWorks?
SoundWorks is a free Windows desktop application that brings together AI speech and voice cloning, professional audio engineering, video production automation, and creative AI tools in a single native interface. Whether you are a musician mastering a sample library, a content creator building a video pipeline, or someone who simply refuses to upload sensitive audio to the cloud, SoundWorks runs every feature entirely on your local hardware. No accounts, no subscriptions, no telemetry. Your data stays on your machine, and every tool is available from the moment you install.
Powerful Features for Every Creator
AI Speech & Voice Cloning
Clone Voices and Generate Speech Locally
Run VibeVoice models up to 20 billion parameters on your own GPU. Transcribe audio with Whisper, generate speech from multiple TTS engines, and control pronunciation with SSML -- all without sending a single byte to the cloud.
- VibeVoice local voice cloning with models up to 20B parameters
- Whisper AI transcription with speaker detection
- Multi-engine cloud and local TTS (Silero, IndexTTS2)
- Full SSML control for pronunciation and timing
Advanced Audio Engineering
Batch Process Audio at Scale
Convert hundreds of files between MP3, AAC, AC3, Opus, FLAC, and WAV in a single operation. Extract audio from video, normalize loudness for streaming platforms, and master tracks with professional-grade encoding that preserves every detail.
- Batch conversion across MP3, AAC, AC3, Opus, FLAC, WAV
- Audio extraction from video files
- Loudness normalization and mastering
- Fragment extraction with precise timestamps
Video Studio & Automation
Automate Video Production
Turn presentation slides into fully narrated videos with synchronized voiceover. Fix washed-out HDR footage from phone recordings, cut and merge clips without re-encoding, and control playback speed -- all from a single interface.
- Slide-to-video with automatic narration sync
- HDR to SDR correction for phone recordings
- Lossless video cut and merge
- Speed control and dubbing tools
Creative Intelligence
AI-Powered Creative Intelligence
Rephrase and translate text through ChatGPT and Claude integrations. Build and edit subtitles with a full SRT/VTT studio. Generate scripts and apply intelligent replacement rules across your content library.
- AI rephraser via ChatGPT and Claude
- Subtitle studio with SRT/VTT editing
- Script generation tools
- Intelligent text replacement rules
Built-in Utilities
Built-in Utility Suite
Download videos from popular platforms. Find visually similar images across your library. Generate QR codes and shorten URLs. A complete set of daily-use tools integrated into a single application, eliminating the need for scattered browser extensions.
- Video downloader for popular platforms
- Image similarity search
- QR code generator
- URL shortener with tracking
Privacy & Performance
Privacy and Performance by Design
Every feature runs offline by default. Store sensitive voice models and project files in an encrypted vault. Force CPU-only mode to keep GPU free for other tasks or run on machines without dedicated graphics hardware.
- Offline-first architecture -- no internet required
- Encrypted vault for sensitive files
- CPU-only mode for low-spec machines
- Zero telemetry and no data collection
What You Can Create
Generate AI Podcasts
Clone your voice once, then produce entire podcast series from text scripts. Control pacing, pauses, and emphasis without sitting in front of a microphone.
Create Faceless YouTube Videos
Combine slide-to-video automation with AI narration to produce bulk content at scale. Process hundreds of assets through a single pipeline in real time or while you sleep.
Build Online Courses
Transform presentations into professional training videos with synchronized voiceover. Consistent audio quality across every lesson, no recording sessions needed.
Transcribe Interviews
Run Whisper locally on hours of recorded audio. Get timestamped, speaker-detected transcripts exported as SRT, VTT, or plain text -- all without uploading to a cloud service.
Master Audio for Streaming
Batch normalize loudness to LUFS standards, convert between formats, and apply consistent mastering settings across an entire album or sample library.
Fix Phone Video Footage
Correct washed-out HDR recordings from mobile phones, trim and merge clips losslessly, and adjust playback speed -- all in a clean native interface.
Frequently Asked Questions
Start Creating Today
Every feature. No cost. No cloud. Download SoundWorks and take control of your creative workflow.