Skip to main content

Your Complete AI-Powered Audio & Video Studio

Voice cloning, transcription, batch processing, video automation, and creative AI tools -- all running natively on your Windows machine. Free, offline, and private.

What is SoundWorks?

SoundWorks is a free Windows desktop application that brings together AI speech and voice cloning, professional audio engineering, video production automation, and creative AI tools in a single native interface. Whether you are a musician mastering a sample library, a content creator building a video pipeline, or someone who simply refuses to upload sensitive audio to the cloud, SoundWorks runs every feature entirely on your local hardware. No accounts, no subscriptions, no telemetry. Your data stays on your machine, and every tool is available from the moment you install.

Powerful Features for Every Creator

AI Speech & Voice Cloning

Clone Voices and Generate Speech Locally

Run VibeVoice models up to 20 billion parameters on your own GPU. Transcribe audio with Whisper, generate speech from multiple TTS engines, and control pronunciation with SSML -- all without sending a single byte to the cloud.

  • VibeVoice local voice cloning with models up to 20B parameters
  • Whisper AI transcription with speaker detection
  • Multi-engine cloud and local TTS (Silero, IndexTTS2)
  • Full SSML control for pronunciation and timing

Advanced Audio Engineering

Batch Process Audio at Scale

Convert hundreds of files between MP3, AAC, AC3, Opus, FLAC, and WAV in a single operation. Extract audio from video, normalize loudness for streaming platforms, and master tracks with professional-grade encoding that preserves every detail.

  • Batch conversion across MP3, AAC, AC3, Opus, FLAC, WAV
  • Audio extraction from video files
  • Loudness normalization and mastering
  • Fragment extraction with precise timestamps

Video Studio & Automation

Automate Video Production

Turn presentation slides into fully narrated videos with synchronized voiceover. Fix washed-out HDR footage from phone recordings, cut and merge clips without re-encoding, and control playback speed -- all from a single interface.

  • Slide-to-video with automatic narration sync
  • HDR to SDR correction for phone recordings
  • Lossless video cut and merge
  • Speed control and dubbing tools

Creative Intelligence

AI-Powered Creative Intelligence

Rephrase and translate text through ChatGPT and Claude integrations. Build and edit subtitles with a full SRT/VTT studio. Generate scripts and apply intelligent replacement rules across your content library.

  • AI rephraser via ChatGPT and Claude
  • Subtitle studio with SRT/VTT editing
  • Script generation tools
  • Intelligent text replacement rules

Built-in Utilities

Built-in Utility Suite

Download videos from popular platforms. Find visually similar images across your library. Generate QR codes and shorten URLs. A complete set of daily-use tools integrated into a single application, eliminating the need for scattered browser extensions.

  • Video downloader for popular platforms
  • Image similarity search
  • QR code generator
  • URL shortener with tracking

Privacy & Performance

Privacy and Performance by Design

Every feature runs offline by default. Store sensitive voice models and project files in an encrypted vault. Force CPU-only mode to keep GPU free for other tasks or run on machines without dedicated graphics hardware.

  • Offline-first architecture -- no internet required
  • Encrypted vault for sensitive files
  • CPU-only mode for low-spec machines
  • Zero telemetry and no data collection

What You Can Create

Generate AI Podcasts

Clone your voice once, then produce entire podcast series from text scripts. Control pacing, pauses, and emphasis without sitting in front of a microphone.

Create Faceless YouTube Videos

Combine slide-to-video automation with AI narration to produce bulk content at scale. Process hundreds of assets through a single pipeline in real time or while you sleep.

Build Online Courses

Transform presentations into professional training videos with synchronized voiceover. Consistent audio quality across every lesson, no recording sessions needed.

Transcribe Interviews

Run Whisper locally on hours of recorded audio. Get timestamped, speaker-detected transcripts exported as SRT, VTT, or plain text -- all without uploading to a cloud service.

Master Audio for Streaming

Batch normalize loudness to LUFS standards, convert between formats, and apply consistent mastering settings across an entire album or sample library.

Fix Phone Video Footage

Correct washed-out HDR recordings from mobile phones, trim and merge clips losslessly, and adjust playback speed -- all in a clean native interface.

Frequently Asked Questions

Start Creating Today

Every feature. No cost. No cloud. Download SoundWorks and take control of your creative workflow.

Download for Windows
v1.33.1 85 MB Windows 10, Windows 11