Controls where GPU-heavy tasks run: voice conversion, XTTS narration, and model training.
Modal GPU
Serverless T4 GPU on Modal. Fast, no hardware needed. Recommended.
Local
Runs on the server's own hardware. Requires a GPU on the host machine.
Voice Convert (Seed-VC)
Zero-shot voice cloning applied to existing audio
XTTS Narration Studio
Neural TTS in your cloned voice
Recap Studio
AI video recap narration
Vocal Training
Train custom RVC voice models
Demucs / Denoise
Vocal separation and audio enhancement
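As a rough illustration of how one setting can route every task listed above, here is a minimal Python sketch. It is not the application's actual code: the `Backend` enum, the task keys, `run_gpu_task`, and the two placeholder runners are all hypothetical names chosen for this example. The placeholder runners only return a string, where a real one would submit the job to Modal or run it on the host GPU.

```python
from enum import Enum
from typing import Callable, Dict


class Backend(Enum):
    """The two choices offered by the setting (names are hypothetical)."""
    MODAL_GPU = "modal_gpu"  # serverless T4 GPU on Modal
    LOCAL = "local"          # GPU on the host machine


# Display names come from the list above; the keys are made up for this sketch.
GPU_TASKS: Dict[str, str] = {
    "voice_convert": "Voice Convert (Seed-VC)",
    "xtts_narration": "XTTS Narration Studio",
    "recap_studio": "Recap Studio",
    "vocal_training": "Vocal Training",
    "demucs_denoise": "Demucs / Denoise",
}

Runner = Callable[[str], str]


def run_gpu_task(task: str, backend: Backend, runners: Dict[Backend, Runner]) -> str:
    """Send a known GPU task to the runner registered for the chosen backend."""
    if task not in GPU_TASKS:
        raise ValueError(f"unknown GPU task: {task}")
    return runners[backend](task)


# Placeholder runners: they only describe where the task would run.
def modal_runner(task: str) -> str:
    return f"{GPU_TASKS[task]} -> Modal T4"


def local_runner(task: str) -> str:
    return f"{GPU_TASKS[task]} -> local GPU"


RUNNERS: Dict[Backend, Runner] = {
    Backend.MODAL_GPU: modal_runner,
    Backend.LOCAL: local_runner,
}

if __name__ == "__main__":
    print(run_gpu_task("voice_convert", Backend.MODAL_GPU, RUNNERS))
```

Keeping the backend choice in a single lookup table means that switching from Modal GPU to Local changes only which runner is called, while the task list stays the same.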