Audio-Video Processing

General Interface
Enable the invocation of all TTS (Text-to-Speech) models through a unified format.

OpenAI
Support speech-to-text and text-to-speech conversion.

Suno
Compose songs by AI

302.AI
Open-source model deployed by 302.AI.

Microsoft Azure
Audio processing services provided by Microsoft Azure.

Doubao
Text-to-Speech API from Doubao.

Fish Audio
Sound cloning service from Fish Audio

Minimax
Ultra-long text-to-speech generation from Minimax

Dubbingx
Text-to-speech generation service from Dubbingx

Udio
Song generation service from Udio

Elevenlabs
Audio and video processing services provided by Elevenlabs

Mureka
Mureka is an audio processing service launched by Kunlun Tech