텍스트 음성 변환

Fish Audio, MiniMax, Qwen 등으로 자연스러운 음성으로 변환

음성을 텍스트로

업로드한 음성을 고정밀로 전사

주요 모델로 프롬프트에서 이미지 생성

텍스트와 스타일로 영상 제작

립싱크·디지털 휴먼

아바타·프레젠테이션용 음성과 영상 동기화

음성 워크스페이스

음성 합성 워크스페이스에서 프로젝트 관리

SNS·광고·UGC용 빠른 나레이션

오디오북·팟캐스트

자연스러운 템포의 긴 나레이션

교육·사내 연수

강의·사내 커뮤니케이션용 명확한 읽기

모델 라이브러리

TTS 공급자·기능·사양을 한눈에 비교

음성 클론 절차

샘플 수집부터 학습·모범 사례까지

API 플레이그라운드

API 키로 REST 온라인 시험

토큰 생성 및 관리

앱 열기

.

API documentation & playground

Choose an API below for endpoint details, parameters, and live testing with your API key.

Text to Speech (HTTP)
REST synthesis with your voice model ID and engine options.
Text to Speech (HTTP v2)
Synthesize speech with a voice ID and optional engine settings.
TTS WebSocket
Streaming speech over WebSocket for realtime use cases.
TTS WebSocket v2
Updated WebSocket protocol for TTS.
Speech to Text
Transcribe audio from a public URL.
Voice clone — create model
Upload reference audio to create a voice model.
Voice clone — delete model
Remove a voice model by ID.
Voice clone — list models
List public and personal voice models.
Lip sync — create task
Create a lip-sync video generation task.
Lip sync — query task
Poll task status and results by ID.
Lip sync — list tasks
List lip-sync tasks and statistics.
User profile (API)
Remaining API quota and basic account info.