General Introduction
ElevenLabs is a startup based in New York, USA, specializing in the field of generative AI speech. The company offers a range of powerful text-to-speech, speech-to-speech, speech cloning, and speech recognition services.ElevenLabs' strength lies in its strong multi-language support and personalization capabilities, supporting 32 languages including Chinese, English, Japanese, and Korean. The platform is widely used in the production of audiobooks, movie dubbing, game NPC voices and other content production areas.
Function List
- Text-to-speech: Converts text into high-quality, natural-sounding speech.
- voice cloning: Create personalized voice clones with a few minutes of audio.
- Multi-language support: Supports speech generation and conversion in 32 languages.
- phonetic library: Provides a rich voice library for users to choose and use.
- API integration: Provides low-latency APIs for easy integration into applications by developers.
- project management: Support for project management features such as converting books to audiobooks, scripts to podcasts, and more.
Using Help
Installation and Registration
- Visit the ElevenLabs website (elevenlabs.io).
- Click the "Register" button and fill in the relevant information to complete the registration.
- After logging in, go to the User Control Panel and select the desired service.
Function Operation Guide
Text-to-speech
- Select the "Text to Speech" function in the control panel.
- Type or paste the text content to be converted.
- Select the desired voice type and language.
- Click the "Generate" button and wait for the system to generate the voice file.
- Download the generated voice files or play them directly on the platform.
voice cloning
- Select the "Voice Clone" function in the control panel.
- Upload a few minutes of audio samples and the system will automatically analyze and generate a speech cloning model.
- Select the generated speech clone model and enter the text content for speech generation.
- Download or play the generated voice file.
Multi-language support
- In any speech generation function, select the desired language.
- Input text content, the system will automatically recognize and generate speech files in the corresponding language.
API integration
- Select the "API Integration" function in the control panel.
- Get the API key and related documentation.
- Follow the documentation instructions to integrate the API into your application for speech generation functionality.
project management
- Select the "Project Management" function in the control panel.
- Create a new project and select the project type (e.g. audiobook, podcast, etc.).
- Upload relevant text or audio content and the system will automatically process and generate the required voice files.
- Download or play the generated project file.
ElevenLabs Membership Program
Membership Program | prices | Monthly Character Limit | Customized sound | Additional usage-based characters | Text-to-Speech & Speech-to-Speech | Access to a growing library of sounds | automatic dubbing | dubbing studio | audio quality | API format | Synthetic Sound Design | Instant sound cloning | Professional sound cloning | sports event | utilization analysis | commercial license | No need for attribution |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
free (of charge) | $0/permanent | 10,000 (~10 minutes of audio) | 3 voices | - | ✔️ | ✔️ | - | - | 128 kbps, 44.1kHz | 16kHz PCM, uLaw | - | - | - | - | - | - | ✔️ |
introduction (a subject) | $5 $1/month (20% off first month) | 30,000 (~30 minutes of audio) | 10 voices | $0.30/1000 characters | ✔️ | ✔️ | - | - | 128 kbps, 44.1kHz | 22.05kHz PCM, uLaw | - | - | - | - | - | - | ✔️ |
author (of some project) | $22 $11/month (50% off first month) | 100,000 (~2 hours of audio) | 30 voices | $0.24/1000 characters | ✔️ | ✔️ | ✔️ | - | 128 & 192 kbps (via project), 44.1kHz | 24kHz PCM, uLaw | ✔️ | ✔️ | - | ✔️ | ✔️ | ✔️ | ✔️ |
specialized field | $99/month | 500,000 (~10 hours of audio) | 160 voices | $0.18/1000 characters | ✔️ | ✔️ | ✔️ | ✔️ | 128 & 192 kbps (via project & API), 44.1kHz | 44.1kHz PCM, uLaw | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ |
ballpark | $330/month | 2,000,000 (~40 hours of audio) | 660 voices | - | ✔️ | ✔️ | ✔️ | ✔️ | 128 & 192 kbps (via project), 44.1kHz | 44.1kHz PCM, uLaw | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ |