This skill generates natural speech from text, supporting over 70 languages with multiple models that balance quality and latency. It is ideal for developers looking to convert written content into spoken language with fine-tuned voice characteristics.
$ npx skills add https://github.com/elevenlabs/skills --skill text-to-speechThis skill converts written text into natural-sounding speech using ElevenLabs' voice synthesis technology. It supports over 70 languages with six different models optimized for different use cases—from highest-quality emotional speech (eleven_v3) to ultra-low latency real-time applications (eleven_flash_v2_5). You can select from pre-built voices or create custom ones, and fine-tune output with stability, similarity boost, style, and speaker boost settings. The skill also offers request stitching to eliminate audio artifacts across long-form content, language enforcement for pronunciation control, text normalization for natural number and date reading, and multiple output formats including MP3, PCM, Opus, and telephony-standard μ-law. Developers building voice applications, voiceover systems, multilingual content platforms, or real-time conversational AI benefit from its flexible architecture and streaming capabilities.
Install the skill using the command provided and refer to the setup guide for additional details.
Generating audio for educational content
Creating voiceovers for videos
Transcribing meetings and events into spoken summaries
Developing interactive voice applications
$ npx skills add https://github.com/elevenlabs/skills --skill text-to-speechgit clone https://github.com/elevenlabs/skillsCopy the install command above and run it in your terminal.
Launch Claude Code, Cursor, or your preferred AI coding agent.
Use the prompt template or examples below to test the skill.
Adapt the skill to your specific use case and workflow.
Generate natural speech from the following text: '[TEXT]'. Use a [LANGUAGE] voice with [TONE_STYLE] tone (e.g., conversational, formal, or emotional). Ensure the speech is clear and suitable for [USE_CASE, e.g., audiobooks, customer service, or educational content].
Here’s the natural speech output for your text: **Audio Preview:** [Simulated audio player interface with play button] **Transcript:** "Welcome to GreenLeaf Organics, your trusted partner in sustainable farming solutions. Our mission is to empower local farmers with cutting-edge agritech tools that enhance productivity while preserving the environment. Whether you're looking to optimize irrigation, monitor soil health, or access real-time market data, we’ve got you covered." **Voice Characteristics:** - Language: English (US) - Tone: Professional and warm - Speed: 1.0x (adjustable) - Model: High-quality neural TTS (balanced latency) **Suggested Next Steps:** - Download the audio file in MP3 or WAV format. - Adjust tone or speed if needed for a different audience. - Integrate this speech into your [USE_CASE, e.g., podcast, IVR system, or e-learning module].
Take a free 3-minute scan and get personalized AI skill recommendations.
Take free scan