abstrakt
Models
Featured
Sora 2 Pro
Featured

Sora 2 Pro

OpenAI's most advanced video generation model with photorealistic output and complex scene understanding.

Veo 3.1
New

Veo 3.1

Google DeepMind's flagship video model with exceptional motion consistency and cinematic quality.

Kling 2.6
Popular

Kling 2.6

Latest Kling model with enhanced character consistency, longer duration support, and improved physics.

Active

100+ AI Models

Access the best AI models from multiple providers through one unified API. Switch models without changing code.

Browse all models
Tools
Featured
AI Image Generator
Popular

AI Image Generator

Create stunning images from text descriptions using FLUX, Stable Diffusion, and more.

Text to Video
New

Text to Video

Transform your ideas into cinematic AI videos with Sora, Veo, and Kling models.

Text to Speech

Text to Speech

Convert text to natural-sounding speech with 30+ voices and emotional expression.

Active

20+ AI Tools

Ready-to-use tools for image, video, and audio generation. No code required — just upload and create.

Explore all tools
Tutorials
Featured
Build Your First AI App
Start Here

Build Your First AI App

Your first AI generation in 5 minutes. Set up your API key and create your first image.

Text-to-Image Masterclass

Text-to-Image Masterclass

Master prompting techniques, model selection, and advanced settings for stunning results.

Text-to-Video Fundamentals

Text-to-Video Fundamentals

Learn to create cinematic AI videos with proper motion, pacing, and storytelling.

Active

Learn AI Generation

Step-by-step guides to master AI image, video, and audio creation. From beginner to advanced.

View all tutorials
Sandbox
Docs
Docs Online

Getting Started

  • Introduction
  • Quick Start
  • Authentication

API Reference

  • Jobs
  • Models
  • Webhooks

Guides

  • Image Generation
  • Video Generation
  • Audio & TTS

Resources

  • Error Codes
  • Rate Limits
Docs/Guides/Audio & TTS

Audio & Text-to-Speech

Generate natural-sounding speech and audio with AI.

Available Models

Text-to-Speech

ModelQualitySpeedBest For
playht-tts-v3Premium~3sProfessional voiceovers
f5-ttsHigh~2sFast generation

Basic Text-to-Speech

Convert text to natural-sounding speech:

const result = await abstrakt.run('playht-tts-v3', {
  text: 'Welcome to Abstrakt! The unified API for AI.',
  voice: 'alloy',
  speed: 1.0
});

console.log('Audio URL:', result.output.items[0].url);

Voice Options

Choose from various voice styles:

alloy

Neutral, balanced voice

echo

Warm, conversational

fable

Expressive, storytelling

onyx

Deep, authoritative

nova

Bright, energetic

shimmer

Soft, gentle

Parameters

ParameterTypeDescription
textstringThe text to convert to speech
voicestringVoice ID (alloy, echo, fable, etc.)
speednumberPlayback speed (0.5 to 2.0)

Long-form Content

For longer text, consider breaking it into chunks:

// Split long text into paragraphs
const paragraphs = longText.split('\n\n');

// Generate audio for each paragraph
const audioUrls = await Promise.all(
  paragraphs.map(text => 
    abstrakt.run('playht-tts-v3', {
      text,
      voice: 'alloy'
    })
  )
);

// Combine audio files on your server
const urls = audioUrls.map(r => r.output.items[0].url);

Common Use Cases

  • •Podcasts & Audiobooks: Convert written content to audio format.
  • •Video Narration: Generate voiceovers for videos and presentations.
  • •Accessibility: Make content accessible with audio versions.
  • •IVR Systems: Create dynamic voice prompts for phone systems.
  • •Language Learning: Generate pronunciation examples.

Next Steps

API Reference

Explore all available endpoints

Try TTS Tool

Generate speech in your browser

On this page

abstrakt
abstrakt

The unified abstraction layer for the next generation of AI applications. Build faster with any model.

Start Here+
  • Quickstart
  • Get API Key
  • Try Playground
  • View Pricing
Image Tools+
  • AI Image Generator
  • Image to Image
  • Remove Background
  • Image Upscaler
  • Object Remover
  • Style Transfer
  • Image Enhancer
  • AI Art Generator
Video Tools+
  • Text to Video
  • Image to Video
  • AI Video Generator
  • Video Upscaler
  • Video Enhancer
  • Frame Interpolation
Audio Tools+
  • Text to Speech
  • Speech to Text
  • AI Music Generator
  • Voice Cloning
  • Audio Enhancer
  • Sound Effects
Tutorials+
  • Getting Started
  • Image Generation
  • Video Generation
  • Audio Generation
  • Advanced Topics
  • AI Glossary
  • All Tutorials
Models+
  • FLUX Schnell
  • FLUX Dev
  • Fast SDXL
  • Stable Diffusion 3
  • MiniMax Video
  • Kling AI
  • Ideogram
  • More Models
Company+
  • About Us
  • Pricing
  • Documentation
  • Tutorials
  • Blog
  • Contact
  • Changelog
  • Status
  • Careers
  • Privacy Policy
  • Terms of Service
  • Cookie Policy

Image Tools

  • AI Image Generator
  • Image to Image
  • Remove Background
  • Image Upscaler
  • Object Remover
  • Style Transfer
  • Image Enhancer
  • AI Art Generator

Video Tools

  • Text to Video
  • Image to Video
  • AI Video Generator
  • Video Upscaler
  • Video Enhancer
  • Frame Interpolation

Audio Tools

  • Text to Speech
  • Speech to Text
  • AI Music Generator
  • Voice Cloning
  • Audio Enhancer
  • Sound Effects

Tutorials

  • Getting Started
  • Image Generation
  • Video Generation
  • Audio Generation
  • Advanced Topics
  • AI Glossary
  • All Tutorials

Start Here

  • Quickstart
  • Get API Key
  • Try Playground
  • View Pricing

Models

  • FLUX Schnell
  • FLUX Dev
  • Fast SDXL
  • Stable Diffusion 3
  • MiniMax Video
  • Kling AI
  • Ideogram
  • More Models

Company

  • About Us
  • Pricing
  • Documentation
  • Tutorials
  • Blog
  • Contact
  • Changelog
  • Status
  • Careers
  • Privacy Policy
  • Terms of Service
  • Cookie Policy
abstrakt

The unified abstraction layer for the next generation of AI applications.

© 2026 abstrakt. All rights reserved.

SYS.ONLINE|API.ACTIVE|v1.2.0