Artificial Intelligence (AI)
Core Concepts: A broad field of computer science focused on building systems capable of performing tasks that typically require human intelligence, such as visual perception, speech recognition, decision-making, and language translation.
Attention Mechanism
Model Architecture: A technique that allows models to focus on relevant parts of the input when generating each part of the output. Enables models to capture long-range dependencies in data.
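For intuition, a minimal NumPy sketch of scaled dot-product attention, the form used in Transformers, is shown below; all array sizes are illustrative.

```python
# A minimal sketch of scaled dot-product attention (illustrative sizes).
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention(Q, K, V):
    # Scores rate how relevant each input position (key) is to each
    # output position (query); softmax turns them into mixing weights.
    scores = Q @ K.T / np.sqrt(K.shape[-1])
    return softmax(scores) @ V  # weighted mix of value vectors

rng = np.random.default_rng(0)
Q, K, V = rng.normal(size=(3, 8)), rng.normal(size=(4, 8)), rng.normal(size=(4, 8))
print(attention(Q, K, V).shape)  # (3, 8): 3 outputs built from 4 inputs
```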
Batch Processing
Inference: Generating multiple outputs in a single request or processing multiple inputs together. More efficient than making individual requests for each generation.
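As a rough illustration, the sketch below uses a stand-in model function (not any real API) to show that one batched call computes the same result as many individual calls:

```python
# A minimal sketch, not any real API: a stand-in "model" applied to eight
# inputs one at a time versus once as a single batch.
import numpy as np

def model(x):
    # Stand-in for an expensive forward pass; accepts any batch size.
    return x @ np.ones((16, 4))

inputs = np.random.default_rng(0).normal(size=(8, 16))

one_by_one = [model(inputs[i:i + 1]) for i in range(8)]  # 8 separate calls
batched = model(inputs)                                  # 1 call, same math
print(np.allclose(np.vstack(one_by_one), batched))       # True
```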
Bias
Safety: Systematic errors in AI outputs that stem from skewed or unrepresentative training data or from model design choices. Can result in unfair or inaccurate representations of certain groups or concepts.
CFG Scale (Classifier-Free Guidance)
Prompting: A parameter that controls how closely the generated image follows the prompt. Higher values produce outputs more aligned with the prompt but may reduce diversity and quality.
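The underlying arithmetic is a simple extrapolation between two noise predictions. A minimal sketch, with placeholder arrays standing in for real model outputs:

```python
# A minimal sketch of classifier-free guidance at a single denoising step.
# eps_uncond and eps_cond are placeholder arrays standing in for the
# model's unconditional and prompt-conditioned noise predictions.
import numpy as np

def apply_cfg(eps_uncond, eps_cond, cfg_scale):
    # Push the prediction away from "no prompt" and toward "the prompt".
    return eps_uncond + cfg_scale * (eps_cond - eps_uncond)

eps_uncond = np.zeros((4, 4))
eps_cond = np.ones((4, 4))
guided = apply_cfg(eps_uncond, eps_cond, cfg_scale=7.5)
print(guided[0, 0])  # 7.5: higher scale, stronger pull toward the prompt
```

Values around 7 to 8 are a commonly used default for Stable Diffusion-family models.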
Checkpoint
Training: A saved state of a model's weights at a point during or after training. Allows resuming training, comparing versions, or using the model at different stages of development.
CLIP (Contrastive Language-Image Pre-training)
Model Architecture: A model trained by OpenAI that learns to associate images with text descriptions. Used in image generation to guide the creation process based on text prompts.
Content Safety
Safety: Systems and practices to prevent AI from generating harmful, illegal, or inappropriate content. Includes filters, classifiers, and model training restrictions.
ControlNet
Image Generation: A neural network architecture that adds conditional control to diffusion models. Allows guiding image generation with additional inputs like edge maps, depth maps, poses, or sketches.
DALL-E
Models: OpenAI's image generation model series. DALL-E 3 is known for excellent prompt understanding and safe, high-quality outputs. Available through OpenAI's API.
Deep Learning
Core Concepts: A subset of machine learning that uses neural networks with many layers (hence 'deep') to learn complex patterns in data. Powers most modern AI image, video, and audio generation.
Diffusion Model
Generative AI: A type of generative model that learns to create images by reversing a gradual noising process. Starting from pure noise, the model iteratively removes noise to generate coherent images. Used by FLUX, Stable Diffusion, and DALL-E 3.
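A heavily simplified sketch of the sampling loop; the real denoiser is a trained network with a proper noise schedule, both stubbed out here:

```python
# A heavily simplified sketch of diffusion sampling. The real denoiser is
# a trained network with a proper noise schedule; both are stubbed here.
import numpy as np

def predict_noise(x, t):
    # Placeholder for the trained model's noise prediction at step t.
    return 0.1 * x

def sample(shape, steps=50):
    rng = np.random.default_rng(0)
    x = rng.normal(size=shape)            # start from pure noise
    for t in reversed(range(steps)):
        x = x - predict_noise(x, t)       # iteratively remove noise
    return x

image = sample((64, 64, 3))               # denoised "image" array
```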
DreamBooth
Training: A fine-tuning technique that teaches a model to generate specific subjects (people, objects, styles) from just a few example images. Associates a unique identifier token with the subject.
Fine-tuning
Training: Adapting a pre-trained model to a specific task or domain by training it further on a smaller, targeted dataset. More efficient than training from scratch.
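One common pattern, sketched here in PyTorch with illustrative sizes, is to freeze the pretrained backbone and train only a small new head:

```python
# A minimal PyTorch sketch: freeze a stand-in pretrained backbone and
# train only a small new head on task data (all sizes illustrative).
import torch
import torch.nn as nn

backbone = nn.Sequential(nn.Linear(32, 64), nn.ReLU())  # "pretrained" part
head = nn.Linear(64, 2)                                  # new task head

for p in backbone.parameters():
    p.requires_grad = False  # keep pretrained features fixed

opt = torch.optim.Adam(head.parameters(), lr=1e-3)
x, y = torch.randn(16, 32), torch.randint(0, 2, (16,))
loss = nn.functional.cross_entropy(head(backbone(x)), y)
opt.zero_grad(); loss.backward(); opt.step()
```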
FLUX
Models: A family of high-quality image generation models by Black Forest Labs. Known for excellent prompt following, text rendering, and diverse outputs. Available in Schnell (fast), Dev (balanced), and Pro (quality) variants.
Frame Interpolation
Video Generation: Generating intermediate frames between existing video frames to increase frame rate or create smooth slow-motion effects. AI predicts the motion between frames.
GAN (Generative Adversarial Network)
Generative AI: A generative model architecture where two neural networks (generator and discriminator) compete against each other. The generator creates fake samples while the discriminator tries to distinguish them from real ones.
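A heavily condensed PyTorch sketch of one adversarial training step, with toy networks and stand-in data:

```python
# A toy GAN training step, assuming PyTorch; networks and data are stand-ins.
import torch
import torch.nn as nn

G = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 2))  # generator
D = nn.Sequential(nn.Linear(2, 32), nn.ReLU(), nn.Linear(32, 1))   # discriminator
opt_g = torch.optim.Adam(G.parameters(), lr=1e-3)
opt_d = torch.optim.Adam(D.parameters(), lr=1e-3)
bce = nn.BCEWithLogitsLoss()

real = torch.randn(64, 2) + 3.0          # stand-in "real" data
fake = G(torch.randn(64, 16))

# Discriminator step: label real samples 1, generated samples 0.
d_loss = bce(D(real), torch.ones(64, 1)) + bce(D(fake.detach()), torch.zeros(64, 1))
opt_d.zero_grad(); d_loss.backward(); opt_d.step()

# Generator step: try to make the discriminator output 1 for fakes.
g_loss = bce(D(fake), torch.ones(64, 1))
opt_g.zero_grad(); g_loss.backward(); opt_g.step()
```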
Generative AI
Generative AI: AI systems that can create new content, including images, text, audio, and video. These models learn patterns from training data and generate novel outputs based on prompts or inputs.
Image-to-Image
Image Generation: AI generation that takes an existing image as input along with a prompt to create a modified or transformed version. Used for style transfer, editing, and variations.
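With the Hugging Face diffusers library, an image-to-image call might look like the sketch below; the exact API and model id vary by version, and the file names are placeholders.

```python
# A hedged sketch using Hugging Face diffusers; exact API and model id
# may vary by version, and the file names are placeholders.
import torch
from diffusers import AutoPipelineForImage2Image
from PIL import Image

pipe = AutoPipelineForImage2Image.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0", torch_dtype=torch.float16
).to("cuda")

init = Image.open("photo.png").convert("RGB")
# strength controls how far the output may drift from the input image.
result = pipe(prompt="watercolor painting", image=init, strength=0.6).images[0]
result.save("watercolor.png")
```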
Image-to-Video
Video Generation: Converting a static image into a video by generating motion and animation. The AI predicts plausible movement based on the image content.
Inference
Inference: The process of using a trained model to generate outputs from new inputs. When you call an AI API to generate an image, you're running inference on the model.
Inpainting
Image Generation: The process of filling in or replacing specific regions of an image while maintaining coherence with the surrounding content. Used for removing objects, editing specific areas, or extending images.
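A hedged sketch of inpainting with the diffusers library; API and model id may vary by version, and the file names are placeholders.

```python
# A hedged sketch using diffusers' inpainting pipeline; API and model id
# may vary by version, and the file names are placeholders.
import torch
from diffusers import AutoPipelineForInpainting
from PIL import Image

pipe = AutoPipelineForInpainting.from_pretrained(
    "stabilityai/stable-diffusion-2-inpainting", torch_dtype=torch.float16
).to("cuda")

image = Image.open("photo.png").convert("RGB")
mask = Image.open("mask.png").convert("L")   # white marks the region to redo
result = pipe(prompt="a park bench", image=image, mask_image=mask).images[0]
result.save("edited.png")
```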
Latency
Inference: The time between sending a request and receiving a response. For AI generation, latency includes queue time, inference time, and network transfer.
Latent Space
Generative AI: A compressed representation of data learned by AI models. In image generation, manipulating points in latent space allows for smooth transitions between generated images and control over image attributes.
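For example, interpolating between two latent points yields outputs that morph smoothly between the endpoints. A minimal sketch of spherical interpolation (slerp), a common choice for Gaussian latents:

```python
# A minimal sketch of spherical interpolation (slerp) between two latent
# points; decoding the midpoint would blend the two endpoint images.
import numpy as np

def slerp(a, b, t):
    cos_omega = np.dot(a / np.linalg.norm(a), b / np.linalg.norm(b))
    omega = np.arccos(np.clip(cos_omega, -1.0, 1.0))
    return (np.sin((1 - t) * omega) * a + np.sin(t * omega) * b) / np.sin(omega)

rng = np.random.default_rng(0)
z1, z2 = rng.normal(size=512), rng.normal(size=512)
midpoint = slerp(z1, z2, 0.5)  # decoding this would blend both endpoints
```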
LoRA (Low-Rank Adaptation)
Training: A technique for efficiently fine-tuning large models by training small adapter layers rather than all model weights. Produces small files that can be combined with base models.
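The core idea, sketched in NumPy with illustrative sizes: the frozen base weight W gains a trainable low-rank update B @ A, so far fewer parameters are trained.

```python
# A minimal NumPy sketch of a LoRA-augmented linear layer. The frozen
# base weight W gains a trainable low-rank update B @ A, so only
# 2 * d * r parameters are trained instead of d * d.
import numpy as np

d, r = 768, 8
rng = np.random.default_rng(0)
W = rng.normal(size=(d, d))            # frozen base weights
A = rng.normal(size=(r, d)) * 0.01     # trainable, small random init
B = np.zeros((d, r))                   # trainable, zero init: no effect yet

def lora_forward(x):
    # Identical to the base layer until B is trained away from zero.
    return x @ W.T + x @ A.T @ B.T

x = np.ones((1, d))
print(lora_forward(x).shape)  # (1, 768)
```

Because only A and B are saved, the resulting adapter file is a tiny fraction of the base model's size.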
Midjourney
Models: A popular AI image generation service known for its artistic, stylized outputs. Accessed primarily through Discord; it does not offer an official public API.
Music Generation
Audio Generation: AI systems that create original music compositions from prompts, existing audio, or musical notation. Can generate melodies, harmonies, and full arrangements.
Neural Network
Core Concepts: A computing system inspired by biological neural networks. Consists of interconnected nodes (neurons) organized in layers that process information and learn patterns from data.
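A minimal sketch of one forward pass through a two-layer network, with illustrative sizes:

```python
# A minimal sketch of a forward pass through a tiny two-layer network.
import numpy as np

rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(4, 3)), np.zeros(4)  # layer 1: 3 inputs -> 4 hidden
W2, b2 = rng.normal(size=(2, 4)), np.zeros(2)  # layer 2: 4 hidden -> 2 outputs

x = np.array([1.0, 0.5, -0.2])
h = np.maximum(0, W1 @ x + b1)  # ReLU "neurons" in the hidden layer
y = W2 @ h + b2
print(y)
```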
Outpainting
Image Generation: Extending an image beyond its original boundaries by generating new content that seamlessly continues the existing image. Also called image extension or uncropping.
Prompt
Prompting: The text input given to a generative AI model to guide what it creates. Effective prompting is key to getting desired results from AI generation.
Prompt Engineering
Prompting: The practice of crafting effective prompts to achieve desired outputs from AI models. Involves understanding model behavior, using specific keywords, and iterative refinement.
Seed
Inference: A number that initializes the random number generator, allowing reproducible generations. The same seed, prompt, and settings typically reproduce the same output on the same model and hardware.
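A minimal PyTorch sketch of seeded, reproducible sampling:

```python
# A minimal sketch of seeding for reproducibility, assuming PyTorch.
import torch

generator = torch.Generator().manual_seed(42)
noise_a = torch.randn(4, 4, generator=generator)

generator = torch.Generator().manual_seed(42)
noise_b = torch.randn(4, 4, generator=generator)

print(torch.equal(noise_a, noise_b))  # True: same seed, same noise
```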
Speech-to-Text (STT)
Audio Generation: AI systems that transcribe spoken audio into written text. Also called automatic speech recognition (ASR). Used for transcription, subtitles, and voice interfaces.
Stable Diffusion
Models: An open-source latent diffusion model for image generation developed by Stability AI. The most widely used open model, with many community fine-tunes and extensions.
Temporal Consistency
Video Generation: The coherence of generated video across time, ensuring objects, lighting, and style remain consistent from frame to frame without flickering or sudden changes.
Text-to-Image
Image Generation: AI systems that generate images from text descriptions (prompts). Models like FLUX, DALL-E, and Midjourney interpret natural language to create corresponding visual content.
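A hedged sketch of a text-to-image call with the diffusers library; the exact API and model id vary by version, and a GPU is assumed.

```python
# A hedged sketch using diffusers' text-to-image pipeline; exact API and
# model id may vary by version, and a GPU is assumed.
import torch
from diffusers import AutoPipelineForText2Image

pipe = AutoPipelineForText2Image.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0", torch_dtype=torch.float16
).to("cuda")

image = pipe(
    prompt="a lighthouse at dusk, oil painting",
    num_inference_steps=30,
    guidance_scale=7.0,  # the CFG scale described earlier in this glossary
).images[0]
image.save("lighthouse.png")
```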
Text-to-Speech (TTS)
Audio Generation: AI systems that convert written text into natural-sounding speech. Modern TTS can clone voices, control emotion, and produce highly realistic audio.
Text-to-Video
Video Generation: AI systems that generate video clips from text descriptions. These models create coherent motion and temporal consistency across frames while interpreting prompts.
Token
Prompting: The basic unit of text that AI models process. Words, parts of words, or punctuation marks are converted to tokens. Models have limits on the number of tokens they can process.
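To see tokenization in practice, a short sketch assuming OpenAI's tiktoken library (the encoding name is one of its built-ins):

```python
# A hedged sketch of tokenization using OpenAI's tiktoken library.
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")
tokens = enc.encode("Generative AI is fun!")
print(len(tokens), tokens)                # token count and token ids
print([enc.decode([t]) for t in tokens])  # the pieces the text was split into
```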
Training
Training: The process of teaching a machine learning model by showing it examples and adjusting its parameters to minimize errors. Requires large datasets and significant computational resources.
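The core mechanism in miniature: a one-parameter model fitted by gradient descent on a squared error, with purely illustrative numbers:

```python
# A one-parameter "model" trained by gradient descent (illustrative only).
w = 0.0
for step in range(100):
    pred = w * 3.0                  # model output for input x = 3
    grad = 2 * (pred - 6.0) * 3.0   # d/dw of squared error against target 6
    w -= 0.05 * grad                # step against the gradient
print(round(w, 3))  # approaches 2.0, since 2 * 3 = 6
```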
Transformer
Model Architecture: A neural network architecture that uses self-attention mechanisms to process sequences. Powers both large language models and modern image/video generators. Key innovation: parallel processing of entire sequences.
U-Net
Model Architecture: A neural network architecture originally designed for image segmentation, widely used in diffusion models for the denoising process. Features skip connections between encoder and decoder layers.
Upscaling
Image Generation: Increasing the resolution of an image using AI to add detail and clarity. AI upscalers can generate plausible details that weren't in the original low-resolution image.
VAE (Variational Autoencoder)
Generative AI: A neural network architecture that learns to compress data into a latent space and reconstruct it. VAEs are used in diffusion models to work in compressed latent space rather than pixel space, improving efficiency.
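A minimal PyTorch sketch of the encode/decode round trip, including the reparameterization step, with toy sizes:

```python
# A minimal VAE round trip, assuming PyTorch; all sizes are toy values.
import torch
import torch.nn as nn

class TinyVAE(nn.Module):
    def __init__(self, dim=784, latent=16):
        super().__init__()
        self.enc = nn.Linear(dim, latent * 2)  # outputs mean and log-variance
        self.dec = nn.Linear(latent, dim)

    def forward(self, x):
        mu, logvar = self.enc(x).chunk(2, dim=-1)
        z = mu + torch.randn_like(mu) * (0.5 * logvar).exp()  # reparameterize
        return self.dec(z), mu, logvar

x = torch.rand(8, 784)
recon, mu, logvar = TinyVAE()(x)
print(recon.shape)  # (8, 784): reconstructed from a 16-dim latent
```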
Voice Cloning
Audio Generation: Creating a synthetic voice that mimics a specific person's voice characteristics. Requires training on audio samples of the target voice.
Watermarking
Safety: Embedding invisible or visible identifiers in AI-generated content to indicate its synthetic origin. Helps with authenticity verification and content provenance.