abstrakt
AI & Machine Learning Glossary

Comprehensive definitions of AI terms, from diffusion models to transformers. Understand the technology behind AI generation.

50 TERMS
11 CATEGORIES
A
2 terms

Artificial Intelligence (AI)

Core Concepts

A broad field of computer science focused on building systems capable of performing tasks that typically require human intelligence, such as visual perception, speech recognition, decision-making, and language translation.

Related:
Machine Learning, Deep Learning, Neural Network

Attention Mechanism

Model Architecture

A technique that allows models to focus on relevant parts of the input when generating each part of the output. Enables models to capture long-range dependencies in data.

Related:
Self-Attention, Cross-Attention, Transformer
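
As a rough illustration of how attention weights the input, the sketch below implements scaled dot-product attention, the variant used in Transformers, in plain Python. The function names and the list-of-lists layout are assumptions made for this example and do not come from any particular library.

```python
import math

def softmax(scores):
    """Turn raw scores into weights that are positive and sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention(queries, keys, values):
    """Scaled dot-product attention over lists of equal-length vectors."""
    dim = len(keys[0])
    outputs = []
    for q in queries:
        # Score each key by its similarity to the query.
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(dim) for k in keys]
        weights = softmax(scores)
        # The output is the weighted average of the value vectors.
        out = [sum(w * v[j] for w, v in zip(weights, values)) for j in range(len(values[0]))]
        outputs.append(out)
    return outputs

result = attention([[1.0, 0.0]], [[1.0, 0.0], [1.0, 0.0]], [[2.0], [4.0]])
```

Because both keys match the query equally, each value receives the same weight and the output is their average.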
B
2 terms

Batch Processing

Inference

Generating multiple outputs in a single request or processing multiple inputs together. More efficient than making individual requests for each generation.

Related:
Throughput, Queue
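
One way to picture batching is to group prompts into fixed-size chunks and send each chunk as a single request. The sketch below shows only the grouping step. `chunk_prompts` and the batch size are hypothetical, and how a batch is actually submitted depends on the API you use.

```python
def chunk_prompts(prompts, batch_size):
    """Split a list of prompts into consecutive batches of at most batch_size."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    return [prompts[i:i + batch_size] for i in range(0, len(prompts), batch_size)]

# Five prompts become three requests instead of five.
batches = chunk_prompts(["a cat", "a dog", "a fox", "an owl", "a bee"], batch_size=2)
```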

Bias

Safety

Systematic errors in AI outputs due to biases in training data or model design. Can result in unfair or inaccurate representations of certain groups or concepts.

Related:
Fairness, Training Data
C
5 terms

CFG Scale (Classifier-Free Guidance)

Prompting

A parameter that controls how closely the generated image follows the prompt. Higher values produce outputs more closely aligned with the prompt, but very high values can reduce diversity and degrade image quality.

Related:
Guidance Scale, Prompt Adherence
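
The effect of the scale can be sketched with the usual classifier-free guidance formula, which pushes the model's prediction away from its unconditional output and toward its prompt-conditioned one. The function name and the flat lists standing in for noise predictions are assumptions for illustration only.

```python
def apply_cfg(noise_uncond, noise_cond, scale):
    """Combine unconditional and prompt-conditioned noise predictions.

    scale = 0 ignores the prompt, scale = 1 uses the conditioned prediction
    as is, and larger values extrapolate further toward the prompt.
    """
    return [u + scale * (c - u) for u, c in zip(noise_uncond, noise_cond)]

guided = apply_cfg([0.0, 1.0], [1.0, 3.0], scale=2.0)
```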

Checkpoint

Training

A saved state of a model's weights at a point during or after training. Allows resuming training, comparing versions, or using the model at different stages of development.

Related:
Model Weights, Training

CLIP (Contrastive Language-Image Pre-training)

Model Architecture

A model trained by OpenAI that learns to associate images with text descriptions. Used in image generation to guide the creation process based on text prompts.

Related:
Text Encoder, Image Encoder, Embedding

Content Safety

Safety

Systems and practices to prevent AI from generating harmful, illegal, or inappropriate content. Includes filters, classifiers, and model training restrictions.

Related:
Moderation, Safety Filter

ControlNet

Image Generation

A neural network architecture that adds conditional control to diffusion models. Allows guiding image generation with additional inputs like edge maps, depth maps, poses, or sketches.

Related:
Conditioning, Depth Map, Pose Estimation
D
4 terms

DALL-E

Models

OpenAI's image generation model series. DALL-E 3 is known for excellent prompt understanding and safe, high-quality outputs. Available through OpenAI's API.

Related:
Text-to-Image, OpenAI

Deep Learning

Core Concepts

A subset of machine learning that uses neural networks with many layers (hence 'deep') to learn complex patterns in data. Powers most modern AI image, video, and audio generation.

Related:
Neural Network, Transformer, Diffusion Model

Diffusion Model

Generative AI

A type of generative model that learns to create images by reversing a gradual noising process. Starting from pure noise, the model iteratively removes noise to generate coherent images. Used by FLUX, Stable Diffusion, and DALL-E 3.

Related:
Denoising, Latent Space, FLUX
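
The gradual noising process that the model learns to reverse can be sketched as follows. This assumes a DDPM-style formulation with a linear noise schedule; the schedule values and function names are illustrative choices, not the settings of any model named above.

```python
import math
import random

def alpha_bar(t, num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Fraction of the original signal that survives after t noising steps."""
    product = 1.0
    for i in range(t):
        beta = beta_start + (beta_end - beta_start) * i / (num_steps - 1)
        product *= 1.0 - beta
    return product

def add_noise(x0, t, rng):
    """Sample the noised version of x0 at step t in one shot."""
    keep = alpha_bar(t)
    return [math.sqrt(keep) * x + math.sqrt(1.0 - keep) * rng.gauss(0.0, 1.0) for x in x0]

rng = random.Random(0)
clean = [1.0, -1.0]
slightly_noisy = add_noise(clean, 10, rng)
almost_pure_noise = add_noise(clean, 1000, rng)
```

At step 0 the data is untouched, and by the final step almost none of the original signal remains. Generation runs in the opposite direction, starting from noise and removing a little at each step.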

DreamBooth

Training

A fine-tuning technique that teaches a model to generate specific subjects (people, objects, styles) from just a few example images. Associates a unique identifier token with the subject.

Related:
Fine-tuning, Subject-Driven Generation
F
3 terms

Fine-tuning

Training

Adapting a pre-trained model to a specific task or domain by training it further on a smaller, targeted dataset. More efficient than training from scratch.

Related:
Transfer Learning, LoRA, DreamBooth

FLUX

Models

A family of high-quality image generation models by Black Forest Labs. Known for excellent prompt following, text rendering, and diverse outputs. Available in Schnell (fast), Dev (balanced), and Pro (quality) variants.

Related:
Diffusion Model, Text-to-Image

Frame Interpolation

Video Generation

Generating intermediate frames between existing video frames to increase frame rate or create smooth slow-motion effects. AI predicts the motion between frames.

Related:
Optical Flow, Motion Estimation
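
To show where the intermediate frames go, the sketch below inserts linearly blended frames between each pair of existing ones. This is a naive cross-fade baseline, not a motion-aware method: AI interpolators estimate how content moves between frames instead of averaging pixels. Frames are represented here as flat lists of pixel intensities, which is an assumption for brevity.

```python
def blend(frame_a, frame_b, t):
    """Linear mix of two frames; t=0 gives frame_a, t=1 gives frame_b."""
    return [(1 - t) * a + t * b for a, b in zip(frame_a, frame_b)]

def interpolate(frames, extra):
    """Insert `extra` blended frames between each consecutive pair (frames must be non-empty)."""
    out = []
    for a, b in zip(frames, frames[1:]):
        out.append(a)
        for k in range(1, extra + 1):
            out.append(blend(a, b, k / (extra + 1)))
    out.append(frames[-1])
    return out

# One extra frame per gap roughly doubles the frame rate.
doubled = interpolate([[0.0], [4.0], [8.0]], extra=1)
```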
G
2 terms

GAN (Generative Adversarial Network)

Generative AI

A generative model architecture where two neural networks (generator and discriminator) compete against each other. The generator creates fake samples while the discriminator tries to distinguish them from real ones.

Related:
Generator, Discriminator, Generative AI
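
The competition between the two networks can be expressed through their losses. The sketch below uses the standard binary cross-entropy objective with the common non-saturating generator loss; `d_real` and `d_fake` are assumed to be the discriminator's probability outputs (strictly between 0 and 1) for a real and a generated sample.

```python
import math

def discriminator_loss(d_real, d_fake):
    """Low when real samples score near 1 and generated samples near 0."""
    return -(math.log(d_real) + math.log(1.0 - d_fake))

def generator_loss(d_fake):
    """Low when the discriminator is fooled into scoring fakes near 1."""
    return -math.log(d_fake)

# A discriminator that cannot tell real from fake outputs 0.5 for both.
undecided = discriminator_loss(0.5, 0.5)
```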

Generative AI

Generative AI

AI systems that can create new content, including images, text, audio, and video. These models learn patterns from training data and generate novel outputs based on prompts or inputs.

Related:
Diffusion Model, GAN, Large Language Model
I
4 terms

Image-to-Image

Image Generation

AI generation that takes an existing image as input along with a prompt to create a modified or transformed version. Used for style transfer, editing, and variations.

Related:
Strength, ControlNet, Inpainting

Image-to-Video

Video Generation

Converting a static image into a video by generating motion and animation. The AI predicts plausible movement based on the image content.

Related:
Motion Generation, Animation

Inference

Inference

The process of using a trained model to generate outputs from new inputs. When you call an AI API to generate an image, you're running inference on the model.

Related:
Prediction, Model Serving

Inpainting

Image Generation

The process of filling in or replacing specific regions of an image while maintaining coherence with the surrounding content. Used for removing objects, editing specific areas, or extending images.

Related:
Mask, Outpainting, Image-to-Image
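
The role of the mask can be sketched as a simple compositing step: generated pixels are used only inside the masked region, and the original pixels are kept everywhere else. This shows only the final merge under the assumption of a binary mask; a real inpainting model also conditions its generation on the unmasked surroundings so the fill blends in.

```python
def composite(original, generated, mask):
    """Keep original pixels where mask is 0 and generated pixels where mask is 1."""
    return [
        [g if m else o for o, g, m in zip(o_row, g_row, m_row)]
        for o_row, g_row, m_row in zip(original, generated, mask)
    ]

original = [[1, 1], [1, 1]]
generated = [[9, 9], [9, 9]]
mask = [[0, 1], [0, 0]]  # only the top-right pixel is replaced
edited = composite(original, generated, mask)
```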
L
3 terms

Latency

Inference

The time between sending a request and receiving a response. For AI generation, latency includes queue time, inference time, and network transfer.

Related:
Response Time, TTFB
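
A minimal way to measure latency from the client side is to time the call, as sketched below. `timed_call` and `total_latency` are hypothetical helpers; from the client you normally see only the total, so the per-component breakdown assumes the service reports queue and inference times separately.

```python
import time

def timed_call(fn, *args, **kwargs):
    """Run fn and return its result together with the elapsed seconds."""
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    return result, time.perf_counter() - start

def total_latency(queue_s, inference_s, transfer_s):
    """End-to-end latency as the sum of its three components."""
    return queue_s + inference_s + transfer_s

result, elapsed = timed_call(sum, [1, 2, 3])
example_total = total_latency(queue_s=0.5, inference_s=2.0, transfer_s=0.25)
```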

Latent Space

Generative AI

A compressed representation of data learned by AI models. In image generation, manipulating points in latent space allows for smooth transitions between generated images and control over image attributes.

Related:
Embedding, VAE, Diffusion Model

LoRA (Low-Rank Adaptation)

Training

A technique for efficiently fine-tuning large models by training small adapter layers rather than all model weights. Produces small files that can be combined with base models.

Related:
Fine-tuning, Adapter, Model Customization
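
The reason LoRA files are small can be seen from the arithmetic. A weight matrix W is left frozen and the update is stored as two thin matrices B and A, so the adapted weight is W + (alpha / r) * B * A, where r is the rank. The sketch below uses plain nested lists; the helper names are assumptions for illustration.

```python
def matmul(x, y):
    """Multiply two matrices given as nested lists."""
    return [
        [sum(x[i][k] * y[k][j] for k in range(len(y))) for j in range(len(y[0]))]
        for i in range(len(x))
    ]

def merge_lora(weight, lora_b, lora_a, alpha):
    """Fold a low-rank update into the frozen base weight."""
    rank = len(lora_a)
    delta = matmul(lora_b, lora_a)
    scale = alpha / rank
    return [[w + scale * d for w, d in zip(w_row, d_row)] for w_row, d_row in zip(weight, delta)]

def adapter_params(d_out, d_in, rank):
    """Number of trainable values in B (d_out x rank) plus A (rank x d_in)."""
    return rank * (d_out + d_in)

merged = merge_lora([[1.0, 0.0], [0.0, 1.0]], [[1.0], [2.0]], [[3.0, 4.0]], alpha=1.0)
full_count = 4096 * 4096
lora_count = adapter_params(4096, 4096, rank=8)
```

For a 4096 by 4096 layer at rank 8, the adapter trains 65,536 values instead of 16,777,216.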
M
3 terms

Machine Learning (ML)

Core Concepts

A subset of AI that enables systems to learn and improve from experience without being explicitly programmed. ML algorithms build models based on training data to make predictions or decisions.

Related:
Training, Model, Inference

Midjourney

Models

A popular AI image generation service known for its artistic, stylized outputs. Accessed through Discord and its own web app. It does not offer an official public API.

Related:
Text-to-Image, Artistic Style

Music Generation

Audio Generation

AI systems that create original music compositions from prompts, existing audio, or musical notation. Can generate melodies, harmonies, and full arrangements.

Related:
Audio Generation, Sound Design
N
2 terms

Negative Prompt

Prompting

Text describing what you don't want in the generated output. Helps avoid common artifacts or unwanted elements. Supported by many diffusion models, such as Stable Diffusion, though not by all of them.

Related:
Prompt, CFG Scale

Neural Network

Core Concepts

A computing system inspired by biological neural networks. Consists of interconnected nodes (neurons) organized in layers that process information and learn patterns from data.

Related:
Deep Learning, Weights, Activation Function
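
A tiny forward pass makes the layered structure concrete. The sketch below wires two inputs through a hidden layer of two neurons with a ReLU activation and then a single output neuron. The weights are hand-picked for illustration; in practice they are learned from data.

```python
def relu(x):
    """Activation function that passes positive values and zeroes out the rest."""
    return max(0.0, x)

def dense(inputs, weights, biases, activation=None):
    """One layer: each neuron takes a weighted sum of the inputs plus a bias."""
    outputs = []
    for w_row, b in zip(weights, biases):
        z = sum(w * x for w, x in zip(w_row, inputs)) + b
        outputs.append(activation(z) if activation else z)
    return outputs

def forward(inputs):
    """Two inputs -> two hidden neurons (ReLU) -> one output."""
    hidden = dense(inputs, [[1.0, -1.0], [0.5, 0.5]], [0.0, 0.0], relu)
    return dense(hidden, [[1.0, 2.0]], [0.1])

prediction = forward([2.0, 1.0])
```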
O
1 term

Outpainting

Image Generation

Extending an image beyond its original boundaries by generating new content that seamlessly continues the existing image. Also called image extension or uncropping.

Related:
Inpainting, Image Extension
P
2 terms

Prompt

Prompting

The text input given to a generative AI model to guide what it creates. Effective prompting is key to getting desired results from AI generation.

Related:
Prompt Engineering, Negative Prompt

Prompt Engineering

Prompting

The practice of crafting effective prompts to achieve desired outputs from AI models. Involves understanding model behavior, using specific keywords, and iterative refinement.

Related:
Prompt, Token, Prompt Weighting
S
5 terms

Sampler

Inference

The algorithm that carries out the step-by-step denoising in diffusion models. Different samplers (Euler, DPM++, DDIM) offer trade-offs between speed, quality, and style.

Related:
Steps, Scheduler

Seed

Inference

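A number that initializes the random number generator, allowing reproducible generations. Using the same seed, prompt, and settings on the same model generally produces the same output, although results can still differ across hardware, software versions, or nondeterministic GPU operations.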

Related:
Reproducibility, Deterministic
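
The idea can be demonstrated with Python's standard `random` module: seeding a generator fixes the sequence of random numbers it produces. Here `sample_noise` is a hypothetical stand-in for the starting noise an image model would draw.

```python
import random

def sample_noise(seed, count=4):
    """Draw `count` Gaussian values from a generator initialized with `seed`."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(count)]

first = sample_noise(42)
repeat = sample_noise(42)   # same seed, same values
other = sample_noise(43)    # different seed, different values
```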

Speech-to-Text (STT)

Audio Generation

AI systems that transcribe spoken audio into written text. Also called automatic speech recognition (ASR). Used for transcription, subtitles, and voice interfaces.

Related:
Transcription, ASR

Stable Diffusion

Models

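An openly released latent diffusion model for image generation, originally developed by researchers at CompVis (LMU Munich) and Runway with support from Stability AI, which released later versions. One of the most widely used open models, with many community fine-tunes and extensions.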

Related:
Diffusion Model, Open Source

Steps (Inference Steps)

Inference

In diffusion models, the number of denoising iterations used to generate an image. More steps generally produce higher quality but take longer. Typical range: 20-50 steps.

Related:
Sampling, Denoising
T
7 terms

Temporal Consistency

Video Generation

The coherence of generated video across time: objects, lighting, and style remain consistent from frame to frame without flickering or sudden changes.

Related:
Frame Coherence, Video Stability

Text-to-Image

Image Generation

AI systems that generate images from text descriptions (prompts). Models like FLUX, DALL-E, and Midjourney interpret natural language to create corresponding visual content.

Related:
Prompt, Diffusion Model, CLIP

Text-to-Speech (TTS)

Audio Generation

AI systems that convert written text into natural-sounding speech. Modern TTS can clone voices, control emotion, and produce highly realistic audio.

Related:
Voice Cloning, Speech Synthesis

Text-to-Video

Video Generation

AI systems that generate video clips from text descriptions. These models create coherent motion and temporal consistency across frames while interpreting prompts.

Related:
Temporal Consistency, Frame Interpolation

Token

Prompting

The basic unit of text that AI models process. Words, parts of words, or punctuation marks are converted to tokens. Models have limits on the number of tokens they can process.

Related:
Tokenization, Context Length
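
A toy tokenizer shows how text becomes a sequence of units and how a token limit cuts it off. This sketch splits on words and punctuation only; real models use learned subword vocabularies, so their token counts will differ. Both function names are assumptions for illustration.

```python
import re

def tokenize(text):
    """Split text into word tokens and single punctuation tokens."""
    return re.findall(r"\w+|[^\w\s]", text)

def truncate(tokens, max_tokens):
    """Drop everything past the model's token limit."""
    return tokens[:max_tokens]

tokens = tokenize("A cat, sitting.")
clipped = truncate(tokens, max_tokens=3)
```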

Training

Training

The process of teaching a machine learning model by showing it examples and adjusting its parameters to minimize errors. Requires large datasets and significant computational resources.

Related:
Dataset, Loss Function, Optimization
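
The loop of showing examples and adjusting parameters can be sketched with the smallest possible model: a single weight `w` fitted so that `w * x` matches `y`, using gradient descent on the mean squared error. The learning rate and epoch count are arbitrary choices for this example.

```python
def train(xs, ys, learning_rate=0.01, epochs=200):
    """Fit y = w * x by repeatedly stepping against the error gradient."""
    w = 0.0
    for _ in range(epochs):
        # Gradient of the mean squared error with respect to w.
        grad = sum(2 * (w * x - y) * x for x, y in zip(xs, ys)) / len(xs)
        w -= learning_rate * grad
    return w

def mean_squared_error(w, xs, ys):
    """Average squared difference between predictions and targets."""
    return sum((w * x - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

xs, ys = [1.0, 2.0, 3.0], [2.0, 4.0, 6.0]
learned_w = train(xs, ys)
```

The data follows y = 2x, so the learned weight converges toward 2 and the error shrinks toward zero.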

Transformer

Model Architecture

A neural network architecture that uses self-attention mechanisms to process sequences. Powers both large language models and modern image/video generators. Key innovation: parallel processing of entire sequences.

Related:
Attention, Self-Attention, BERT, GPT
U
2 terms

U-Net

Model Architecture

A neural network architecture originally designed for image segmentation, widely used in diffusion models for the denoising process. Features skip connections between encoder and decoder layers.

Related:
Diffusion Model, Encoder-Decoder

Upscaling

Image Generation

Increasing the resolution of an image using AI to add detail and clarity. AI upscalers can generate plausible details that weren't in the original low-resolution image.

Related:
Super Resolution, Enhancement
V
2 terms

VAE (Variational Autoencoder)

Generative AI

A neural network architecture that learns to compress data into a latent space and reconstruct it. Latent diffusion models use a VAE so that denoising runs in a compressed latent space rather than pixel space, which improves efficiency.

Related:
Latent Space, Encoder, Decoder

Voice Cloning

Audio Generation

Creating a synthetic voice that mimics a specific person's voice characteristics. Requires audio samples of the target voice, either to fine-tune a model or, in zero-shot systems, as a short reference clip.

Related:
TTS, Voice Profile
W
1 term

Watermarking

Safety

Embedding invisible or visible identifiers in AI-generated content to indicate its synthetic origin. Helps with authenticity verification and content provenance.

Related:
Provenance, Detection
© 2026 abstrakt. All rights reserved.
