Products
APIs
Voice Agent API
For real-time AI Agents
Text to Speech API
Responsive, natural-sounding voices
Speech to Text API
Unmatched accuracy, speed & cost
Audio Intelligence API
Powered by AI Language models
Announcements
View more
Introducing Nova-3: Setting a New Standard for AI-Driven Speech-to-Text
Published on 02/12/25
Solutions
Use Cases
Contact Centers
Medical Transcription
Conversational AI
Speech Analytics
Media Transcription
Customers
Partners
Startup Program
Resources
Resources
Articles
Podcast
Resource Hub
AI Glossary
About
Careers
AI Apps
AI Minds
AI Voice Generator
Transcription Tool
Developers
Documentation
Tutorials
Status
Changelog
Self-Hosted Deployment
Help
Playground
Community
Enterprise
Pricing
Log In
Get a Demo
Sign Up Free
AI Glossary
Back to Glossary Home
AI and Medicine
Grounding
Probabilistic Models in Machine Learning
Knowledge Distillation
Inference Engine
Emergent Behavior
Double Descent
Bayesian Machine Learning
Batch Gradient Descent
Voice Cloning
Homograph Disambiguation
Grapheme-to-Phoneme Conversion (G2P)
Deep Learning
Articulatory Synthesis
AI Voice Agents
AI Agents
Text-to-Speech Models
Neural Text-to-Speech (NTTS)
Pooling (Machine Learning)
Pretraining
Machine Learning in Algorithmic Trading
Test Data Set
Bias-Variance Tradeoff
Learning Rate
Logits
Inductive Bias
Continuous Learning Systems
Supervised Learning
Autoregressive Model
Auto Classification
Hidden Layer
Multitask Prompt Tuning
Multi-task Learning
Machine Learning Neuron
Semi-Supervised Learning
Rectified Linear Unit (ReLU)
Validation Data Set
Incremental Learning
Diffusion
Clustering Algorithms
Few Shot Learning
Machine Learning Life Cycle Management
Named Entity Recognition
AI Robustness
Information Retrieval
Augmented Intelligence
Collaborative Filtering
Cognitive Architectures
AI Prototyping
AI and Big Data
AI Scalability
AI Literacy
Machine Learning Bias
Image Recognition
AI Resilience
Synthetic Data for AI Training
Objective Function
Data Drift
Self-healing AI
Spike Neural Networks
Human-centered AI
Federated Learning
Uncertainty in Machine Learning
Parametric Neural Networks
Limited Memory AI
Naive Bayes Classifier
AI Transparency
Human-in-the-Loop AI
Machine Learning Preprocessing
AI Privacy
Multi-Agent Systems
Generative Teaching Networks
AI Interpretability
AI Regulation
Human Augmentation with AI
Feature Store for Machine Learning
Decision Intelligence
Chatbots
Quantum Machine Learning Algorithms
Computational Phenotyping
Counterfactual Explanations in AI
Context-Aware Computing
Instruction Tuning
AI Simulation
Ethical AI
AI Oversight
AI Safety
Symbolic AI
AI Guardrails
Composite AI
Gradient Clipping
Generative Adversarial Networks (GANs)
Rule-Based AI
AI Assistants
Activation Functions
Dall-E
Prompt Engineering
Hyperparameters
AI and Education
Chess bots
Midjourney (Image Generation)
DistilBERT
Mistral
XLNet
Benchmarking
Llama 2
Sentiment Analysis
LLM Collection
ChatGPT
Mixture of Experts
Latent Dirichlet Allocation (LDA)
RoBERTa
RLHF
Multimodal AI
Transformers
Winnow Algorithm
k-Shingles
Flajolet-Martin Algorithm
CURE Algorithm
Online Gradient Descent
Zero-shot Classification Models
Curse of Dimensionality
Backpropagation
Dimensionality Reduction
Multimodal Learning
Gaussian Processes
AI Voice Transfer
Gated Recurrent Unit
Prompt Chaining
Approximate Dynamic Programming
Adversarial Machine Learning
Deep Reinforcement Learning
Speech-to-text models
Feedforward Neural Network
BERT
Gradient Boosting Machines (GBMs)
Retrieval-Augmented Generation (RAG)
Perceptron
Overfitting and Underfitting
Machine Learning
Large Language Model (LLM)
Graphics Processing Unit (GPU)
Diffusion Models
Classification
Tensor Processing Unit (TPU)
Natural Language Processing (NLP)
Google's Bard
OpenAI Whisper
Sequence Modeling
Precision and Recall
Semantic Kernel
Fine Tuning in Deep Learning
Gradient Scaling
AlphaGo Zero
Cognitive Map
Keyphrase Extraction
Multimodal AI Models and Modalities
Hidden Markov Models (HMMs)
AI Hardware
Natural Language Generation (NLG)
Natural Language Understanding (NLU)
Tokenization
Word Embeddings
AI and Finance
AlphaGo
AI Recommendation Algorithms
Binary Classification AI
AI Generated Music
Neuralink
AI Video Generation
OpenAI Sora
Hooke-Jeeves Algorithm
Mamba
Central Processing Unit (CPU)
Generative AI
Representation Learning
AI in Customer Service
Conditional Variational Autoencoders
Conversational AI
Packages
Models
Fundamentals
Datasets
Techniques
AI Lifecycle Management
AI Monitoring
Machine Translation
MLOps
Monte Carlo Learning
Principal Component Analysis
Reproducibility in Machine Learning
Restricted Boltzmann Machines
Support Vector Machines (SVM)
Topic Modeling
Vanishing and Exploding Gradients
Data Labeling
F1 Score in Machine Learning
Expectation Maximization
Beam Search Algorithm
Embedding Layer
Differential Privacy
Data Poisoning
Causal Inference
Capsule Neural Network
Attention Mechanisms
Domain Adaptation
Evolutionary Algorithms
Contrastive Learning
Explainable AI
Affective AI
Semantic Networks
Data Augmentation
Convolutional Neural Networks
Cognitive Computing
End-to-end Learning
Prompt Tuning
Model Drift
Neural Radiance Fields
Regularization
Natural Language Querying (NLQ)
Foundation Models
Forward Propagation
F2 Score
AI Ethics
Transfer Learning
AI Alignment
Whisper v3
Whisper v2
Semi-structured data
AI Hallucinations
Matplotlib
NumPy
Scikit-learn
SciPy
Keras
TensorFlow
Seaborn Python Package
PyTorch
Natural Language Toolkit (NLTK)
Pandas
Ego 4D
The Pile
Common Crawl Datasets
SQuAD
Intelligent Document Processing
Hyperparameter Tuning
Markov Decision Process
Graph Neural Networks
Neural Architecture Search
Ablation
Model Interpretability
Out-of-Distribution Detection
Recurrent Neural Networks
Active Learning (Machine Learning)
Imbalanced Data
Loss Function
Unsupervised Learning
AdaGrad
Acoustic Models
Concatenative Synthesis
Candidate Sampling
Computational Creativity
AI Emotion Recognition
Knowledge Representation and Reasoning
Metacognitive Learning Models
AI Speech Enhancement
Eco-friendly AI
Metaheuristic Algorithms
Statistical Relational Learning
Deepfake Detection
One-Shot Learning
Semantic Search Algorithms
Artificial Super Intelligence
Computational Linguistics
Computational Semantics
Part-of-Speech Tagging
Random Forest
Neural Style Transfer
Neuroevolution
Association Rule Learning
Autoencoder
Data Scarcity
Decision Tree
Ensemble Learning
Entropy in Machine Learning
Corpus in NLP
Confirmation Bias in Machine Learning
Confidence Intervals in Machine Learning
Cross Validation in Machine Learning
Accuracy in Machine Learning
Clustering in Machine Learning
Boosting in Machine Learning
Epoch in Machine Learning
Feature Learning
Feature Selection
Genetic Algorithms in AI
Ground Truth in Machine Learning
Hybrid AI
AI Detection
AI Standards
AI Steering
ImageNet
Learning To Rank
Applications
#
A
B
C
D
E
F
G
H
I
K
L
M
N
O
P
Q
R
S
T
U
V
W
X
Z
AI Glossary Categories
Datasets
Fundamentals
Models
Packages
Techniques
Categories
Alphabetical
Alphabetical
Alphabetical
A
AI and Medicine
Articulatory Synthesis
AI Voice Agents
AI Agents
Autoregressive Model
Auto Classification
AI Robustness
Augmented Intelligence
AI Prototyping
AI and Big Data
AI Scalability
AI Literacy
AI Resilience
AI Transparency
AI Privacy
AI Interpretability
AI Regulation
AI Simulation
AI Oversight
AI Safety
AI Guardrails
AI Assistants
Activation Functions
AI and Education
AI Voice Transfer
Approximate Dynamic Programming
Adversarial Machine Learning
AlphaGo Zero
AI Hardware
AI and Finance
AlphaGo
AI Recommendation Algorithms
AI Generated Music
AI Video Generation
AI in Customer Service
AI Lifecycle Management
AI Monitoring
Attention Mechanisms
Affective AI
AI Ethics
AI Alignment
AI Hallucinations
Ablation
Active Learning (Machine Learning)
AdaGrad
Acoustic Models
AI Emotion Recognition
AI Speech Enhancement
Artificial Super Intelligence
Association Rule Learning
Autoencoder
Accuracy in Machine Learning
AI Detection
AI Standards
AI Steering
Applications
B
Bayesian Machine Learning
Batch Gradient Descent
Bias-Variance Tradeoff
Benchmarking
Backpropagation
BERT
Binary Classification AI
Beam Search Algorithm
Boosting in Machine Learning
C
Continuous Learning Systems
Clustering Algorithms
Collaborative Filtering
Cognitive Architectures
Chatbots
Computational Phenotyping
Counterfactual Explanations in AI
Context-Aware Computing
Composite AI
Chess bots
ChatGPT
CURE Algorithm
Curse of Dimensionality
Classification
Cognitive Map
Central Processing Unit (CPU)
Conditional Variational Autoencoders
Conversational AI
Causal Inference
Capsule Neural Network
Contrastive Learning
Convolutional Neural Networks
Cognitive Computing
Common Crawl Datasets
Concatenative Synthesis
Candidate Sampling
Computational Creativity
Computational Linguistics
Computational Semantics
Corpus in NLP
Confirmation Bias in Machine Learning
Confidence Intervals in Machine Learning
Cross Validation in Machine Learning
Clustering in Machine Learning
D
Double Descent
Deep Learning
Diffusion
Data Drift
Decision Intelligence
Dall-E
DistilBERT
Dimensionality Reduction
Deep Reinforcement Learning
Diffusion Models
Datasets
Data Labeling
Differential Privacy
Data Poisoning
Domain Adaptation
Data Augmentation
Deepfake Detection
Data Scarcity
Decision Tree
E
Emergent Behavior
Ethical AI
Expectation Maximization
Embedding Layer
Evolutionary Algorithms
Explainable AI
End-to-end Learning
Ego 4D
Eco-friendly AI
Ensemble Learning
Entropy in Machine Learning
Epoch in Machine Learning
F
Few Shot Learning
Federated Learning
Feature Store for Machine Learning
Flajolet-Martin Algorithm
Feedforward Neural Network
Fine Tuning in Deep Learning
Fundamentals
F1 Score in Machine Learning
Foundation Models
Forward Propagation
F2 Score
Feature Learning
Feature Selection
G
Grounding
Grapheme-to-Phoneme Conversion (G2P)
Generative Teaching Networks
Gradient Clipping
Generative Adversarial Networks (GANs)
Gaussian Processes
Gated Recurrent Unit
Gradient Boosting Machines (GBMs)
Graphics Processing Unit (GPU)
Google's Bard
Gradient Scaling
Generative AI
Graph Neural Networks
Genetic Algorithms in AI
Ground Truth in Machine Learning
H
Homograph Disambiguation
Hidden Layer
Human-centered AI
Human-in-the-Loop AI
Human Augmentation with AI
Hyperparameters
Hidden Markov Models (HMMs)
Hooke-Jeeves Algorithm
Hyperparameter Tuning
Hybrid AI
I
Inference Engine
Inductive Bias
Incremental Learning
Information Retrieval
Image Recognition
Instruction Tuning
Intelligent Document Processing
Imbalanced Data
ImageNet
K
Knowledge Distillation
k-Shingles
Keyphrase Extraction
Keras
Knowledge Representation and Reasoning
L
Learning Rate
Logits
Limited Memory AI
Llama 2
LLM Collection
Latent Dirichlet Allocation (LDA)
Large Language Model (LLM)
Loss Function
Learning To Rank
M
Machine Learning in Algorithmic Trading
Multitask Prompt Tuning
Multi-task Learning
Machine Learning Neuron
Machine Learning Life Cycle Management
Machine Learning Bias
Machine Learning Preprocessing
Multi-Agent Systems
Midjourney (Image Generation)
Mistral
Mixture of Experts
Multimodal AI
Multimodal Learning
Machine Learning
Multimodal AI Models and Modalities
Mamba
Models
Machine Translation
MLOps
Monte Carlo Learning
Model Drift
Matplotlib
Markov Decision Process
Model Interpretability
Metacognitive Learning Models
Metaheuristic Algorithms
N
Neural Text-to-Speech (NTTS)
Named Entity Recognition
Naive Bayes Classifier
Natural Language Processing (NLP)
Natural Language Generation (NLG)
Natural Language Understanding (NLU)
Neuralink
Neural Radiance Fields
Natural Language Querying (NLQ)
NumPy
Natural Language Toolkit (NLTK)
Neural Architecture Search
Neural Style Transfer
Neuroevolution
O
Objective Function
Online Gradient Descent
Overfitting and Underfitting
OpenAI Whisper
OpenAI Sora
Out-of-Distribution Detection
One-Shot Learning
P
Probabilistic Models in Machine Learning
Pooling (Machine Learning)
Pretraining
Parametric Neural Networks
Prompt Engineering
Prompt Chaining
Perceptron
Precision and Recall
Packages
Principal Component Analysis
Prompt Tuning
PyTorch
Pandas
Part-of-Speech Tagging
Q
Quantum Machine Learning Algorithms
R
Rectified Linear Unit (ReLU)
Rule-Based AI
RoBERTa
RLHF
Retrieval-Augmented Generation (RAG)
Representation Learning
Reproducibility in Machine Learning
Restricted Boltzmann Machines
Regularization
Recurrent Neural Networks
Random Forest
S
Supervised Learning
Semi-Supervised Learning
Synthetic Data for AI Training
Self-healing AI
Spike Neural Networks
Symbolic AI
Sentiment Analysis
Speech-to-text models
Sequence Modeling
Semantic Kernel
Support Vector Machines (SVM)
Semantic Networks
Semi-structured data
Scikit-learn
SciPy
Seaborn Python Package
SQuAD
Statistical Relational Learning
Semantic Search Algorithms
T
Text-to-Speech Models
Test Data Set
Transformers
Tensor Processing Unit (TPU)
Tokenization
Techniques
Topic Modeling
Transfer Learning
TensorFlow
The Pile
U
Uncertainty in Machine Learning
Unsupervised Learning
V
Voice Cloning
Validation Data Set
Vanishing and Exploding Gradients
W
Winnow Algorithm
Word Embeddings
Whisper v3
Whisper v2
X
XLNet
Z
Zero-shot Classification Models