Reference

AI Voice Agent Glossary

Clear definitions for every term you'll encounter when evaluating, deploying, or scaling AI voice agents across your organization.

A

AI Voice Agent

An artificial intelligence system that conducts human-like phone conversations autonomously, without requiring a live human operator. AI voice agents can answer questions, qualify leads, schedule appointments, and escalate complex issues — operating 24/7 at scale across any industry.

Read definition →

AHT (Average Handle Time)

A workforce management metric measuring the average duration of a customer interaction from initial contact to call termination. AHT typically includes talk time plus after-call work (ACW) such as note-taking and follow-up preparation. Lower AHT often correlates with efficiency, but balancing it against quality and customer satisfaction is essential.

Read definition →

Agentic AI

AI systems capable of taking multi-step, goal-directed actions without continuous human intervention. Agentic AI can browse systems, execute workflows, update CRMs, send follow-up messages, and make decisions across multiple tools — going far beyond simple question-and-answer interactions.

Read definition →

API (Application Programming Interface)

A set of defined rules and protocols that allow different software applications to communicate with each other. In AI voice agent contexts, APIs connect the voice platform to CRMs, scheduling tools, databases, and other business systems to exchange data in real time during a call.

Read definition →

ASA (Average Speed of Answer)

A customer service metric that measures the average time between when a call enters a queue and when an agent answers it. ASA is a key performance indicator in contact centers, with targets often set between 20 and 45 seconds. Lower ASA indicates better responsiveness and typically improves customer satisfaction.

Read definition →

ASR (Automatic Speech Recognition)

The technology that converts spoken audio into machine-readable text in real time. Also called Speech-to-Text (STT), ASR is the first stage of every AI voice agent pipeline — its accuracy directly determines how well downstream intent detection and response generation perform.

Read definition →

ACD (Automatic Call Distributor)

A telephony system that automatically routes incoming calls to the appropriate agent or team based on predefined rules, skills, or availability. Modern ACDs form the backbone of contact center infrastructure, queuing calls, managing hold times, and distributing workload to maximize efficiency and customer satisfaction.

Read definition →

ATDS (Automatic Telephone Dialing System)

A technology system that automatically dials a list of phone numbers and delivers pre-recorded messages or connects callers to available agents. ATDS systems are regulated by the TCPA and require explicit consent; they are commonly used for outbound campaigns, appointment reminders, and debt collection.

Read definition →

Abandonment Rate

A contact center metric that measures the percentage of inbound calls that are disconnected or abandoned before an agent answers them. High abandonment rates indicate insufficient staffing or long wait times, directly impacting customer satisfaction and revenue, particularly in inbound sales and customer service environments.

Read definition →

C

Compliance

Adherence to regulatory frameworks governing how AI systems collect, store, and process data. For AI voice agents, compliance typically covers HIPAA (healthcare), TCPA (telemarketing), GDPR (EU data privacy), and PCI-DSS (payment data) — each imposing rules on call recording, consent, and data retention.

Read definition →

CSAT (Customer Satisfaction Score)

A metric measuring customer satisfaction with a specific interaction or service on a numerical scale, typically 1–5 or 1–10. CSAT surveys are usually administered immediately after a call or transaction. Organizations track CSAT as a key operational metric to identify service gaps and measure the impact of agent training or AI agent improvements.

Read definition →

Conversational AI

A category of artificial intelligence designed to simulate natural human dialogue through text or voice interfaces. Conversational AI combines NLP, intent detection, and response generation to create interactions that feel less like navigating a menu and more like speaking with a knowledgeable person.

Read definition →

CRM Integration

The connection between an AI voice platform and a Customer Relationship Management system such as Salesforce, HubSpot, or Zoho. CRM integration allows the agent to look up caller records, log call outcomes, update contact fields, and trigger follow-up workflows automatically during or after each call.

Read definition →

Call Deflection

A contact center strategy that routes incoming calls away from live agent queues toward self-service channels — including IVR, chatbots, or AI voice agents — to reduce agent workload and operational costs. Effective call deflection improves throughput and reduces average handle time without sacrificing customer satisfaction if the self-service option is high-quality.

Read definition →

CCaaS (Contact Center as a Service)

A cloud-based contact center platform delivered via subscription, eliminating the need for on-premise hardware and infrastructure. CCaaS solutions like Zendesk, Twilio, and Amazon Connect provide call routing, IVR, analytics, and integration APIs, enabling organizations to rapidly deploy and scale customer service operations.

Read definition →

E

EHR (Electronic Health Record)

A digital version of a patient's complete medical history maintained by a healthcare provider or health system. EHRs consolidate clinical notes, medications, lab results, and imaging studies, allowing clinical staff quick access to patient information during consultations. AI voice agents can integrate with EHR systems to confirm appointments, deliver test results, and triage patient concerns.

Read definition →

EMR (Electronic Medical Record)

A digital record of a patient's medical history maintained by a specific healthcare provider or clinical practice. Unlike EHRs, EMRs are typically limited to one practice and not designed for interoperability across organizations. AI voice agents can integrate with EMR systems to retrieve appointment history, medication information, and previous visit summaries.

Read definition →

Erlang C

A mathematical formula used in contact centers to calculate the number of agents required to handle an expected volume of calls at a target service level. Erlang C accounts for call arrival rate, average handle time (AHT), and acceptable wait time, helping workforce management teams right-size staffing to meet demand without overstaffing.

Read definition →

Escalation

The process of transferring a customer interaction from one channel, agent, or system to another — typically when an issue is too complex or sensitive for the current handler to resolve. AI voice agents recognize escalation triggers and smoothly route callers to a human agent, ensuring critical issues receive appropriate human judgment and empathy.

Read definition →

P

PHI (Protected Health Information)

Any information in a medical record or health plan that can be used to identify an individual patient — including name, date of birth, Social Security number, insurance details, and health conditions. HIPAA mandates strict controls on PHI collection, storage, and access. AI voice agents handling healthcare calls must encrypt PHI, audit access logs, and comply with Business Associate Agreement (BAA) obligations.

Read definition →

Power Dialer

An automated dialing technology that dials the next number on an outbound list as soon as an agent becomes available, eliminating idle time between calls. Power dialers present a live caller to the agent within one to two rings, keeping agents continuously engaged and maximizing calls per hour — though requiring careful compliance management to avoid TCPA violations.

Read definition →

Preview Dialer

An outbound dialing system that displays a prospect's profile, previous interaction history, and call notes to an agent before the call is placed. The agent reviews the information and then manually initiates the call, allowing for more personalized and informed conversations. Preview dialers are commonly used in sales and collection environments where context and timing matter.

Read definition →

Predictive Dialer

An automated outbound calling system that uses algorithms to predict agent availability and dials multiple numbers simultaneously, connecting only the live calls to available agents. Predictive dialers maximize call connect rates and agent efficiency but carry regulatory risk — the system must not trigger TCPA violations by creating excessive abandoned calls.

Read definition →

Progressive Dialer

An automated dialing system that dials one number per available agent, waiting for the call to connect before dialing the next number. Progressive dialers are less aggressive than predictive dialers and carry lower TCPA compliance risk because they dial one-to-one with agent availability, resulting in fewer abandoned calls.

Read definition →

S

STT (Speech-to-Text)

The technology that converts spoken audio into written text transcripts. Also called Automatic Speech Recognition (ASR), STT is the first stage of every voice AI pipeline — accuracy here directly determines how well the downstream intent detection and response generation stages perform.

Read definition →

Schedule Adherence

A workforce management metric measuring how closely an agent adheres to their assigned schedule — tracking whether they clock in on time, take breaks only during assigned windows, and avoid unscheduled absences. High schedule adherence ensures consistent coverage, prevents wait time spikes, and is often tied to agent performance reviews and incentive programs.

Read definition →

Shrinkage

A workforce management term referring to any time an agent is being paid but not available to handle customer interactions — including breaks, lunch, training, coaching, meetings, and sick time. Shrinkage is typically expressed as a percentage of scheduled hours and must be factored into staffing models to ensure adequate coverage for customer demand.

Read definition →

SIP (Session Initiation Protocol)

A network protocol used to initiate, maintain, and terminate voice, video, and messaging sessions across internet networks. SIP is the backbone of modern VoIP and cloud telephony systems — including many AI voice agent platforms. It handles signaling (call setup and teardown) separately from media transmission, enabling flexible and scalable voice infrastructure.

Read definition →

Softphone

A software application installed on a computer or mobile device that enables phone calls over the internet (VoIP) without requiring a traditional desk phone. Softphones support voice, video, messaging, and screen sharing, and are essential tools for remote contact center agents, field technicians, and distributed teams working with cloud-based telephony platforms.

Read definition →

STIR/SHAKEN

A framework and set of protocols designed to authenticate the origin of phone calls and prevent spoofing and illegal robocalls. STIR (Secure Telephone Identity Revisited) provides cryptographic verification of a calling number, while SHAKEN (Signature Handling for Authenticated Identity Establishment) defines the implementation standards. Compliance is mandatory for telecom carriers and increasingly expected of outbound calling platforms.

Read definition →