Heading 1

Heading 2

Heading 3

Heading 4

Heading 5

Heading 6

Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur.

Block quote

Ordered list

Item 1
Item 2
Item 3

Unordered list

Item A
Item B
Item C

Text link

Bold text

Emphasis

^Superscript

_Subscript

Pricing

Request a demo

Get started

Speech-To-Text

Call center transcription software: what enterprises should look for in 2026

TL;DR: Most contact centers evaluate transcription software using clean-audio lab benchmarks, then watch QA automation break down when BPO (Business Process Outsourcing) agents switch languages mid-call or phone-line noise degrades the signal. In 2026, the criteria that matter are real-world multilingual WER, all-inclusive per-hour pricing, and data sovereignty that holds up under GDPR and HIPAA audit. For enterprise teams, the highest-ROI evaluation step is testing on real BPO call samples rather than vendor demo audio, and asking every shortlisted provider for an all-in per-hour price with diarization, sentiment, and entity extraction enabled.

Speech-To-Text

PII redaction for call recordings: how ingestion-level redaction keeps calls PCI compliant

TL;DR: Legacy pause-and-resume systems don't remove agents, local desktops, or telephony infrastructure from PCI DSS audit scope. Automated, ingestion-level PII redaction scrubs sensitive data before it reaches any database. By removing cardholder data at the ingestion layer, contact center platforms using automated redaction can potentially reduce audit complexity, cut agent handle time (AHT), and protect downstream CRM and LLM pipelines from corrupt data. The accuracy floor for reliable entity detection in PCI audits is significantly higher than for standard QA transcription, making STT model selection a compliance decision as much as a product one.

Speech-To-Text

GDPR, SOC 2, and ISO 27001 speech-to-text: the contact center compliance and certification guide

TL;DR: When your contact center routes voice data through a transcription vendor, every certification gap in that vendor's stack becomes your compliance liability. Voice recordings qualify as personal data under GDPR Article 4, and processing them through uncertified APIs creates direct financial exposure. This guide breaks down what GDPR, SOC 2 Type II, ISO 27001, HIPAA, and PCI DSS each require of your audio infrastructure vendor and maps those requirements to the QA coverage rates and cost-per-contact metrics you manage daily. We hold GDPR, SOC 2 Type II, ISO 27001, HIPAA, and PCI DSS certifications, and never use customer audio for model training on Growth or Enterprise plan.

Data residency for voice and transcription data: EU, US, and AI compliance

Published on July 3, 2026

by Ani Ghazaryan

TL;DR: Storing call recordings in an EU S3 bucket does not make your voice pipeline compliant if a US-based transcription API processes those files during inference. Data residency, data sovereignty, and AI processing represent distinct compliance considerations. This article maps those legal distinctions and the cross-border risks introduced by Business Process Outsourcing (BPO) access and AI model training defaults. It also covers how our EU-hosted infrastructure with configurable residency addresses those risks at the pipeline level, without inflating cost-per-contact or degrading Average Handle Time (AHT).

Contact centers that have adopted AI transcription to automate QA and agent coaching have quietly inherited a compliance problem that their legal teams may not have caught yet. The audio file resting in Frankfurt is compliant by storage rules. The moment it leaves for inference on a US GPU cluster, it is not.

This guide breaks down what data residency actually means for voice and transcript data, where it diverges from data sovereignty, and what compliance looks like across the full AI pipeline from raw audio capture to CRM writeback.

Defining data residency for operations leaders

Vendor contracts, DPAs, and regulatory audits use three terms constantly, and you cannot treat them as interchangeable.

Concept	Definition	Key driver	Legal impact
Data residency	Where data is physically stored	Geography of the data center	Affects which regulator can inspect or seize it
Data sovereignty	Which country's laws govern the data	Country of the data controller or processor	Affects which legal standards apply to collection, processing, and transfer
Data localization	A legal requirement that data cannot leave a jurisdiction	Government mandate	Prohibits foreign processing, replication, or backup

‍

As Alation's residency guide explains, data residency concerns only the geographic location of stored data, while data localization is stricter and prohibits data from leaving a designated jurisdiction at all, including for backups, replication, or processing abroad. Oracle's sovereignty overview reinforces the distinction: data sovereignty refers to which country's laws govern the data, regardless of where it physically sits.

For contact center operations, this distinction is directly operational. A vendor can offer EU data residency (storage in Frankfurt) while potentially routing audio through US-based inference infrastructure during transcription. Your data sits in Europe but may process in Virginia for 300 milliseconds of GPU compute. That 300 milliseconds can be a GDPR event.

Recordings vs. transcripts as separate compliance objects

Operations leads often treat recordings and transcripts as the same compliance unit, but they carry different risk profiles. Raw audio files are typically large and access is often limited to QA sampling, while transcripts are structured text, enriched with named entities, timestamps, and speaker labels, making them queryable and potentially dense with PII. A financial services call can generate a transcript containing searchable personal information such as names, account numbers, and other sensitive details. The AI transcription legality guide covers why this distinction matters for regulated industries handling both file types under the same compliance framework.

Distinguishing residency from data sovereignty

The cloud creates a compliance paradox: data can be stored in Europe and processed in the United States during the fraction of a second an AI model runs inference. Many operations leads discover this gap during legal review of their vendor Data Processing Agreements (DPAs).

The €1.2 billion lesson

In May 2023, the Irish Data Protection Authority issued Meta Platforms Ireland Limited a €1.2 billion GDPR fine (approximately $1.3 billion USD) following an EDPB binding decision. According to GDPR Local, the violation concerned transfers of personal data from European Facebook users to the United States on the basis of standard contractual clauses after the invalidation of Privacy Shield. Under GDPR, fines for the most serious violations reach 4% of global annual revenue, so for a $200M ARR CCaaS platform that ceiling sits at $8 million before any remediation costs.

A compliance violation at this scale does not just trigger fines. It destroys CSAT through service disruption during remediation, inflates cost-per-contact when vendors are replaced mid-contract, and erodes the operational credibility that CX leads depend on in executive reviews.

The operational impact of regional routing

Routing audio to regional servers rather than global load balancers can introduce measurable latency. Based on voice AI pipeline research, regional routing decisions affect network overhead. For async post-call transcription workflows feeding Quality Assurance (QA) scoring, this overhead is typically invisible to AHT. For real-time live-assist use cases, latency considerations become critical to maintaining natural interaction flow. Regional residency must be configured at infrastructure level rather than added as a routing afterthought, and for contact centers tracking cost-per-contact as a primary key performance indicator (KPI), residency architecture that adds no latency overhead to AHT and no premium to per-hour infrastructure costs is the only sound compliance path.

Compliance rules for processing voice records

GDPR and Schrems II

Under GDPR Articles 44 through 49, cross-border transfers are governed by specific conditions under which personal data can leave the European Economic Area. The 2020 Schrems II ruling by the Court of Justice of the EU invalidated the EU-US Privacy Shield and established that transfers to the US may require supplementary measures. As IAPP's Schrems II analysis notes, data transferred to or stored in the US can potentially be accessed by US intelligence agencies, making such transfers a GDPR risk that SCCs alone may not fully resolve without additional technical controls. Controllers must assess whether the receiving jurisdiction's laws undermine the SCCs they have in place, as EY's Schrems II analysis details.

Global mandates beyond GDPR

Contact centers with BPO operations in Asia-Pacific or Latin America face additional obligations:

Australia (APRA CPS 234**):**The Australian Prudential Regulation Authority's (APRA) CPS 234 standard sets two distinct notification thresholds: material incidents must be reported to APRA within 72 hours of becoming aware; material information security control weaknesses must be reported within 10 business days. CPS 230 (effective 1 July 2025) added requirements for "material service providers", with entities required to maintain annual registers of material service providers submitted to APRA.
Brazil (LGPD): The Lei Geral de Proteção de Dados (LGPD) is aligned with GDPR principles and includes requirements for lawful basis and transfer restrictions.
Canada (PIPEDA): The Personal Information Protection and Electronic Documents Act (PIPEDA) generally does not mandate in-country storage but may require comparable protection in any jurisdiction where data is processed.
Japan (APPI): The Act on the Protection of Personal Information (APPI) requires consent or adequacy confirmation before cross-border transfers, with stricter enforcement following 2022 amendments.
South Africa (POPIA): The Protection of Personal Information Act (POPIA) prohibits transfers to countries without adequate protection, enforced by the Information Regulator since July 2021.
UAE (PDPL): The Personal Data Protection Law (PDPL) includes transfer restrictions for sensitive data.

Navigating voice data residency: EU vs. US rules

The EU approach: centralized and extraterritorial

GDPR applies to any organization processing EU residents' data, regardless of where the organization is based. For example, a contact center headquartered in Texas that handles calls from German customers would process EU personal data and would need to comply. The framework is mature, consistently enforced, and supported by binding EDPB decisions that provide clear guidance on transfer mechanisms.

The US approach: fragmented and state-level

The US does not have a single comprehensive federal equivalent of GDPR. Instead, a patchwork of state laws governs voice and biometric data:

Illinois (BIPA): One of the strictest US biometric laws, enacted in 2008. It requires written consent before collecting voiceprints, mandates published retention schedules, prohibits sale of biometric data, and provides a private right of action, with statutory damages of approximately $1,000 per negligent violation and $5,000 per intentional violation, plus attorney fees.
Texas (CUBI): Requires informed consent before capturing biometric identifiers for commercial purposes, enforced by the Texas Attorney General, with penalties up to $25,000 per violation.
Washington: Requires notice and consent before enrolling biometric identifiers in commercial databases, with Attorney General enforcement.
Numerous additional states have passed comprehensive privacy laws classifying biometric data as sensitive, as documented by PrivacyLawMap's 2026 analysis, requiring opt-in consent or an honored opt-out before processing voiceprint data.

The BPO access paradox

This is a compliance gap that many Request for Proposal (RFP) processes miss. An EU-resident audio file accessed by a QA analyst in Manila or Bangalore can constitute a cross-border data transfer under GDPR. The European Data Protection Board (EDPB) guidance indicates that making personal data available to another party in a third country can trigger transfer rules. Allowing offshore BPO agents to access EU-resident call recordings via a QA dashboard may satisfy the conditions for a cross-border transfer.

Per GDPR Article 44 and the EDPB's guidance on what constitutes a transfer, making data accessible to a party in a third country can qualify, even when access is read-only. A playback dashboard routing EU-stored audio to an offshore QA agent can potentially constitute a transfer regardless of where the file is stored.

The practical controls that satisfy this requirement include:

Role-Based Access Controls (RBAC) that restrict audio playback to in-region agents
Virtual Desktop Infrastructure (VDI) configured to block local file downloads by offshore analysts
Transfer mechanisms such as Standard Contractual Clauses (SCCs) with documented assessments of the destination country's surveillance laws

For operations teams, implementing these controls requires careful workforce management planning alongside the technical configuration work.

Securing call data during AI model processing

Contact center technology stacks often separate recording infrastructure from transcription and enrichment, meaning multiple vendors and different data handling policies can sit between raw audio and a Customer Relationship Management (CRM) entry.

Mapping residency requirements at each pipeline stage

The concurrent voice AI pipeline architecture maps the full sequence: raw audio capture, ASR transcription, LLM inference, and downstream writeback. Each stage carries its own residency obligation:

Raw audio capture and regional storage: Ideally, the file should land in a compliant region immediately, as temporary storage in a non-compliant region during ingestion can potentially constitute a transfer even if the file is subsequently moved.
GPU inference (transcription): Your transcription provider should process the audio on infrastructure located in the same jurisdiction as storage. Routing EU audio to a US GPU cluster for inference can create a GDPR cross-border transfer regardless of how quickly processing completes.
Transcript generation and enrichment: The output text, including named entity extraction, sentiment scores, and speaker labels, contains personal data under GDPR definitions and should be stored in a compliant region consistent with your data handling policies.
LLM processing: If your transcript feeds an LLM for summarization or QA scoring, your LLM provider should run inference in the same compliant region or use appropriate transfer mechanisms.
CRM writeback: The final structured data entering your CRM should be transmitted via infrastructure that maintains your compliance posture.

AI provider residency status as of 2026

Provider	GPU inference region	At-rest storage region	Notes
Gladia	EU or US configurable	EU or US configurable	Configurable via API
OpenAI	US or Europe (eligible Enterprise/Edu)	Multiple regions available	In-region GPU reportedly launched Jan 16, 2026 for eligible ChatGPT Enterprise/Edu/Healthcare. Standard API customers may not receive in-region inference automatically.
Deepgram	EU endpoint (reportedly GA Jan 10, 2026) or US	EU or US	EU endpoint reportedly reached general availability in January 2026
AssemblyAI	EU endpoint or US	EU or US	EU endpoint reportedly at `api.eu.assemblyai.com` guarantees data does not leave the EU

‍

The EU AI Act and call analytics

The EU AI Act may classify call analytics systems that score agent performance as high-risk, potentially requiring transparency, human oversight, and audit logging. Our sentiment analysis operates on transcript text rather than audio waveforms, making it text-based sentiment inference rather than acoustic emotion detection, and the distinction may matter under biometric categorization provisions.

How Gladia handles data residency and compliance

We operate EU and US processing clusters. As documented in our documents, we use European infrastructure based in France by default to respect GDPR constraints. Developers can specify the region in the API call to route processing and storage to the appropriate cluster, so EU customer audio from intake to transcript can remain on EU infrastructure unless explicitly configured otherwise.

Regional residency configuration does not introduce a per-hour cost premium on our Growth and Enterprise plans. The table below shows the all-in cost model across deployment types:

Deployment model	Per-hour cost (Growth plan)	Cost-per-contact (5-min avg call)	Notes
EU residency	$0.20/hr async	~$0.017 per call	Configurable via API
US residency	$0.20/hr async	~$0.017 per call	Configurable via API
On-premises / air-gapped	Custom	Custom	Enterprise only

‍

The compliance guarantee is built into the base rate. Diarization, sentiment analysis, named entity recognition, and translation are all included at the same per-hour price, so regional residency configuration does not introduce additional per-hour costs.

Data training policy by plan

This is the clause most operations leads never find until a legal audit:

Starter plan: Customer data can be used for model training by default on this tier.
Growth and Enterprise plans: Customer data is never used for model training. No opt-out action is required and there is no contract clause to locate. This is the default behavior on paid plans.

Our OpenAI STT API comparison covers how these defaults compare against other providers, where training opt-outs are often enterprise-contract provisions rather than default behavior.

Compliance certifications

We hold SOC 2 Type II and ISO 27001 certifications and comply with HIPAA and GDPR requirements. For contact centers in financial services or healthcare, these certifications represent important procurement considerations. Full documentation is available at our compliance hub.

PII redaction

Our PII redaction feature replaces sensitive entities in transcripts with placeholder labels such as [NAME] or [PHONE_NUMBER]. This feature must be explicitly enabled in the API call and is not active by default.

On-premises and air-gapped deployment

For organizations with strict data sovereignty requirements that exceed cloud residency guarantees, we offer on-premises and air-gapped hosting. This removes the cloud jurisdiction question entirely because inference runs inside your own infrastructure perimeter. The data retention documentation covers retention periods and deletion controls that apply across deployment types.

Migration checklist for moving to a resident region

If your current stack routes EU voice data through non-compliant transcription infrastructure, the migration involves five phases:

Audit phase: Map every vendor in your pipeline, identify processing locations for each stage, and document any DPA gaps where transfer mechanisms are absent or rely on invalidated frameworks.
Mapping phase: Identify which API calls route audio outside the target residency region, confirm LLM inference locations for summarization or QA scoring workflows, and document CRM writeback endpoints.
Migration phase: Re-point transcription API calls to EU endpoints, update storage buckets to EU regions, reconfigure QA dashboards with RBAC controls restricting BPO access to in-region sessions, and update webhook configuration and CRM writeback patterns to route through EU infrastructure.
Validation phase: Run test calls through the migrated pipeline, verify transcript metadata shows EU processing locations, measure transcription quality against your pre-migration baseline accuracy metrics, and document evidence for DPA compliance records.
Operational validation: Measure AHT, First Call Resolution (FCR), and cost-per-contact before and after migration to confirm that compliance architecture does not degrade service delivery. Migration timelines vary based on organization size, existing infrastructure complexity, and integration requirements.

For European contact center audio, Solaria-3 ranks #1 against AssemblyAI, ElevenLabs, Deepgram, Mistral, and Speechmatics, demonstrating that regional residency can be implemented without compromising transcription quality.

FAQs

Does data residency apply to transcripts or just audio recordings?

Both are subject to data residency requirements. Transcripts often contain denser structured Personally Identifiable Information (PII) than recordings (names, account numbers, credit card data in searchable plaintext) and should be processed and stored using appropriate transfer mechanisms and safeguards under GDPR and equivalent frameworks.

Can I send EU voice data to a US transcription API if I have SCCs in place?

SCCs do not eliminate GDPR transfer risk after Schrems II, and a compliant Transfer Impact Assessment requires careful evaluation. Route EU audio to EU-based inference infrastructure where the transfer question never arises.

Does BPO access to EU-resident audio recordings count as a data transfer?

Yes, because the EDPB defines making personal data accessible to a party in a third country as a transfer. The standard technical controls are role-based access controls that restrict audio playback to in-region agents and VDI configurations that prevent offshore analysts from downloading files to local devices.

How does AI model training default behavior affect compliance?

If a transcription vendor's terms allow customer audio to be used for model retraining by default, that audio is processed for a secondary purpose without explicit consent, which creates a GDPR lawful basis problem on top of the residency issue. On our Growth and Enterprise plans, customer audio is never used for model training and no opt-out is required, while on the Starter plan, data can be used for training by default.

What is the difference between data residency and data localization?

Data residency requires data to be stored in a specific geographic location but may allow processing or copies elsewhere, while data localization is stricter and prohibits data from leaving a designated jurisdiction at all, including for inference, backup, or replication. China's PIPL and some provisions of EU sectoral regulations impose localization requirements rather than residency requirements.

Does Gladia offer on-premises deployment for air-gapped environments?

Yes. We offer on-premises and air-gapped hosting for organizations where cloud residency guarantees are insufficient, removing the cloud jurisdiction question entirely because inference runs inside the customer's own infrastructure. Contact our enterprise team via the compliance hub for deployment specifications.

Key terms glossary

Standard Contractual Clauses (SCCs): Pre-approved contractual language issued by the European Commission that allows personal data to be transferred from the EEA to third countries. Post-Schrems II, SCCs are a necessary but not sufficient transfer mechanism; controllers must also complete a Transfer Impact Assessment confirming that the destination country's surveillance laws do not undermine the SCCs' protections.

Transfer Impact Assessment (TIA): A documented legal and technical analysis required when relying on SCCs or other Article 46 GDPR transfer mechanisms. The TIA evaluates whether the laws and practices of the destination country, including government access powers, allow the SCCs to be effective in practice. A TIA is not optional; it is required before each cross-border transfer under post-Schrems II guidance.

Data Processing Agreement (DPA): A binding contract between a data controller and a data processor, required under GDPR Article 28 whenever a processor handles personal data on the controller's behalf. For contact centers, a DPA must be in place with every vendor that touches call audio or transcripts — including transcription APIs, QA platforms, and CRM providers. The DPA must specify processing purposes, data categories, retention periods, and sub-processor obligations.

Biometric identifier: Any measurable physical or behavioral characteristic used to identify an individual. Under US state laws such as Illinois BIPA and Texas CUBI, voiceprints derived from call recordings can qualify as biometric identifiers, triggering written consent, retention schedule, and deletion requirements that sit separately from GDPR obligations. Whether a transcription provider's model training pipeline constitutes biometric data collection is a live compliance question in several jurisdictions.

Sub-processor: A third party engaged by a data processor to carry out specific processing activities on behalf of the controller. Under GDPR, a processor must obtain prior written authorization from the controller before engaging sub-processors, and all sub-processors must be bound by data protection obligations equivalent to those in the main DPA. In a voice AI pipeline, GPU cloud providers, LLM inference vendors, and storage providers typically qualify as sub-processors.

Contact us

Your request has been registered

A problem occurred while submitting the form.

Speech-To-Text

Call center transcription software: what enterprises should look for in 2026

Speech-To-Text

PII redaction for call recordings: how ingestion-level redaction keeps calls PCI compliant

Speech-To-Text

GDPR, SOC 2, and ISO 27001 speech-to-text: the contact center compliance and certification guide

From audio to knowledge

Subscribe to receive latest news, product updates and curated AI content.

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

GDPR Compliant

HIPAA Compliant

AICPA SOC Type 2

ISO 27001 Compliant

Gladia

Become the Speech AI expert in your organization with content from Gladia right in your inbox, no more than twice a month.

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

By continuing your navigation, you apply the use of cookies intended to improve the performance and the functionalities of this site.

No, thanks

From audio to knowledge

Subscribe to receive latest news, product updates and curated AI content.

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

Heading 1

Heading 2

Heading 3

Heading 4

Heading 5

Heading 6

Read more

Call center transcription software: what enterprises should look for in 2026

PII redaction for call recordings: how ingestion-level redaction keeps calls PCI compliant

GDPR, SOC 2, and ISO 27001 speech-to-text: the contact center compliance and certification guide

Data residency for voice and transcription data: EU, US, and AI compliance

Defining data residency for operations leaders

Recordings vs. transcripts as separate compliance objects

Distinguishing residency from data sovereignty

The €1.2 billion lesson

The operational impact of regional routing

Compliance rules for processing voice records

GDPR and Schrems II

Global mandates beyond GDPR

Navigating voice data residency: EU vs. US rules

The EU approach: centralized and extraterritorial

The US approach: fragmented and state-level

The BPO access paradox

Securing call data during AI model processing

Mapping residency requirements at each pipeline stage

AI provider residency status as of 2026

The EU AI Act and call analytics

How Gladia handles data residency and compliance

Data training policy by plan

Compliance certifications

PII redaction

On-premises and air-gapped deployment

Migration checklist for moving to a resident region

FAQs

Does data residency apply to transcripts or just audio recordings?

Can I send EU voice data to a US transcription API if I have SCCs in place?

Does BPO access to EU-resident audio recordings count as a data transfer?

How does AI model training default behavior affect compliance?

What is the difference between data residency and data localization?

Does Gladia offer on-premises deployment for air-gapped environments?

Key terms glossary

Contact us

Read more

From audio to knowledge

Subscribe to receive latest news, product updates and curated AI content.

Gladia

Newsletter

From audio to knowledge

Subscribe to receive latest news, product updates and curated AI content.