How to Configure Agent Personas & Voice Libraries | Thoughtly

Last updated June 2026

How to Configure Agent Personas and Voice Libraries in Thoughtly

Your AI voice agent's persona — the way it speaks, listens, and represents your brand — is the difference between a lead that stays on the line and one that hangs up in the first ten seconds. Thoughtly gives you two levers for this: the advanced prompt (which shapes personality and guardrails) and the voice library (which controls how the agent actually sounds). This guide walks through configuring both, from writing an effective persona prompt to browsing, cloning, and assigning voices that match your caller population.

If you're deploying agents for high-volume consumer lead conversion — insurance quotes, mortgage inquiries, education enrollment, healthcare scheduling — voice and persona choices directly impact contact rates and conversion. A clipped, robotic voice with no persona framing will tank your answer rate before the agent ever gets to qualify the lead.

What you'll need

A Thoughtly workspace with at least one agent created in the Agent Builder
Access to the Agent Builder (Settings panel) for your agent
If using Bring Your Own Key (BYOK): an ElevenLabs Starter-or-above plan or a Cartesia account
If cloning a voice: a quiet recording environment and a browser with microphone access (Chrome recommended)
A clear picture of your target caller — language, accent, typical conversation tone, and brand voice

Step 1: Configure the persona with the advanced prompt

The advanced prompt is the overarching instruction your agent considers before anything else. It lives in Settings → Advanced settings inside the Agent Builder. Think of it as the agent's character sheet — not a routing document.

The advanced prompt defines persona, tone, and high-level guardrails. It does not control call flow. Routing logic belongs in Speak nodes and Outcomes. If you put navigation instructions in the advanced prompt, the agent will follow them inconsistently, and you'll spend hours debugging why it sometimes transfers and sometimes doesn't.

What to include

Persona identity: who the agent is (e.g., "a friendly insurance intake specialist")
Tone guidance: conversational, confident, concise, empathetic
Length preference: short sentences, no monologues
Guardrails: topics to avoid (medical advice, legal advice, pricing commitments)
Recovery behavior: what to do when the caller seems confused (summarize, ask one clarifying question)

Example advanced prompt for a mortgage intake agent

You are a friendly, knowledgeable mortgage intake specialist. Speak in short, confident sentences. Be warm but efficient — callers are often anxious about timelines. If a caller expresses confusion, summarize what you understand and ask one clarifying question. Never quote specific interest rates or fees. If asked about pricing, say "Your loan officer will walk through rates and options that fit your situation." Avoid giving tax, legal, or financial advice.

Keep the prompt to 3–6 lines. Long prompts dilute the signal — the agent starts treating everything as equally important. After a few test calls, revisit the prompt to tune tone, not logic.
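
The length and scope guidance above can be checked mechanically before a draft is pasted into the Agent Builder. The sketch below is a hypothetical local helper, not a Thoughtly feature: the function name `check_persona_prompt`, the rule of counting non-empty lines (which assumes one instruction per line), and the `ROUTING_HINTS` phrase list are all assumptions made for illustration.

```python
# Hypothetical phrases that suggest call-flow instructions; tune this list for your own agents.
ROUTING_HINTS = ("transfer to", "go to node", "route to", "then hang up")


def check_persona_prompt(prompt: str, min_lines: int = 3, max_lines: int = 6) -> list[str]:
    """Return warnings for a draft advanced prompt; an empty list means nothing was flagged."""
    warnings = []

    # Count non-empty lines, assuming the draft is written one instruction per line.
    lines = [line for line in prompt.splitlines() if line.strip()]
    if not min_lines <= len(lines) <= max_lines:
        warnings.append(f"prompt has {len(lines)} lines; aim for {min_lines}-{max_lines}")

    # Flag wording that looks like routing logic, which belongs in Speak nodes and Outcomes.
    lowered = prompt.lower()
    for hint in ROUTING_HINTS:
        if hint in lowered:
            warnings.append(f"possible routing instruction: '{hint}'")
    return warnings


draft = (
    "You are a friendly mortgage intake specialist.\n"
    "Speak in short, confident sentences.\n"
    "Never quote specific interest rates or fees.\n"
    "If the caller is upset, transfer to a manager."
)
for warning in check_persona_prompt(draft):
    print(warning)
```

A check like this only catches obvious phrasing. Test calls remain the real measure of whether the prompt is doing persona work rather than routing work.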

Advanced prompts with variables

Advanced prompts can reference variables that are filled at runtime — contact attributes, automation payloads, CRM fields, or channel context. This lets you adapt persona behavior based on what you already know about the caller. For example, if the contact's state is known from a CRM field, the prompt can instruct the agent to acknowledge relevant state-specific information without hard-coding it.
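
To make the runtime substitution concrete, here is a minimal sketch of how a templated prompt could be filled from contact attributes. The `{{name}}` placeholder syntax, the `contact_state` variable name, and the `render_prompt` helper are assumptions for illustration only; check the Agent Builder for the variable syntax Thoughtly actually uses.

```python
import re

# Assumed placeholder form: {{variable_name}}, with optional spaces inside the braces.
PLACEHOLDER = re.compile(r"\{\{\s*(\w+)\s*\}\}")


def render_prompt(template: str, values: dict[str, str], default: str = "") -> str:
    """Replace each placeholder with its runtime value; unknown names become `default`."""
    return PLACEHOLDER.sub(lambda match: values.get(match.group(1), default), template)


template = (
    "You are a friendly insurance intake specialist. "
    "The caller lives in {{contact_state}}."
)
print(render_prompt(template, {"contact_state": "Texas"}))
```

Choosing a sensible `default` matters: a prompt that renders as "The caller lives in ." reads oddly, so a fallback such as "an unknown state" may be safer when a CRM field is empty.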

Step 2: Choose a voice from the voice library

Voice selection happens inside the Agent Builder. Click the Voice field in the right-hand sidebar to open the Voice Selector panel. The panel has two tabs: Saved (your workspace's saved voices, shown by default) and Explore (the full catalog from Cartesia and ElevenLabs).

Browsing the Saved tab

The Saved tab shows voices your workspace has already saved, grouped by language. The active voice's language group is pinned to the top. Each voice row shows the avatar, name, platform (Cartesia or ElevenLabs), gender, and cost multiplier. Click Play to preview. Click the voice row to assign it to your agent.
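
The grouping behavior described above can be modeled in a few lines, which is handy if you keep your own inventory of saved voices outside Thoughtly. This is only a sketch of the described layout: the dictionary fields, the function name, and the alphabetical ordering of the non-active groups are assumptions, not documented behavior.

```python
def group_saved_voices(voices: list[dict], active_language: str) -> dict[str, list[dict]]:
    """Group voices by language, with the active voice's language group first."""
    groups: dict[str, list[dict]] = {}
    for voice in voices:
        groups.setdefault(voice["language"], []).append(voice)

    # False sorts before True, so the active language comes first;
    # the remaining groups are ordered alphabetically (an assumption).
    ordered = sorted(groups, key=lambda lang: (lang != active_language, lang))
    return {lang: groups[lang] for lang in ordered}


saved = [
    {"name": "Ana", "language": "Spanish", "platform": "Cartesia"},
    {"name": "Ben", "language": "English", "platform": "ElevenLabs"},
    {"name": "Chloe", "language": "French", "platform": "Cartesia"},
    {"name": "Luz", "language": "Spanish", "platform": "ElevenLabs"},
]
for language, members in group_saved_voices(saved, "Spanish").items():
    print(language, [voice["name"] for voice in members])
```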

Exploring the full catalog

The Explore tab gives you access to thousands of professional voices. The Expressive toggle is enabled by default, prioritizing low-latency voices optimized for real-time conversations. Toggle it off to browse the full catalog.

Filter chips let you narrow results:

Expressive: prioritizes low-latency, expressive voices for real-time calls
Gender: Male or Female
Language: 15 languages with flag icons — English, Spanish, French, German, Portuguese, Italian, Japanese, Korean, Chinese, Hindi, Arabic, Polish, Dutch, Russian, Turkish

Filters combine. For example, enable Expressive + Female + Spanish to find low-latency female voices in Spanish. You can also click the search icon to search by voice name or keyword.
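
"Filters combine" means every active filter must match, a logical AND. The sketch below models that rule against a small in-memory catalog. The `Voice` fields, the `filter_voices` function, and the sample voice names are hypothetical; they are not part of any Thoughtly, Cartesia, or ElevenLabs API.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Voice:
    name: str
    language: str
    gender: str
    expressive: bool


def filter_voices(
    voices: list[Voice],
    *,
    expressive: bool = True,          # mirrors the Expressive toggle being on by default
    gender: Optional[str] = None,
    language: Optional[str] = None,
    query: str = "",
) -> list[Voice]:
    """Keep only the voices that satisfy every active filter (logical AND)."""
    needle = query.lower()
    return [
        voice for voice in voices
        if (not expressive or voice.expressive)
        and (gender is None or voice.gender == gender)
        and (language is None or voice.language == language)
        and needle in voice.name.lower()
    ]


catalog = [
    Voice("Lucia", "Spanish", "Female", True),
    Voice("Marta", "Spanish", "Female", False),
    Voice("Diego", "Spanish", "Male", True),
    Voice("Emma", "English", "Female", True),
]
matches = filter_voices(catalog, gender="Female", language="Spanish")
print([voice.name for voice in matches])
```

Turning `expressive` off widens the result set, just as toggling Expressive off in the panel exposes the full catalog.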

Saving and assigning

Click anywhere on a voice row to save it to your workspace library and assign it to the current agent in one step. If you want to save a voice without assigning it yet, click the Bookmark icon. The voice appears in your Saved tab for future use across any agent.

If your Saved tab is empty after connecting BYOK, your ElevenLabs account may not have any cloned or generated voices yet. Use the Explore tab to save voices, or clone a new voice from the Saved tab.

Step 3: Connect your own voice provider key (BYOK)

Bring Your Own Key (BYOK) lets you connect your own ElevenLabs API key to Thoughtly. Instead of using shared platform credentials, your agents use your provider account directly. This gives you access to your personal voices, cloned voices, and provider-specific pricing.

Connecting ElevenLabs

Go to Settings → Integrations in the Thoughtly dashboard
Find the ElevenLabs card and click Connect
Paste your ElevenLabs API key (from your ElevenLabs dashboard under Profile + API key)
Thoughtly validates the key against your ElevenLabs account before proceeding
Once connected, Thoughtly imports your cloned and generated voices into your workspace library. They appear in the Saved tab tagged with Your ElevenLabs.

A paid ElevenLabs plan (Starter or above) is required. Free ElevenLabs API keys cannot be connected because the validation step rejects them.

What changes after connecting

A Your ElevenLabs badge appears in the Voice Selector header
The Saved tab shows voices linked to your ElevenLabs account. Built-in default voices are hidden.
Voice cloning creates clones in your ElevenLabs account rather than using Thoughtly's shared account
Cartesia voices remain available regardless of BYOK status
Concurrency is managed by ElevenLabs based on your plan. If agents exceed the limit, calls may fall back to a default voice.

To disconnect, go to Settings → Integrations and remove the ElevenLabs connection. This removes all BYOK-linked voices from your workspace library. Agents using those voices fall back to the default voice. Cartesia voices are not affected.
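
Because disconnecting silently moves agents to the default voice, it helps to list the affected agents first. The following sketch models the documented fallback against your own records. The data shapes, the `"byok"` source label, the `DEFAULT_VOICE` name, and the sample voice and agent names are all assumptions for illustration; nothing here calls Thoughtly.

```python
DEFAULT_VOICE = "default"  # placeholder name for whatever voice the workspace falls back to


def disconnect_byok(library: dict[str, str], assignments: dict[str, str]):
    """Model a BYOK disconnect.

    library maps voice name -> source ("byok" or "cartesia");
    assignments maps agent name -> assigned voice name.
    Returns (remaining library, updated assignments, agents that need reassigning).
    """
    # BYOK-linked voices are removed; Cartesia voices are not affected.
    remaining = {name: source for name, source in library.items() if source != "byok"}

    # Agents whose voice disappeared fall back to the default voice.
    updated = {
        agent: (voice if voice in remaining else DEFAULT_VOICE)
        for agent, voice in assignments.items()
    }
    affected = sorted(agent for agent, voice in assignments.items() if voice not in remaining)
    return remaining, updated, affected


library = {"Sarah, Sales US": "byok", "Example Cartesia Voice": "cartesia"}
assignments = {"mortgage-intake": "Sarah, Sales US", "scheduler": "Example Cartesia Voice"}
_, _, needs_reassignment = disconnect_byok(library, assignments)
print("Reassign before disconnecting:", needs_reassignment)
```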

Step 4: Clone a custom voice

When the public library doesn't have a voice that fits your brand, Thoughtly lets you clone a custom voice directly from the Saved tab. The Clone Voice modal supports both browser microphone recording and audio file upload.

Requirements

Recording: up to 20 seconds (stops automatically). Record the full duration for best results.
Upload: no length limit. Longer samples can improve quality.
Supported formats: any browser-compatible audio format (WAV, MP3, etc.)
Browser permissions: microphone access required for recording

Cloning steps

Open your agent → click the Voice field → in the Saved tab, click the Clone a voice banner
Enter a descriptive Voice Name (e.g., "Sarah, Sales US")
Select Language and Gender for the voice
Click Record Voice to record via browser mic, or Upload Audio File to use an existing sample
Review the sample with the audio player. Click Discard Recording to start over if needed.
Click Clone. Processing takes 30–60 seconds.
Click Done. The voice appears in your Saved tab, ready to assign to any agent.

If your workspace uses BYOK, cloned voices are created in your own provider account and tagged accordingly. This gives you full ownership and portability of your cloned voices.

Recording best practices

Record in a quiet environment with minimal background noise and echo
Use an external microphone for better quality than built-in laptop mics
Maintain a consistent speaking volume throughout the recording
Speak in a conversational style matching the tone you want agents to use
Include full sentences with natural pauses and varied intonation
Record the full 20 seconds — longer samples produce better clones
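
If you plan to upload a file rather than record in the browser, two of these practices (sample length and consistent volume) can be checked locally first. The sketch below uses only the Python standard library and assumes a 16-bit PCM WAV file. The `inspect_sample` helper and its output fields are illustrative, and the choice of per-second RMS as a loudness measure is an assumption, not a Thoughtly requirement.

```python
import array
import math
import sys
import wave


def inspect_sample(source, target_seconds: float = 20.0) -> dict:
    """Report duration and per-second loudness for a 16-bit PCM WAV path or file object."""
    with wave.open(source, "rb") as wav:
        if wav.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM audio")
        rate = wav.getframerate()
        channels = wav.getnchannels()
        samples = array.array("h", wav.readframes(wav.getnframes()))
    if sys.byteorder == "big":
        samples.byteswap()  # WAV sample data is little-endian

    window = rate * channels  # number of samples in one second of audio
    rms_per_second = []
    for start in range(0, len(samples), window):
        chunk = samples[start:start + window]
        rms_per_second.append(math.sqrt(sum(s * s for s in chunk) / len(chunk)))

    duration = len(samples) / window
    return {
        "duration_s": duration,
        "full_length": duration >= target_seconds,
        "rms_per_second": rms_per_second,
    }
```

Large swings between the per-second RMS values suggest uneven speaking volume, and a value near zero for a full second usually means a long silence. Neither check replaces listening to the sample before you clone it.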

Managing cloned voices

Cloned voices are scoped to your workspace. They are private (not shared with other workspaces), team-wide (all workspace members can use them), and persistent until manually deleted. A single cloned voice can be assigned to unlimited agents simultaneously.

To remove a cloned voice: in the Saved tab, click the filled Bookmark icon to unsave it from your library. Deleting or unsaving a voice that is currently assigned to agents will cause those agents to fall back to the default voice.

Only clone voices from individuals who have explicitly authorized their voice to be used for AI synthesis. Keep written records of voice-use permissions.

Step 5: Optimize voice for natural speech

After assigning a voice, tune it for your caller population. Thoughtly provides several controls under Settings → Presence and Voice Optimization.

Match accent to market

Pick a voice accent that matches your callers' region. This improves trust and comprehension.

Market	Recommended accent
North America	US English
Europe	UK English
ANZ	Australian English
Mexico	Mexican Spanish
Spain	Castilian Spanish
Argentina	Argentine Spanish
Brazil	Brazilian Portuguese
Portugal	European Portuguese

Set language explicitly

Each agent has a Language dropdown in the conversation settings sidebar. Pick the language, then assign a voice in that same language. Mismatched voice/language pairs cause noticeable pronunciation problems. To serve callers in multiple languages, create a separate agent per language with a matching voice and Genius knowledge base.
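
When a workspace grows to one agent per language, mismatches are easy to miss. A small audit over your own configuration notes can catch them. The `AgentConfig` record and `find_language_mismatches` function below are hypothetical and operate on data you maintain yourself, not on anything exported by Thoughtly.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AgentConfig:
    name: str
    language: str        # value chosen in the agent's Language dropdown
    voice_name: str
    voice_language: str  # language of the assigned voice


def find_language_mismatches(agents: list[AgentConfig]) -> list[str]:
    """Return the names of agents whose voice language differs from the agent language."""
    return [
        agent.name
        for agent in agents
        if agent.language.strip().lower() != agent.voice_language.strip().lower()
    ]


agents = [
    AgentConfig("intake-en", "English", "Sarah, Sales US", "English"),
    AgentConfig("intake-es", "Spanish", "Sarah, Sales US", "English"),
]
print(find_language_mismatches(agents))
```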

Adjust speaking speed (Cartesia)

Cartesia voices have a Voice Speed slider in the Presence section of the Agent Builder sidebar. The slider runs from 0 (slowest) to 100 (fastest), with 50 as the default. Small adjustments of 10 points are noticeable. The slider is disabled when a non-Cartesia voice is active.
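
The slider's rules (a 0 to 100 range, a default of 50, Cartesia only) are simple enough to capture in a helper if you track intended settings per agent in a script or spreadsheet export. This is a sketch of those rules only; `adjust_voice_speed` is a hypothetical function and does not change anything in Thoughtly.

```python
SPEED_MIN, SPEED_MAX, SPEED_DEFAULT = 0, 100, 50


def adjust_voice_speed(platform: str, current: int = SPEED_DEFAULT, delta: int = 0) -> int:
    """Return the new slider value, clamped to the 0-100 range."""
    if platform != "Cartesia":
        # The slider is disabled for non-Cartesia voices.
        raise ValueError("Voice Speed is only adjustable for Cartesia voices")
    return max(SPEED_MIN, min(SPEED_MAX, current + delta))


print(adjust_voice_speed("Cartesia", delta=10))     # one noticeable step faster than default
print(adjust_voice_speed("Cartesia", 95, delta=10)) # clamped at the top of the range
```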

Fix pronunciation issues

If your agent mispronounces names, brands, or industry terms, three options are available:

Phonetic spelling in prompts: write "Thought-lee" instead of "Thoughtly" in the relevant Speak node
Genius pronunciation guide: add entries to your Genius knowledge base with the correct spoken form
Voice cloning: clone a voice that naturally handles your domain vocabulary
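
For the first option, keeping phonetic spellings in one table makes them easy to apply consistently across many Speak node scripts. The sketch below is a hypothetical pre-processing step you would run on your own text before pasting it into Thoughtly; the `apply_phonetics` helper and the table contents are illustrative.

```python
import re

# Written form -> spoken form. Extend with your own brand and industry terms.
PHONETIC_SPELLINGS = {"Thoughtly": "Thought-lee"}


def apply_phonetics(text: str, table: dict[str, str]) -> str:
    """Replace whole-word occurrences of each written form with its phonetic spelling."""
    if not table:
        return text
    # Longest terms first, so a longer term wins over a shorter one it contains.
    terms = sorted(table, key=len, reverse=True)
    pattern = re.compile(r"\b(" + "|".join(re.escape(term) for term in terms) + r")\b")
    return pattern.sub(lambda match: table[match.group(1)], text)


print(apply_phonetics("Thanks for calling Thoughtly.", PHONETIC_SPELLINGS))
```

Matching is case-sensitive in this sketch, so add separate entries if a term appears with different capitalization.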

For numbers, IDs, or confirmation codes, enable Spell Numbers in the Speak node. This improves pronunciation of long numbers by reading them digit-by-digit.
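
To illustrate what digit-by-digit reading means, the sketch below converts a code into the sequence of words a caller would hear. It is not how Spell Numbers is implemented; `spell_digits` is a hypothetical helper, useful mainly for previewing how a confirmation code will sound or for writing the spoken form by hand in a script.

```python
DIGIT_WORDS = ("zero", "one", "two", "three", "four", "five", "six", "seven", "eight", "nine")


def spell_digits(value: str) -> str:
    """Spell each digit as a word and each letter on its own; skip separators."""
    spoken = []
    for ch in value:
        if ch in "0123456789":
            spoken.append(DIGIT_WORDS[int(ch)])
        elif ch.isalnum():
            spoken.append(ch.upper())
        # Spaces, hyphens, and other separators are dropped.
    return " ".join(spoken)


print(spell_digits("4071"))   # four zero seven one
print(spell_digits("A7-30"))  # A seven three zero
```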

Step 6: Test voice and persona before going live

Before deploying, use Thoughtly's built-in testing tools to validate that your persona prompt and voice selection sound right in real conversation.

Text chat (Test Agent): verify branching, variable usage, and prompt behavior without placing a call
Call Me: place a real call to your phone to test voice quality, pronunciation, interruption handling, and transfer timing
Sample metadata: pass test contact attributes to verify personalization in the advanced prompt

Text-based tests are isolated from live calls. For voice quality, interruption behavior, pronunciation, and transfer timing, always place a real test call before going live. Test with native speakers in your target market when possible.

Common mistakes

Putting routing logic in the advanced prompt. The advanced prompt is for persona and guardrails — not navigation. Routing belongs in Speak nodes and Outcomes.
Writing a 15-line advanced prompt. Long prompts dilute signal. Keep it to 3–6 lines. Revisit after test calls to tune tone, not logic.
Choosing a voice accent that doesn't match the caller population. A UK English voice calling Texas mortgage leads creates immediate friction.
Mismatching voice language and agent language. If the agent language is set to Spanish but the voice is English, pronunciation problems are guaranteed.
Not testing pronunciation of brand names, industry terms, and caller names. What reads fine in a prompt can sound wrong in synthesis.
Forgetting to save a voice from Explore before expecting it in Saved. Clicking Play previews the voice but doesn't save it. Click the row or Bookmark icon to save.
Cloning from a low-quality recording. Background noise, low bitrate, or mumbling produces poor clones. Use a quiet room and a good microphone.
Disconnecting BYOK without reassigning agents. Disconnecting removes all BYOK-linked voices. Agents fall back to the default voice — which may not match your brand.

Measuring success

After deploying, track these metrics to evaluate whether your persona and voice configuration is working:

Metric	What to look for	Where in Thoughtly
Answer rate (outbound)	Percentage of calls where the lead picks up. A jarring voice or off-putting persona drops this immediately.	Call history → filter by agent
Average call duration	Short inbound calls (under 30 seconds) may indicate the persona is confusing or the voice is unpleasant.	Call history → duration column
Transfer rate	Percentage of calls that result in a warm transfer. A well-configured persona should transfer qualified leads, not alienate them.	Call history → outcome filter
Voicemail detection accuracy	If the agent is leaving messages on live answers, the voice or persona may be causing confusion.	Call history → disposition filter
Conversion rate by agent	Compare agents with different voices or persona prompts. A/B test by voice to find what converts.	Analytics → agent comparison

Thoughtly's analytics dashboard provides call-level data. Review the first 50–100 calls after launch, listen to recordings, and adjust the advanced prompt, voice, or presence settings based on what you hear.
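
If you export call-level data for that review, the metrics in the table can be computed per agent with a short script. The sketch below assumes a hypothetical export shape (the `Call` fields) and makes one explicit choice: duration, transfer, and conversion rates are measured over answered calls only. Adjust the denominators if your reporting defines them differently.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Call:
    agent: str
    answered: bool
    duration_s: float
    transferred: bool
    converted: bool


def summarize_by_agent(calls: list[Call]) -> dict[str, dict[str, float]]:
    """Compute answer rate, average duration, transfer rate, and conversion rate per agent."""
    by_agent: dict[str, list[Call]] = defaultdict(list)
    for call in calls:
        by_agent[call.agent].append(call)

    report = {}
    for agent, rows in by_agent.items():
        answered = [call for call in rows if call.answered]
        n = len(answered)
        report[agent] = {
            "answer_rate": n / len(rows),
            # The remaining metrics use answered calls as the denominator (an assumption).
            "avg_duration_s": sum(c.duration_s for c in answered) / n if n else 0.0,
            "transfer_rate": sum(c.transferred for c in answered) / n if n else 0.0,
            "conversion_rate": sum(c.converted for c in answered) / n if n else 0.0,
        }
    return report


calls = [
    Call("voice-a", True, 60.0, True, True),
    Call("voice-a", True, 120.0, False, False),
    Call("voice-a", True, 30.0, False, False),
    Call("voice-a", False, 0.0, False, False),
    Call("voice-b", True, 90.0, True, False),
]
for agent, metrics in summarize_by_agent(calls).items():
    print(agent, metrics)
```

Comparing two agents that differ only in voice or persona prompt is the simplest way to run the A/B test described in the table. Keep the first 50 to 100 calls per variant before drawing conclusions.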

FAQ

Can I use different voices for different agents?

Yes. Each agent has its own Voice field in the Agent Builder sidebar. You can assign any saved voice to any agent, and a single voice can be assigned to unlimited agents simultaneously.

Do I need an ElevenLabs account to use voices?

No. Cartesia voices are available regardless of BYOK status. However, connecting your own ElevenLabs key (BYOK) gives you access to your personal ElevenLabs voice library, cloned voices, and provider-specific pricing. A small number of enterprise workspaces use Thoughtly-managed credentials instead.

Can I clone a voice in a language other than English?

Yes. Voice cloning supports English, Spanish, French, German, Portuguese, Italian, Japanese, Korean, Chinese, Hindi, Dutch, Polish, Russian, Swedish, and Turkish.

How long should the advanced prompt be?

Keep it to 3–6 lines. The advanced prompt defines persona, tone, and guardrails — not routing logic. Long prompts cause the agent to treat everything as equally important, which degrades performance.

What happens if I disconnect my ElevenLabs key?

Disconnecting removes all BYOK-linked voices from your workspace library. Agents that were using those voices fall back to the default voice. Cartesia voices are not affected. Reconnecting restores your ElevenLabs voices.

Sources and further reading

Thoughtly docs: Agent Voices — browse, preview, and assign voices

Thoughtly docs: Voice Optimization — accents, speed, and pronunciation

Thoughtly docs: Bring Your Own Key (BYOK) — connect your ElevenLabs account

Thoughtly docs: Voice Cloning — create custom voice clones

Thoughtly docs: Agent Settings — advanced prompt, presence, and post-call configuration

Thoughtly docs: Testing Voice Agents — Test Agent and Call Me tools

Thoughtly product: Voice Library — studio library, cloning, and A/B by voice

Thoughtly blog: How to Test and Iterate AI Voice Agents Before Going Live

Thoughtly blog: How to Use Thoughtly Variables for Dynamic Call Personalization

Thoughtly blog: How to Use Outcomes and Branching for Complex Call Flows
