Claude Code Automates Voice Agent Builds

Nate Herk | AI Automation

Nate demonstrates using Claude Code's natural language planning to build an ElevenLabs voice agent that qualifies leads and books Cal.com calls, handling setup, tools, and debugging without manual dashboard clicks.

Voice Agent Core Components

Voice agents operate in a continuous loop: user speech is transcribed to text, processed by an LLM that may query knowledge bases or call tools, and the reply is synthesized back to speech. The key elements are the persona (a system prompt defining tone, e.g., "warm professional B2B sales"), the voice (selected from the ElevenLabs library or a custom clone), knowledge (e.g., business info or YouTube transcripts), and tools (API calls such as Cal.com booking or GitHub queries). Traditional setup means configuring all of this by hand in the ElevenLabs dashboard (prompts, voices, knowledge uploads, tool endpoints); Claude Code automates it by reading the docs and making the API calls itself.
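The loop described above can be sketched as a toy model. This is a minimal illustration, not ElevenLabs code: the VoiceAgent class, its method names, and the keyword-based tool matching are all assumptions, and each stage is a stub standing in for real speech-to-text, LLM, and text-to-speech services.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class VoiceAgent:
    """Toy model of the transcribe -> reason -> (tool) -> synthesize loop."""
    persona: str
    tools: Dict[str, Callable[[str], str]] = field(default_factory=dict)
    history: List[str] = field(default_factory=list)

    def transcribe(self, audio: bytes) -> str:
        # Stand-in for speech-to-text: here the "audio" is just UTF-8 text.
        return audio.decode("utf-8")

    def think(self, text: str) -> str:
        # Stand-in for the LLM: call a tool if its name appears in the text.
        for name, tool in self.tools.items():
            if name in text.lower():
                return tool(text)
        return f"({self.persona}) You said: {text}"

    def synthesize(self, reply: str) -> bytes:
        # Stand-in for text-to-speech.
        return reply.encode("utf-8")

    def turn(self, audio: bytes) -> bytes:
        # One pass through the loop, recording both sides of the exchange.
        text = self.transcribe(audio)
        self.history.append(text)
        reply = self.think(text)
        self.history.append(reply)
        return self.synthesize(reply)


agent = VoiceAgent(
    persona="warm professional B2B sales",
    tools={"book": lambda _text: "Let me check the calendar for open slots."},
)
```

Calling agent.turn(b"I want to book a call") routes through the hypothetical "book" tool, while any other input falls back to the persona reply.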

Nate's earlier example agent, drawing on 400 YouTube transcripts, answered questions such as which scraping tool to use (it recommended Firecrawl for pulling job listings into Excel) using RAG-like retrieval. Deployment options are testing in the ElevenLabs dashboard, a website widget (a single script embed), or a phone integration through Twilio. The widget is a floating bubble that starts a call, which makes it a good fit for websites.

Natural Language Planning Replaces Manual Config

In VS Code with the Claude Code extension (which requires a paid Claude subscription), Nate activates "plan mode" for iterative brainstorming. His prompt: embed a sales agent on the Neural AI consultancy landing page that answers questions, captures leads (name, email, company, problem, team size, role), and books 30-minute Cal.com discovery calls. Claude asks clarifying questions about ElevenLabs and Cal.com status (new agent, existing event type), widget style (floating bubble), persona (warm B2B sales), and whether to book directly (yes, skipping n8n/Zapier).
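The six lead fields listed above map naturally onto a small record type. The sketch below is an assumption about how one might track qualification progress; the Lead class and its method names are hypothetical and do not come from the video.

```python
from dataclasses import dataclass, fields
from typing import List, Optional


@dataclass
class Lead:
    """Fields the agent is asked to capture before booking a call."""
    name: Optional[str] = None
    email: Optional[str] = None
    company: Optional[str] = None
    problem: Optional[str] = None
    team_size: Optional[int] = None
    role: Optional[str] = None

    def missing(self) -> List[str]:
        """Names of fields the agent still needs to ask about."""
        return [f.name for f in fields(self) if getattr(self, f.name) is None]

    def ready_to_book(self) -> bool:
        # Only offer the 30-minute call once every field is filled in.
        return not self.missing()
```

A partially filled Lead reports what is still outstanding, which is the kind of state a system prompt would otherwise have to track in free text.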

The plan that emerges: fetch the API keys and event ID, create the ElevenLabs agent (Claude-3.5-sonnet as the LLM, with a system prompt emphasizing the sales funnel), add tools (Cal.com availability check and booking), and embed the widget. Claude drafts the prompt: "You're Neural's sales agent... qualify leads, book calls." Nate's comment, "As humans we're very good at knowing the end goal but we don't always know the way to get there," highlights the value of plan mode in clarifying the path. Claude also creates a .env file for the keys, minimizing the steps left to the user.
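Reading the .env file Claude creates could look like the sketch below, which uses only the standard library. The variable names shown in the usage note (ELEVENLABS_API_KEY, CAL_API_KEY, CAL_EVENT_TYPE_ID) are assumptions; the source does not say what the keys are called.

```python
import os
from pathlib import Path
from typing import Dict


def parse_env(text: str) -> Dict[str, str]:
    """Parse KEY=VALUE lines, skipping blanks, comments, and malformed lines."""
    values: Dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def load_env(path: str = ".env") -> Dict[str, str]:
    """Load a .env file into os.environ without overwriting existing values."""
    file = Path(path)
    values = parse_env(file.read_text()) if file.exists() else {}
    for key, value in values.items():
        os.environ.setdefault(key, value)
    return values
```

After load_env(), a script would read something like os.environ["ELEVENLABS_API_KEY"] instead of hard-coding the secret.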

API Integration and Tool Automation

The user provides a Cal.com API key (from settings) and an ElevenLabs key (full permissions, with an optional spend limit). The event type ID points at a 30-minute slot synced to Google Calendar. Claude Code then authenticates with Cal.com, creates the agent "Neural Diagnostics," sets the voice and tools (two Cal.com functions: check slots, and book with metadata), configures knowledge (business details), sets the first message ("Hi, I'm Neural's AI... ready to chat about AI transformation?"), and generates the widget script.
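The two-tool split (check slots, then book with metadata) can be illustrated with an in-memory stand-in. Everything here is hypothetical: FakeCalendar is not the Cal.com API, and the tool-call shape {"name": ..., "arguments": {...}} is an assumption about how an agent might express a call, not the ElevenLabs schema.

```python
from typing import Any, Dict, List


class FakeCalendar:
    """In-memory stand-in for the scheduling backend (not the Cal.com API)."""

    def __init__(self, open_slots: List[str]) -> None:
        self.open_slots = list(open_slots)
        self.bookings: List[Dict[str, str]] = []

    def check_slots(self) -> List[str]:
        return list(self.open_slots)

    def book(self, start: str, name: str, email: str, **metadata: str) -> Dict[str, str]:
        # Refuse anything that did not come from a fresh availability check.
        if start not in self.open_slots:
            raise ValueError(f"slot {start} is not available")
        self.open_slots.remove(start)
        booking = {"start": start, "name": name, "email": email, **metadata}
        self.bookings.append(booking)
        return booking


def dispatch(calendar: FakeCalendar, call: Dict[str, Any]) -> Any:
    """Route a tool call of the form {"name": ..., "arguments": {...}}."""
    handlers = {"check_slots": calendar.check_slots, "book_call": calendar.book}
    if call["name"] not in handlers:
        raise KeyError(f"unknown tool: {call['name']}")
    return handlers[call["name"]](**call.get("arguments", {}))
```

Keeping availability and booking as separate tools means a stale or invented start time fails loudly at booking rather than silently creating a bad event.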

The tools call the Cal.com API directly: booking is a POST request to Cal.com's booking endpoint carrying the name, email, and a start time taken from the availability check. Claude renames the event to "Neural Diagnostic" for branding. No custom code is needed beyond the .env file and the widget script injected into index.html. Why ElevenLabs? Nate cites the best voice cloning (his clone used 4 hours of training audio), an intuitive dashboard, and an easy widget. Alternatives such as Pinecone or NotebookLM for knowledge, or MCP servers for tools, are possible but unnecessary here.
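The index.html script injection mentioned above could be done with a small helper like this one. It is a sketch under stated assumptions: inject_widget is a hypothetical function, and the snippet in the usage note is a placeholder, not the real ElevenLabs embed code, which should be copied from the dashboard or docs.

```python
import re


def inject_widget(html: str, snippet: str) -> str:
    """Insert an embed snippet just before the last </body> tag."""
    if snippet in html:
        return html  # already embedded; keep the operation idempotent
    matches = list(re.finditer(r"</body\s*>", html, flags=re.IGNORECASE))
    if not matches:
        # No closing body tag found: append the snippet at the end instead.
        return html + "\n" + snippet + "\n"
    index = matches[-1].start()
    return html[:index] + snippet + "\n" + html[index:]
```

For example, inject_widget(page, '<script src="https://example.com/widget.js" async></script>') returns the page with the placeholder script placed before the closing body tag, and calling it a second time changes nothing.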

Iteration, Debugging, and Live Testing

The first test fails: the widget loads, but a timezone bug makes the offered times disagree with the Cal.com slots (the agent assumes UTC while the user is thinking in local time). To debug, Claude inspects the logs, adjusts the tool prompt so the agent asks for and uses the user's timezone (e.g., "What timezone are you in?"), and reconfigures the agent. The iteration loop is test, hit an error, then fix it in natural language ("Debug timezone bug"). In the final demo, the agent qualifies the lead (company pain points, team size) and books the call without issues.
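The mismatch described above comes down to presenting a UTC slot in the caller's local time. The sketch below shows the conversion with the standard library; the function name and the ISO 8601 slot format with a trailing "Z" are assumptions about what the scheduling API returns.

```python
from datetime import datetime, timedelta, timezone, tzinfo


def slot_in_user_timezone(slot_utc: str, user_tz: tzinfo) -> str:
    """Convert an ISO 8601 UTC slot into the user's timezone."""
    # Older Python versions do not accept a trailing "Z" in fromisoformat.
    start = datetime.fromisoformat(slot_utc.replace("Z", "+00:00"))
    if start.tzinfo is None:
        # Treat naive timestamps as UTC, matching the agent's original assumption.
        start = start.replace(tzinfo=timezone.utc)
    return start.astimezone(user_tz).isoformat()


# A fixed UTC-6 offset keeps the example self-contained.
central_standard = timezone(timedelta(hours=-6))
local_start = slot_in_user_timezone("2025-01-15T15:00:00Z", central_standard)
```

In practice one would likely pass zoneinfo.ZoneInfo("America/Chicago") rather than a fixed offset so that daylight saving time is handled automatically.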

Quote: "Code beats clicks basically just validating that it's so much better to build a voice agent by speaking into your computer rather than going onto the dashboard and clicking." Security: keep API keys in .env (listed in .gitignore) and set monthly spend limits; production needs more granular key permissions. Costs: ElevenLabs runs roughly $0.10-0.30/min (transcription and synthesis), Claude Code is billed by token usage, and the Cal.com free tier is sufficient.
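The per-minute range quoted above turns into a monthly budget with simple arithmetic. The call volume and the $150 cap in this sketch are made-up illustration values; only the $0.10-0.30/min range comes from the summary.

```python
from typing import Tuple


def monthly_voice_cost(
    calls_per_month: int,
    avg_minutes: float,
    rate_low: float = 0.10,
    rate_high: float = 0.30,
) -> Tuple[float, float]:
    """Return the (low, high) monthly cost estimate in dollars."""
    minutes = calls_per_month * avg_minutes
    return (round(minutes * rate_low, 2), round(minutes * rate_high, 2))


def within_spend_cap(estimate: Tuple[float, float], cap: float) -> bool:
    # Compare the pessimistic end of the range against the cap.
    return estimate[1] <= cap


estimate = monthly_voice_cost(calls_per_month=200, avg_minutes=5)
```

Here 200 calls of 5 minutes each is 1,000 minutes, so the estimate is (100.0, 300.0), and a hypothetical $150 monthly cap would be exceeded at the high end of the range.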

Production Deployment Realities

Test on localhost first, then deploy the site (e.g., a Hostinger VPS with NATEHERK code). The widget handles the conversation loop automatically. Scalability: the agent handles multiple calls, so monitor costs. Failures are instructive; for example, permission errors prompt changes to the key settings. Outcome: from idea to live agent in roughly 15-30 minutes, versus hours of manual setup.

Quote: "It has never been so easy to build whatever you want." This captures the shift in accessibility. A non-obvious point: connecting ElevenLabs directly to Cal.com skips middleware (n8n/Zapier), but it requires precise prompting for edge cases such as timezones.

Key Takeaways

  • Use Claude Code plan mode to align on architecture before execution, clarifying ambiguities like auth flows.
  • Store API keys in .env; grant ElevenLabs full perms for demos, restrict for prod with spend caps.
  • Build tools modularly: Separate availability check and booking for reliability.
  • Debug via logs + natural language: Describe issues, let Claude iterate prompts/tools.
  • Prefer direct API integrations over Zapier for latency/cost; test timezones explicitly.
  • ElevenLabs widget is one script embed.
  • Voice cloning elevates realism; train on 4+ hours for professional tone.
  • Costs scale with usage: Budget $0.10+/min; monitor via dashboards.
  • Start simple: Persona + tools > complex knowledge for sales agents.

summary by x-ai/grok-4.1-fast. probably wrong about something. check the source.