< 1.2s

typical AI response latency

platform for the full AI pipeline

2.7B

minutes processed

40%

more abandonment above 1.2s

Trusted by 2,000+ companies

The problem

Bolt-on pipelines stack latency at every boundary

Six hops between caller and AI

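The path runs from PSTN to telephony provider, telephony to WebSocket, WebSocket to your server, server to STT, STT to LLM, and LLM to TTS, then from TTS back through the chain. Each hop adds latency, from about 20ms for a simple network transit to as much as 800ms while waiting on the LLM's first tokens.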

Partial metrics hide the real number

STT-to-first-token measures one step. TTS time-to-first-byte measures another. Neither measures the time a caller waits between finishing a sentence and hearing the AI respond.

Optimization cannot eliminate architecture

Switching to a faster TTS provider saves 200ms but does not remove the other five network boundaries. Co-locating servers helps, but four to six network transits remain.

Streaming helps, but boundaries remain

Streaming STT and TTS reduces batch delays. Each stream still crosses a network boundary. Streaming over WebSocket to an external service is faster than batch, but slower than processing inside one engine.

Build a Voice AI Agent

Python

from signalwire_agents import AgentBase
from signalwire_agents.core.function_result import SwaigFunctionResult

class SupportAgent(AgentBase):
    def __init__(self):
        super().__init__(name="Support Agent", route="/support")
        self.prompt_add_section("Instructions",
            body="You are a customer support agent. "
                 "Greet the caller and resolve their issue.")
        self.add_language("English", "en-US", "rime.spore:mistv2")

    @AgentBase.tool(name="check_order")
    def check_order(self, order_id: str):
        """Check the status of a customer order.

        Args:
            order_id: The order ID to look up
        """
        return SwaigFunctionResult(f"Order {order_id}: shipped, ETA April 2nd")

agent = SupportAgent()
agent.run()

TypeScript / JavaScript

import { AgentBase, FunctionResult } from '@signalwire/sdk';

const agent = new AgentBase({
  name: 'Support Agent',
  route: '/support',
});

agent.promptAddSection('Instructions',
  'You are a customer support agent. Greet the caller and resolve their issue.');

agent.addLanguage({ name: 'English', code: 'en-US', voice: 'rime.spore:mistv2' });

agent.defineTool({
  name: 'check_order',
  description: 'Check the status of a customer order',
  parameters: {
    type: 'object',
    properties: {
      order_id: { type: 'string', description: 'The order ID to look up' },
    },
    required: ['order_id'],
  },
  handler: (args) => {
    return new FunctionResult(`Order ${args.order_id}: shipped, ETA April 2nd`);
  },
});

agent.run();

Go

package main

import (
	"fmt"

	"github.com/signalwire/signalwire-go/pkg/agent"
	"github.com/signalwire/signalwire-go/pkg/swaig"
)

func main() {
	a := agent.NewAgentBase(
		agent.WithName("Support Agent"),
		agent.WithRoute("/support"),
	)

	a.PromptAddSection("Instructions",
		"You are a customer support agent. Greet the caller and resolve their issue.")

	a.AddLanguage(map[string]any{
		"name": "English", "code": "en-US", "voice": "rime.spore:mistv2",
	})

	a.DefineTool(agent.ToolDefinition{
		Name:        "check_order",
		Description: "Check the status of a customer order",
		Parameters: map[string]any{
			"type": "object",
			"properties": map[string]any{
				"order_id": map[string]any{
					"type": "string", "description": "The order ID to look up",
				},
			},
			"required": []string{"order_id"},
		},
		Handler: func(args map[string]any, rawData map[string]any) *swaig.FunctionResult {
			orderID := args["order_id"]
			return swaig.NewFunctionResult(
				fmt.Sprintf("Order %v: shipped, ETA April 2nd", orderID),
			)
		},
	})

	a.Run()
}

Java

import com.signalwire.sdk.agent.AgentBase;
import com.signalwire.sdk.swaig.FunctionResult;

import java.util.List;
import java.util.Map;

public class SupportAgent {
    public static void main(String[] args) throws Exception {
        var agent = AgentBase.builder()
                .name("Support Agent")
                .route("/support")
                .build();

        agent.promptAddSection("Instructions",
                "You are a customer support agent. "
              + "Greet the caller and resolve their issue.");

        agent.addLanguage("English", "en-US", "rime.spore:mistv2");

        agent.defineTool(
                "check_order",
                "Check the status of a customer order",
                Map.of("type", "object",
                       "properties", Map.of(
                           "order_id", Map.of(
                               "type", "string",
                               "description", "The order ID to look up")),
                       "required", List.of("order_id")),
                (toolArgs, rawData) -> {
                    var orderId = toolArgs.get("order_id");
                    return new FunctionResult(
                        "Order " + orderId + ": shipped, ETA April 2nd");
                }
        );

        agent.run();
    }
}

Ruby

# frozen_string_literal: true

require 'signalwire'

agent = SignalWire::AgentBase.new(name: 'Support Agent', route: '/support')

agent.prompt_add_section('Instructions',
  'You are a customer support agent. Greet the caller and resolve their issue.')

agent.add_language(name: 'English', code: 'en-US', voice: 'rime.spore:mistv2')

agent.define_tool(
  name:        'check_order',
  description: 'Check the status of a customer order',
  parameters:  {
    'order_id' => { 'type' => 'string', 'description' => 'The order ID to look up' }
  }
) do |args, _raw|
  SignalWire::Swaig::FunctionResult.new(
    "Order #{args['order_id']}: shipped, ETA April 2nd"
  )
end

agent.run

PHP

<?php
require 'vendor/autoload.php';

use SignalWire\Agent\AgentBase;
use SignalWire\SWAIG\FunctionResult;

$agent = new AgentBase(['name' => 'Support Agent', 'route' => '/support']);

$agent->promptAddSection('Instructions',
    'You are a customer support agent. Greet the caller and resolve their issue.');

$agent->addLanguage('English', 'en-US', 'rime.spore:mistv2');

$agent->defineTool(
    name: 'check_order',
    description: 'Check the status of a customer order',
    parameters: [
        'order_id' => ['type' => 'string', 'description' => 'The order ID to look up'],
    ],
    handler: function (array $args): FunctionResult {
        return new FunctionResult("Order {$args['order_id']}: shipped, ETA April 2nd");
    }
);

$agent->run();

Perl

#!/usr/bin/env perl
use strict;
use warnings;
use lib 'lib';
use SignalWire::Agent::AgentBase;
use SignalWire::SWAIG::FunctionResult;

my $agent = SignalWire::Agent::AgentBase->new(
    name  => 'Support Agent',
    route => '/support',
);

$agent->prompt_add_section('Instructions',
    'You are a customer support agent. Greet the caller and resolve their issue.');

$agent->add_language(name => 'English', code => 'en-US', voice => 'rime.spore:mistv2');

$agent->define_tool(
    name        => 'check_order',
    description => 'Check the status of a customer order',
    parameters  => {
        order_id => { type => 'string', description => 'The order ID to look up' },
    },
    handler => sub {
        my ($args, $raw) = @_;
        return SignalWire::SWAIG::FunctionResult->new(
            response => "Order $args->{order_id}: shipped, ETA April 2nd"
        );
    },
);

$agent->run;

C++

#include <string>

#include <signalwire/agent/agent_base.hpp>

using namespace signalwire;
using json = nlohmann::json;

class SupportAgent : public agent::AgentBase {
public:
    SupportAgent() : AgentBase("Support Agent", "/support") {
        prompt_add_section("Instructions",
            "You are a customer support agent. "
            "Greet the caller and resolve their issue.");

        add_language({"English", "en-US", "rime.spore:mistv2"});

        define_tool({
            .name = "check_order",
            .description = "Check the status of a customer order",
            .parameters = {
                {"order_id", {{"type", "string"},
                              {"description", "The order ID to look up"}}}
            },
            .handler = [](const json& args, const json&) {
                auto order_id = args.value("order_id", "unknown");
                return swaig::FunctionResult(
                    "Order " + order_id + ": shipped, ETA April 2nd");
            }
        });
    }
};

int main() {
    SupportAgent().run();
}

C#

using SignalWire.Agent;
using SignalWire.SWAIG;

var agent = new AgentBase(new AgentOptions { Name = "Support Agent", Route = "/support" });

agent.PromptAddSection("Instructions",
    "You are a customer support agent. Greet the caller and resolve their issue.");

agent.AddLanguage("English", "en-US", "rime.spore:mistv2");

agent.DefineTool("check_order", "Check the status of a customer order",
    new { type = "object", properties = new {
        order_id = new { type = "string", description = "The order ID to look up" }
    }, required = new[] { "order_id" } },
    (args, rawData) =>
    {
        var orderId = args.TryGetValue("order_id", out var id) ? id : "unknown";
        return new FunctionResult($"Order {orderId}: shipped, ETA April 2nd");
    });

agent.Run();

Rust

use signalwire::agent::AgentBase;
use signalwire::swaig::FunctionResult;
use serde_json::json;

fn main() {
    let mut agent = AgentBase::builder()
        .name("Support Agent")
        .route("/support")
        .build();

    agent
        .prompt_add_section("Instructions",
            "You are a customer support agent. Greet the caller and resolve their issue.", &[])
        .add_language("English", "en-US", "rime.spore:mistv2");

    agent.define_tool(
        "check_order",
        "Check the status of a customer order",
        json!({"type": "object", "properties": {
            "order_id": {"type": "string", "description": "The order ID to look up"}
        }, "required": ["order_id"]}),
        Box::new(|args, _raw| {
            let order_id = args.get("order_id").and_then(|v| v.as_str()).unwrap_or("unknown");
            FunctionResult::with_response(&format!("Order {order_id}: shipped, ETA April 2nd"))
        }),
    );

    agent.run();
}

Multi-vendor pipeline vs. single engine

Bolt-on pipeline

Six to nine network hops per turn
770ms to 2,080ms measured roundtrip
Each vendor measures its own slice
Codec transcoding between services (G.711 to PCM)
WebSocket piping adds silent failure modes
Co-location helps one hop, not six

SignalWire

Orchestration inside the media stack
800 to 1,200ms typical full roundtrip
One engine measures the entire path
Native codec handling, no transcoding step
Audio stays inside the media engine
Built by the FreeSWITCH team

Independent latency measurements by platform

Platform	Measured latency	Source
Twilio	950ms average	Telnyx: Voice AI Agents Compared
Vonage	800 to 1,200ms	Telnyx: Voice AI Agents Compared
Vapi (India region)	1,450ms	Trustpilot reviews, production reports
Bland AI	800ms average	G2 reviews
DIY WebSocket stack	1,920ms median	DEV Community benchmark
DIY WebRTC stack	2,060ms median	DEV Community benchmark
LiveKit + Twilio (EU)	4,000ms+ per turn	GitHub issues, production reports
SignalWire	800 to 1,200ms typical	Full roundtrip measurement

Where milliseconds accumulate in a bolt-on stack

Hop	What happens	Latency added
PSTN to telephony provider	Call ingress, media stream setup	50 to 100ms
Telephony to WebSocket	Base64 encode mu-law, open stream	30 to 80ms
WebSocket to your server	Network transit, decode, buffer	20 to 50ms
Server to STT	Codec convert, stream audio, wait for transcript	200 to 400ms
STT to LLM	Send transcript, wait for first tokens	200 to 800ms
LLM to TTS	Send text, wait for first audio chunk	150 to 400ms
TTS back through chain	Encode, transmit, decode at each boundary	120 to 250ms
Total		770 to 2,080ms
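
The total row can be checked by adding up the per-hop ranges. The short sketch below does that in Python. The figures are copied from the table above; the names `HOPS` and `total_range` are illustrative only and are not part of any SDK.

```python
# Per-hop latency ranges in milliseconds, copied from the table above.
# Each entry is (hop, low_ms, high_ms).
HOPS = [
    ("PSTN to telephony provider", 50, 100),
    ("Telephony to WebSocket", 30, 80),
    ("WebSocket to your server", 20, 50),
    ("Server to STT", 200, 400),
    ("STT to LLM", 200, 800),
    ("LLM to TTS", 150, 400),
    ("TTS back through chain", 120, 250),
]


def total_range(hops):
    """Return (best_case_ms, worst_case_ms) for hops traversed one after another."""
    best = sum(low for _, low, _ in hops)
    worst = sum(high for _, _, high in hops)
    return best, worst


if __name__ == "__main__":
    best, worst = total_range(HOPS)
    print(f"Total: {best} to {worst:,}ms")  # prints "Total: 770 to 2,080ms"
```

Because the hops run in sequence, the ranges add. No single hop has to be slow for the total to pass one second.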

How SignalWire keeps the full roundtrip at 800 to 1,200ms

Call arrives at the media engine

PSTN ingress with no external telephony provider in the path. The audio is already inside the engine.

STT streams concurrently

Audio is processed in 250ms increments while the caller is speaking, so transcription starts before the caller finishes.

LLM inference runs in parallel

The transcript streams to the LLM while the caller is still speaking. Response generation overlaps with transcription.

TTS generates audio inside the engine

No network hop to an external synthesis service. Audio goes from TTS to PSTN egress without leaving the platform.
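
To see why transcribing during speech matters, here is a toy model of the wait that remains after the caller stops talking. It is a sketch under stated assumptions, not a description of SignalWire's internals: the 250ms chunk size comes from the text above, while `real_time_factor` (processing time per unit of audio) and both function names are hypothetical.

```python
def batch_wait_ms(utterance_ms, real_time_factor):
    """Wait after end of speech when transcription starts only once the
    caller stops: the whole utterance is still unprocessed."""
    return utterance_ms * real_time_factor


def streaming_wait_ms(utterance_ms, real_time_factor, chunk_ms=250):
    """Upper bound on the wait after end of speech when each chunk is
    transcribed as it arrives: at most one chunk is still unprocessed."""
    if not 0 < real_time_factor < 1:
        # Assumption of this model: STT keeps up with incoming audio.
        raise ValueError("the model assumes STT runs faster than real time")
    return min(utterance_ms, chunk_ms) * real_time_factor


if __name__ == "__main__":
    # Hypothetical inputs: a 3-second utterance and an STT engine that
    # needs 0.25s of compute per second of audio.
    print(batch_wait_ms(3000, 0.25))      # prints 750.0
    print(streaming_wait_ms(3000, 0.25))  # prints 62.5
```

In this model the batch wait grows with the length of the utterance, while the streaming wait is capped by the chunk size. The same reasoning applies to starting LLM inference before the transcript is complete.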

Some platforms report STT-to-first-token. Others report TTS time-to-first-byte. Full roundtrip measures what the caller experiences: the gap between finishing a sentence and hearing the AI respond.

What latency feels like to callers

Latency	Caller experience	Business impact
Under 500ms	Feels instantaneous	Optimal engagement
500 to 800ms	Slight pause, still conversational	Acceptable for most use cases
800 to 1,200ms	Noticeable delay, like a bad international call	Callers start talking over the agent
1,200 to 2,000ms	Awkward pauses, callers check if the line dropped	40% increase in call abandonment
Above 2,000ms	Caller hangs up or asks for a human	Support escalation, lost revenue
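
If you log per-turn latency, the bands in the table can be turned into a small lookup. The sketch below is one way to do it in Python. The function name is illustrative, and because the table lists 800ms and 1,200ms in two rows each, the code assumes a value on a shared edge belongs to the slower band.

```python
def caller_experience(latency_ms):
    """Map a full-roundtrip latency in milliseconds to the caller
    experience described in the table above.

    Assumption: values on a shared edge (500, 800, 1,200) fall into the
    slower band; exactly 2,000ms stays in the 1,200 to 2,000ms band.
    """
    if latency_ms < 500:
        return "Feels instantaneous"
    if latency_ms < 800:
        return "Slight pause, still conversational"
    if latency_ms < 1200:
        return "Noticeable delay, like a bad international call"
    if latency_ms <= 2000:
        return "Awkward pauses, callers check if the line dropped"
    return "Caller hangs up or asks for a human"


if __name__ == "__main__":
    for ms in (450, 950, 2500):
        print(ms, "->", caller_experience(ms))
```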

FAQ

How is the 800 to 1,200ms figure measured?

Full roundtrip: the moment the caller stops speaking to the moment the caller hears the AI respond. Not a partial metric like STT-to-first-token or TTS time-to-first-byte. With speech-to-speech voice models, latency can be as low as 600ms.
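
The difference between a full roundtrip and the partial metrics can be made concrete with timestamps from a single turn. The following Python sketch assumes your logs record the five events shown. Every field and function name is hypothetical, and the sample values are illustrative, not measurements.

```python
from dataclasses import dataclass


@dataclass
class TurnTimestamps:
    """Event times for one conversational turn, in milliseconds on one clock.

    Field names are hypothetical; map them to whatever your logs record.
    """
    caller_stopped_speaking: float
    transcript_final: float
    llm_first_token: float
    tts_first_byte: float
    caller_hears_audio: float


def full_roundtrip_ms(t):
    """What the caller experiences: end of speech to first audible response."""
    return t.caller_hears_audio - t.caller_stopped_speaking


def partial_metrics_ms(t):
    """Slices that individual vendors typically report."""
    return {
        "stt_to_first_token": t.llm_first_token - t.transcript_final,
        # Approximation: assumes text reaches TTS when the first token arrives.
        "tts_time_to_first_byte": t.tts_first_byte - t.llm_first_token,
    }


if __name__ == "__main__":
    # Illustrative values only, not measurements.
    turn = TurnTimestamps(0, 300, 700, 900, 1100)
    print(full_roundtrip_ms(turn))   # prints 1100
    print(partial_metrics_ms(turn))
```

In this made-up turn the two partial metrics add up to 600ms, while the caller waits 1,100ms. The remainder is time spent in transcription and transport that neither slice covers.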

Where do the competitor numbers come from?

Twilio and Vonage numbers come from a Telnyx benchmark (a competitor publishing independent measurements). Vapi numbers come from Trustpilot reviews and production reports. Bland AI numbers come from G2 reviews. DIY stack numbers come from DEV Community benchmarks. LiveKit + Twilio numbers come from GitHub issues and production reports.

Can I reduce latency by switching to a faster TTS provider?

Switching providers saves time on one hop but does not eliminate the other five to eight network boundaries. Architecture determines the floor. Optimization determines how close you get to it.
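
A quick calculation shows the effect. The sketch below takes the worst-case figure for each hop from the bolt-on table in this document and applies the 200ms TTS saving mentioned earlier. The names `WORST_CASE_MS` and `with_saving` are illustrative only.

```python
# Worst-case latency per hop in milliseconds, taken from the upper bounds
# of the bolt-on table in this document.
WORST_CASE_MS = {
    "PSTN to telephony provider": 100,
    "Telephony to WebSocket": 80,
    "WebSocket to your server": 50,
    "Server to STT": 400,
    "STT to LLM": 800,
    "LLM to TTS": 400,
    "TTS back through chain": 250,
}


def with_saving(hops, hop_name, saved_ms):
    """Return a copy of hops with one hop sped up by saved_ms (floored at zero)."""
    faster = dict(hops)
    faster[hop_name] = max(0, faster[hop_name] - saved_ms)
    return faster


if __name__ == "__main__":
    before = sum(WORST_CASE_MS.values())
    faster = with_saving(WORST_CASE_MS, "LLM to TTS", 200)
    after = sum(faster.values())
    other_hops = after - faster["LLM to TTS"]
    print(f"{before}ms -> {after}ms, of which {other_hops}ms is outside the TTS hop")
    # prints "2080ms -> 1880ms, of which 1680ms is outside the TTS hop"
```

Even if the TTS hop dropped to zero, the other hops would still account for 1,680ms in this worst case.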

Does SignalWire lock me into specific STT or LLM providers?

No. You can bring your own models. The AI kernel orchestrates them from inside the media engine, eliminating the orchestration overhead of bolt-on pipelines.

What about caching LLM responses to reduce latency?

Caching helps for common queries but does nothing for novel conversations, which are the reason to deploy an AI agent in the first place. A cache hit can skip the LLM call, but the STT, TTS, and telephony hops are still network round-trips that no cache eliminates.

Measure the full roundtrip yourself.

Run the same conversation on your current stack and on SignalWire. Compare what your callers actually experience.


Every Vendor Hop Costs You 200ms