List Pipeline Options
Configs API Endpoints
List Pipeline Options
Retrieve the complete list of supported providers and models for Speech-to-Text (STT), Large Language Models (LLM), and Text-to-Speech (TTS) pipelines.
GET
List Pipeline Options
Discover the complete catalog of models available for building a modular Voice Agent pipeline. While “Realtime” models (likeDocumentation Index
Fetch the complete documentation index at: https://docs.lyzr.ai/llms.txt
Use this file to discover all available pages before exploring further.
gpt-realtime or gemini-native-audio) handle listening, thinking, and speaking in one step, a Pipeline agent strings together three separate specialized models. This endpoint gives you the exact IDs needed to configure that 3-step chain.
Authentication Required: You must include your API key in the
x-api-key header to authenticate this request.Understanding the Pipeline Architecture
To create a pipeline agent, you must configure three components in yourPOST /agents request. The response from this endpoint provides the available options for each:
1. stt (Speech-to-Text)
This is the model that transcribes the user’s spoken audio into text.
- Available Providers: Deepgram, AssemblyAI, Cartesia, Sarvam.
- Key Detail: Pay close attention to the
languagesarray for each STT model to ensure it supports your target demographic.
2. llm (Large Language Model)
This is the “brain” of the agent. It takes the text from the STT model, processes it against your system prompt, and generates a text response.
- Available Providers: OpenAI (GPT-4o, GPT-5 series), Google (Gemini Flash/Pro series), DeepSeek, MoonshotAI.
3. tts (Text-to-Speech)
This model takes the text generated by the LLM and synthesizes it into spoken audio for the user to hear.
- Available Providers: ElevenLabs, Cartesia, Deepgram (Aura), Inworld, Rime, Sarvam.
- Key Detail: The response includes a
defaultVoiceIdfor each TTS model, which you can use as a fallback if you aren’t rendering a full list of custom voice clones.