Samtal

Samtal

Powering your Agentic Voice AI

Built by Applied AI Sweden AB   ·   EU-owned & operated   ·   Self-hosting available

API Key Using built-in demo key
Playground
Voice Clone
API Docs
SDK Integration

Text-to-Speech

FastHigh quality
Voice design (optional — overrides selected voice)

Speech-to-Text

Round-Trip

TTS then ASR — tests full pipeline accuracy and latency.

Voice Cloning

Record a short speech sample, then use it as the voice for TTS. Both the audio and its transcript are required.

Step 1: Record Reference

Read one of these sentences naturally. 5-15 seconds works best.

Step 2: Verify Transcript

Auto-filled from your recording. Edit if needed.

Step 3: Generate with Cloned Voice

Overview

Drop-in replacement for ElevenLabs. Point any ElevenLabs SDK at https://samtal.moln.ai. Powered by OmniVoice (646-language TTS) and Parakeet TDT 0.6b-v3 (25-language ASR) on NVIDIA GPU.

Authentication

xi-api-key: sk_your_key_here

Endpoints

POST /v1/text-to-speech/{voice_id}
POST /v1/text-to-speech/{voice_id}/stream — chunked streaming
POST /v1/speech-to-text
GET /v1/voices   GET /v1/voices/{voice_id}   GET /v1/models

Voices

voice_idNameLanguage
spectra-en-defaultNovaEnglish
spectra-en-warmOliverEnglish
spectra-sv-defaultAstridSwedish
spectra-da-defaultFrejaDanish
spectra-no-defaultSigridNorwegian
spectra-fi-defaultAinoFinnish
spectra-de-defaultLenaGerman
spectra-fr-defaultCamilleFrench
spectra-es-defaultLuciaSpanish
spectra-it-defaultGiuliaItalian
spectra-nl-defaultEmmaDutch
spectra-pl-defaultZuzannaPolish
spectra-pt-defaultBeatrizPortuguese

Performance

OperationLatencyRTF
TTS (GPU, quality 4)~35ms0.10
ASR (GPU)~70ms0.039
Cached snippetinstant0

Drop-in ElevenLabs Replacement

  1. Set base_url to https://samtal.moln.ai
  2. Use a Samtal API key

All endpoints, request formats, and response shapes are identical to ElevenLabs.

Python

JavaScript

cURL

Go

Ruby