Yes. Real-time transcription is the single most common use case. Stream the audio to Google Speech-to-Text, Deepgram, Amazon Transcribe, Azure Speech, OpenAI Whisper, ElevenLabs, or any other ASR engine. Alternatively, use EnableX's built-in speech-to-text, which delivers 95%+ recognition accuracy across Indian languages and uses the IIT Madras IndicVoices dataset.
For Indian-language call centres, EnableX's built-in ASR often outperforms global engines on Hindi, Tamil, Telugu, Marathi, Bengali, and code-mixed Hinglish speech. For architecture patterns, read our guide on building speech-to-text systems in WebRTC calling.
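As a rough illustration of the "stream the audio to an ASR engine" pattern, the Python sketch below splits 16-bit mono PCM audio into 20 ms frames and forwards them to an engine one frame at a time. It is a minimal sketch under stated assumptions, not EnableX's or any vendor's actual API: the `AsrEngine` interface, the `pcm_frames` and `transcribe_stream` helpers, and the `EchoEngine` stand-in are all hypothetical names introduced for this example. In practice each vendor SDK would be wrapped in an adapter that matches the interface.

```python
from typing import Iterable, Iterator, Protocol


class AsrEngine(Protocol):
    """Hypothetical adapter interface; a real vendor SDK would be wrapped to fit it."""

    def send_audio(self, frame: bytes) -> None: ...
    def finish(self) -> str: ...


def pcm_frames(audio: bytes, sample_rate: int = 16000, frame_ms: int = 20) -> Iterator[bytes]:
    """Split 16-bit mono PCM into fixed-duration frames (the last one may be shorter)."""
    frame_bytes = sample_rate * frame_ms // 1000 * 2  # 2 bytes per 16-bit sample
    for start in range(0, len(audio), frame_bytes):
        yield audio[start:start + frame_bytes]


def transcribe_stream(frames: Iterable[bytes], engine: AsrEngine) -> str:
    """Forward frames to the engine as they arrive, then collect the transcript."""
    for frame in frames:
        engine.send_audio(frame)
    return engine.finish()


class EchoEngine:
    """Stand-in engine for local testing: it only reports how much audio it received."""

    def __init__(self) -> None:
        self.frames = 0
        self.received = 0

    def send_audio(self, frame: bytes) -> None:
        self.frames += 1
        self.received += len(frame)

    def finish(self) -> str:
        return f"{self.frames} frames, {self.received} bytes"


if __name__ == "__main__":
    one_second_of_silence = bytes(16000 * 2)  # 1 s of 16 kHz, 16-bit mono audio
    print(transcribe_stream(pcm_frames(one_second_of_silence), EchoEngine()))
```

Keeping the engine behind a small interface like this is one way to switch between an external ASR service and a built-in one without changing the code that captures and frames the call audio.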