Speech-to-text software that’s flexible, scalable, and easy to use
Our GPU-accelerated, AI-based technology enables you to deliver greater insights to the contact center by transcribing audio into analyzable text.
Transcribe all calls quickly, accurately, and cost-effectively
- Transcribe large volumes of recorded audio quickly via lightning-fast GPUs
- Integrate highly accurate transcripts with your analytics or business intelligence platform
- Reduce your hardware footprint and minimize total cost of ownership
Features:
- Cost-effectively scale from 10 to 100,000 agents
- State-of-the-art NVIDIA® GPUs that increase compute performance and speech-to-text (STT) conversion speed
- Optimized cloud solutions powered by AWS
- Latest-generation Intel® Xeon® or AMD processors
- High-speed DDR4 SDRAM for high-bandwidth data transfers
- Containerized machines for VMware, AWS AMIs, and others
Technical Specifications
- Audio Compatibility: Supports the G.711 suite of audio standards (uncompressed pulse-code modulation [PCM], μ-law, and A-law)
- Deployment Method: In-cloud deployment
- Integration: REST API enables both batch (file-based) and real-time (stream-based) operation
- Format: Open-format JSON and plain-text transcripts
- Languages Supported: North American English, Spanish, and French
- Speech Engine: Large Vocabulary Continuous Speech Recognition (LVCSR)
- Transcription Delivery Mode: Up to 150 hours of recorded calls transcribed per hour on a single hardware unit
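
For capacity planning, the throughput figure above can be turned into a rough estimate of how many hardware units a given backlog needs. The following is a minimal sketch, not a vendor sizing tool: the function names and the `utilization` derating factor are assumptions introduced here for illustration, and 150 hours per hour is the stated upper bound, so real-world throughput may be lower.

```python
import math

# Upper bound from the specification above: audio hours transcribed
# per wall-clock hour on a single hardware unit.
PEAK_AUDIO_HOURS_PER_UNIT_HOUR = 150.0


def units_needed(audio_hours: float, window_hours: float,
                 utilization: float = 1.0) -> int:
    """Estimate the units required to transcribe `audio_hours` of recordings
    within `window_hours` of wall-clock time.

    `utilization` is a hypothetical derating factor (0 < utilization <= 1)
    for running below the peak rate; it is not part of the specification.
    """
    if audio_hours < 0:
        raise ValueError("audio_hours must be non-negative")
    if window_hours <= 0:
        raise ValueError("window_hours must be positive")
    if not 0 < utilization <= 1:
        raise ValueError("utilization must be in (0, 1]")
    # Audio hours one unit can process over the whole window.
    per_unit_capacity = PEAK_AUDIO_HOURS_PER_UNIT_HOUR * utilization * window_hours
    return math.ceil(audio_hours / per_unit_capacity)


def hours_to_transcribe(audio_hours: float, units: int,
                        utilization: float = 1.0) -> float:
    """Estimate the wall-clock hours `units` machines need for `audio_hours`."""
    if units <= 0:
        raise ValueError("units must be positive")
    if not 0 < utilization <= 1:
        raise ValueError("utilization must be in (0, 1]")
    return audio_hours / (PEAK_AUDIO_HOURS_PER_UNIT_HOUR * utilization * units)
```

Under these assumptions, a backlog of 12,000 hours of recorded calls that must be transcribed within an 8-hour window works out to `units_needed(12000, 8)`, or 10 units at the peak rate. Halving `utilization` to 0.5 doubles that estimate to 20.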