Watson Speech to Text on Cloud Pak for Data
Version: 4.0.9 Premium IBM
Description
IBM Watson Speech to Text for IBM Cloud Pak for Data provides speech recognition capabilities for your applications. The service leverages machine learning to combine knowledge of grammar, language structure, and the composition of audio and voice signals to accurately transcribe the human voice. It continuously updates and refines its transcription as it receives more speech audio. You can customize the service to suit your language and application needs.
Watson Speech to Text offers both HTTP and WebSocket programming interfaces that make it suitable for any application where speech is the input and a textual transcript is the output. Possible use cases include:
- Voice control of applications, embedded devices, and vehicle accessories
- Transcribing meetings and conference calls
- Dictating email messages and notes
The service is ideal for clients who need to extract high-quality speech transcripts from call center audio. Clients in industries such as financial services, healthcare, insurance, and telecommunication can develop cloud-native applications for customer care, customer voice, agent assistance, and other solutions.
Quick links
- Install: Install the service
- Use: Work with the service
- Administer: Manage and maintain the service
- Develop: Write code and build applications
Integrated services
Service | Capability |
---|---|
Watson™ Assistant for Voice Interaction | Enable direct voice interactions over a telephone with a cognitive self-service agent or transcribe phone calls between a caller and agent. |
Service | Capability |
---|---|
Watson Assistant | Build your own branded assistant into any device, application, or channel. Users interact with your application through the user interface that you implement. |
Watson Text to Speech | Convert written text to natural-sounding speech for your applications and stream the results back to the client with minimal delay. |