Product overview: IKnowWhatYouMean is a robust real-time speech recognition application, which can convert voice into text. You can talk to the computer and it will type the text in a snap.
Issues: The software had to type exactly the words spoken with great accuracy. It had to rapidly identify the language, the topic, and the context and transcribe what is being discussed in a snap. The IKnowWhatYouMean app should be able to comprehend and manage the voice from audio and from a microphone as well. Plus, we had to develop and implement meaningful analytics, so that it could structure and analyze the text. Due to machine intelligence techniques, we precisely transcribed the voice. It allowed analyzing grammar and language structure data, taking into account how the audio signal is composed. Plus, the customer wanted the app to be a self-learning creation. The IKnowWhatYouMean app should support 5 languages: English, French, German, Russian, and Spanish.
Supported audio formats: Web Media (WebM), Opus or Vorbis codec, MP3 or MPEG, Waveform Audio File Format (WAV), Linear 16-bit Pulse-Code Modulation (PCM), Free Lossless Audio Codec (FLAC), mu-law (or u-law) audio, and basic audio.
Technologies used: Application Programming Interfaces (APIs), WebSocket, HTTP REST
Result: A powerful, yet elegant and easygoing application, which can be utilized in various industries where you might need to use the voice recognition option.