Skip to main content

Introduction

Overview

The SpeechToText SDK provides a voice recognition feature based on a voice interface by enabling integration with the 42dot voice recognition platform in applications running on the 42dot Pleos Vehicle OS. The SDK transmits the user's voice to the Speech Service platform according to the API format provided by the SpeechToText SDK and delivers the processed text results from the Speech Service platform back to the client. By using the SpeechToText SDK, IVI applications can utilize the 42dot voice recognition service without needing to manage microphone control or handle input and output operations.

[No backward compatibility support]
  • Apply the updated package naming. (Modify module-level build.gradle and import statements)
  • Apply the updated permission naming in the manifest file.
  • Redownload the Pleos Connect Emulator.

Development Environment Setup

To use the SpeechToText SDK, install the Pleos Connect Emulator in Android Studio and then proceed with AVD setup.

SpeechToText API

The list of APIs provided by the SpeechToText SDK is as follows.

FunctionSummary
initialize APIAPI to initialize the SpeechToText SDK.
release APIAPI to release the SpeechToText SDK.
registerApp APIAPI to register an app for using server-based speech recognition features.
addListener APIAPI to add an event listener for voice recognition results.
removeListener APIAPI to remove a registered event listener.
request APIAPI to request real-time speech recognition.
stop APIAPI to stop the current voice recognition.
sendAudio APIAPI to extract text from stored PCM audio data.
completeAudioSend APIAPI to complete sending PCM audio data.