Gleo AI SDK Overview
Overview
Gleo AI SDK provides AI technologies such as Speech-to-Text (SpeechToText), Text-to-Speech (TextToSpeech), and Large Language Models (LLM) as APIs for developing intelligent agents.
Developers can easily implement AI capabilities into their apps or services using the Gleo AI SDK, and enhance user experience by integrating with Gleo AI.
-
SpeechToText SDK: Delivers high-quality recognition in vehicle environments and supports a hybrid mode of on-device and server-based recognition.
- On-device recognition works without network connectivity, while server-based recognition provides higher accuracy when connected to the internet.
-
TextToSpeech SDK: Generates natural, human-like voices and supports a hybrid mode of on-device and server-based synthesis, similar to the SpeechToText SDK.
- Streaming is supported, enabling fast audio output in response to streaming text input generated by the LLM.
-
LLM SDK: Utilizes 42dot's proprietary LLM to generate contextually appropriate responses with low latency.
- Future support for PEFT (parameter-efficient fine-tuning) APIs will allow for even higher-quality responses.
Available SDKs
SDK Type | Description |
---|---|
SpeechToText SDK | Converts user speech into text in real time. |
TextToSpeech SDK | Converts text into natural-sounding speech for feedback or read-aloud use. |
LLM SDK | Provides a language model that supports NLP tasks such as summarization, Q&A, generation, and classification. |