Skip to main content

Gleo AI SDK Overview

Overview

Gleo AI SDK provides AI technologies such as Speech-to-Text (SpeechToText), Text-to-Speech (TextToSpeech), and Large Language Models (LLM) as APIs for developing intelligent agents.
Developers can easily implement AI capabilities into their apps or services using the Gleo AI SDK, and enhance user experience by integrating with Gleo AI.

  • SpeechToText SDK: Delivers high-quality recognition in vehicle environments and supports a hybrid mode of on-device and server-based recognition.

    • On-device recognition works without network connectivity, while server-based recognition provides higher accuracy when connected to the internet.
  • TextToSpeech SDK: Generates natural, human-like voices and supports a hybrid mode of on-device and server-based synthesis, similar to the SpeechToText SDK.

    • Streaming is supported, enabling fast audio output in response to streaming text input generated by the LLM.
  • LLM SDK: Utilizes 42dot's proprietary LLM to generate contextually appropriate responses with low latency.

    • Future support for PEFT (parameter-efficient fine-tuning) APIs will allow for even higher-quality responses.

Available SDKs

SDK TypeDescription
SpeechToText SDKConverts user speech into text in real time.
TextToSpeech SDKConverts text into natural-sounding speech for feedback or read-aloud use.
LLM SDKProvides a language model that supports NLP tasks such as summarization, Q&A, generation, and classification.