Optima Ve

Enabling Independence for vision impaired people using Gemini 1.5 Pro

What it does

Optima Ve - Empowering Independence for the Visually Impaired
Optima Ve is a innovative & impactful solution that empowers visually impaired individuals to navigate daily life with greater independence. Utilizing the Gemini 1.5 Pro multimodal Language Learning Model (LLM) from Google, Optima Ve seamlessly integrates vision and voice technologies to offer an intuitive user experience.
Purpose and Vision:
Optima Ve aims to provide a seamless, user-friendly way for visually impaired individuals to perform everyday tasks using their smartphones, fostering independence and reducing the challenges of visual impairment.
Core Functionality:
Contextual Understanding: Gemini 1.5 Pro’s LLM deeply comprehends user queries by interpreting complex requests and asking clarifying questions. This ensures accurate understanding before executing tasks.
Task Execution: Once the issue is understood, the AI performs tasks such as identifying objects, reading text, or navigating spaces, addressing a wide range of daily challenges.
Voice Interaction: Whisper, a sophisticated speech-to-text engine, facilitates natural, conversational voice interaction, enabling effortless communication.
Vision Capabilities: The app uses the smartphone’s camera to process video inputs, assisting users in locating items, identifying obstacles, and reading text.

Built with

  • Android
  • Web/Chrome
  • React Native
  • Whisper

Team

By

OptimaVe - Enabling Independence

From

Pakistan