VisionAid

Helping the visually impaired navigate the world with the power of AI.

What it does

VisionAid is a mobile application that helps visually impaired users navigate and interact with the world around them. The app captures images and sends them to Google’s Gemini 1.5 Flash model, which lets users identify everyday objects, navigate public spaces, and even recognize familiar faces and pets, creating a more connected and independent experience.

VisionAid’s intuitive interface lets users ask questions about their surroundings and hear immediate, accurate answers as voice feedback: the Gemini Flash model generates the response, and the Google Cloud Text-to-Speech API reads it aloud. Whether it’s identifying items in a grocery store, crossing a busy street safely, or recognizing a friend, VisionAid acts as an intelligent companion, offering users the confidence to explore the world on their terms.
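The image-to-speech flow described above could look roughly like the following sketch. It is not VisionAid's actual code: it assumes the app talks to the public REST endpoints of the Gemini API and Cloud Text-to-Speech, and the helper names (build_gemini_request, extract_answer, build_tts_request, decode_audio) are hypothetical. Only the request and response shapes are shown; sending the HTTP requests and handling API keys are left out.

```python
import base64

# Assumed REST endpoints; the source does not say how the app calls the services.
GEMINI_URL = ("https://generativelanguage.googleapis.com/v1beta/"
              "models/gemini-1.5-flash:generateContent")
TTS_URL = "https://texttospeech.googleapis.com/v1/text:synthesize"


def build_gemini_request(image_bytes, question, mime_type="image/jpeg"):
    """Build a generateContent body holding the user's question and one image."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return {
        "contents": [{
            "parts": [
                {"text": question},
                {"inline_data": {"mime_type": mime_type, "data": encoded}},
            ]
        }]
    }


def extract_answer(gemini_response):
    """Join the text parts of the first candidate in a generateContent response."""
    parts = gemini_response["candidates"][0]["content"]["parts"]
    return "".join(part.get("text", "") for part in parts).strip()


def build_tts_request(text, language_code="en-US"):
    """Build a text:synthesize body that asks for MP3 audio of the answer."""
    return {
        "input": {"text": text},
        "voice": {"languageCode": language_code},
        "audioConfig": {"audioEncoding": "MP3"},
    }


def decode_audio(tts_response):
    """Decode the base64 audio returned by text:synthesize into raw bytes."""
    return base64.b64decode(tts_response["audioContent"])
```

In this sketch the app would POST the first body to GEMINI_URL, pass the extracted answer to build_tts_request, POST that body to TTS_URL, and play the decoded audio bytes to the user.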

VisionAid aims to break down barriers and make the world more accessible for the visually impaired, using cutting-edge technology to foster independence and enhance daily living. VisionAid is not just an app; it’s a step towards a future where everyone can experience their surroundings with clarity and confidence, thanks to rapid advances in AI.

Built with

Google Cloud Text-To-Speech

Team

From

Germany