Visibl

Visibl turns audiobooks into videobooks

What it does

Visibl is an iOS app that transforms audiobooks into immersive videobooks, turning your listening experience into a visual journey. As you listen, Visibl dynamically generates images in real-time, allowing users to see a unique visual interpretation of the audiobook they are enjoying. The app doesn't just create static images; it enables users to influence and guide the visuals, making each journey through a book highly personalized.

Visibl leverages the power of the Gemini API in three key ways:

- Audio Transcription: The app transcribes the audiobook audio into text, ensuring accurate representation of the content.

- Named Entity Recognition (NER): This allows the app to identify and focus on key characters, places, and objects within the text, which are crucial for generating contextually relevant visuals (Gemini 1.5 Pro)

- Image Prompt Generation: Using the insights from transcription and NER, the app generates detailed and personalized image prompts, which are then used by diffusion models to create the visuals in real-time. (Gemini 1.5 Pro)

This combination of Gemini API features ensures that Visibl not only provides a novel way to experience audiobooks but also tailors the experience uniquely to each user.

Built with

  • Firebase

Team

By

visibl

From

UK