Photobox for Kids

Enhancing family interactions for Kids utilizing Gemini.

What it does

We propose "Photobox for Kids," an interactive system using multi-modal recognition to enhance family interactions and early childhood education. It has two main components: an AI Camera for children to capture objects or family moments, and a Photobox for interactive learning at home. Parents can capture household items, which a Vision-Language Model (VLM) uses to generate tailored educational content. The system employs a 'Chain of Thought' to progress from simple queries to complex explanations. When children capture registered items, the system identifies and describes them. Unregistered items trigger descriptions generated by the Gemini 1.5 model. This simple photo capture method generates extensive Q&A content, promoting curiosity and understanding. A pilot in an international kindergarten showed children recalling 70% of 100 registered objects. The AI Camera captures family moments, and the Photobox provides rich interactive content when children present the printed photos. This system extends previous HCI work by using the Gemini model for richer interactive content. Ongoing studies are validating its effectiveness in enhancing family interactions.

Built with

Android

Team

Photobox for Kids

From

United States