PDF MINIMALIZER APP

CHANGE BOOKS INTO AN EASY ENGLISH VERSION (without changing their flow)

What it does

My app simplifies difficult text in PDF books, making it easier to read for non-native English speakers. The app is designed to help users understand complex language and improve their vocabulary by breaking down hard words and sentences into simpler English.

Here's how it works:

1. Text Extraction: The app uses Tesseract OCR to extract text from PDF files, so it can also process books that exist only as scanned pages or images.

2. Simplifying Text: The core feature of the app relies on the Google Gemini API. After extracting the text, the app sends it to the Gemini API, which rephrases complex sentences and identifies difficult words.

3. Replacement and Output: The app uses regular expressions to pull the difficult words out of the API response and replace them in the text with the simpler definitions the API provides. Users can choose between two modes: Beginner and Intermediate. Beginner mode simplifies the text more thoroughly, while Intermediate mode provides more challenging vocabulary to help users gradually improve their English skills.

4. Results: Finally, the app generates a text file with the simplified content, making it easier to read and understand. This output can be a valuable tool for learners who struggle with advanced English texts.
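
The replacement step (step 3) can be sketched roughly as follows. This is only an illustration, not the app's actual code: it assumes the API response lists each difficult word on its own line as "word: simpler phrase", and the function names parse_glossary and replace_difficult_words are hypothetical.

```python
import re


def parse_glossary(api_response: str) -> dict[str, str]:
    """Parse lines shaped like 'word: simpler phrase' into a dict.

    The response format is an assumption made for this sketch.
    """
    pattern = re.compile(
        r"^[ \t]*([A-Za-z'-]+)[ \t]*:[ \t]*(.+?)[ \t]*$", re.MULTILINE
    )
    return {word.lower(): simple for word, simple in pattern.findall(api_response)}


def replace_difficult_words(text: str, glossary: dict[str, str]) -> str:
    """Replace every whole-word glossary match with its simpler phrase."""
    if not glossary:
        return text
    # Longest words first so a longer entry wins over a shorter prefix.
    words = sorted(glossary, key=len, reverse=True)
    alternation = "|".join(re.escape(w) for w in words)
    pattern = re.compile(rf"\b({alternation})\b", re.IGNORECASE)
    # Matching ignores case; the original capitalisation is not preserved.
    return pattern.sub(lambda m: glossary[m.group(1).lower()], text)


if __name__ == "__main__":
    response = "ubiquitous: found everywhere\nephemeral: short-lived\n"
    glossary = parse_glossary(response)
    print(replace_difficult_words("Ubiquitous phones make ephemeral trends.", glossary))
```

Building one combined pattern keeps the replacement to a single pass over the text, which matters for book-length input.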

By integrating the Gemini API, the app not only simplifies language but also customizes the learning experience based on the user’s English proficiency level.
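
One way the two modes could shape the request is by changing the instruction that accompanies the extracted text. The sketch below is an assumption, not the app's real prompt: MODE_INSTRUCTIONS, build_prompt, and the wording of the instructions are all hypothetical, and the actual call to the Gemini API is left out.

```python
# Hypothetical per-mode instructions; the real app's wording is not known.
MODE_INSTRUCTIONS = {
    "Beginner": (
        "Rewrite the text in very simple English. "
        "Use short sentences and replace every uncommon word."
    ),
    "Intermediate": (
        "Rewrite the text in clear English. "
        "Keep moderately challenging words and replace only the hardest ones."
    ),
}


def build_prompt(text: str, mode: str) -> str:
    """Combine the mode-specific instruction with the extracted text."""
    if mode not in MODE_INSTRUCTIONS:
        raise ValueError(f"Unknown mode: {mode!r}")
    return (
        f"{MODE_INSTRUCTIONS[mode]}\n"
        # Assumed response format, chosen so the reply is easy to parse.
        "Then list each difficult word as 'word: simpler phrase', one per line.\n\n"
        f"Text:\n{text}"
    )


if __name__ == "__main__":
    print(build_prompt("The ubiquitous device altered society.", "Beginner"))
```

Keeping the instructions in a dictionary means a further level could be added later without touching the rest of the code.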

Built with

  • Python
  • Streamlit

Team

By

White Rock

From

India