The PDF-Genie project aims to develop an intelligent system capable of understanding and processing the content within uploaded PDF documents. Leveraging the FAISS (Facebook AI Similarity Search) algorithm, this system can efficiently index and search through vast amounts of textual data, providing users with valuable insights and knowledge extraction.
- Document Upload: Users can upload PDF documents containing valuable information.
- Text Extraction: The system extracts text content from uploaded PDF documents for further analysis.
- Document Indexing: Utilizing the FAISS algorithm, the system indexes the textual data to enable fast and accurate search capabilities.
- Knowledge Retrieval: Users can search for specific information within the uploaded documents, with the system retrieving relevant sections or documents.
- Intelligent Insights: The AI Knowledge Genie provides intelligent insights and summaries based on the content of the uploaded documents.
- Python 3.x
- FAISS-cpu
- Streamlit
- PyPDF2
- google-generativeai
- python-dotenv
- langchain_google_genai
- chromadb
- Clone this repository:
git clone https://github.com/Balaji-V-S/PDF-Genie- Install the required packages:
pip install -r requirements.txt- Run the script
streamlit run main.py- Access the web interface by opening your web browser and navigating to the provided URL/IP
Contributions are welcome! If you have any suggestions, enhancements, or bug fixes, feel free to submit a pull request.
