Chat with PDF is an interactive application that allows users to converse with textual content within PDF documents. Using the power of LangChain and Streamlit, this app enables efficient information retrieval by answering questions based on the content of uploaded documents.
- Upload PDF documents directly into the application.
- Ask natural language questions and receive accurate, context-aware answers.
- Used state-of-the-art language models for response generation.
Before running the application, ensure you have Python installed on your system. To install the required dependencies, run the following command:
pip install streamlit PyMuPDF langchain-openai dotenv
To start the application, navigate to the app's directory and run:
streamlit run app.py
The application will be available in your web browser at localhost:8501
by default.
- Load documents: Users can upload PDF documents through the web interface.
- Extract text & Pre-process: Text is extracted from the PDF and prepared for processing.
- Split documents into chunks: The application splits documents into smaller, manageable text chunks.
- Vector store for Retrieval: Chunks are indexed in a vector store for efficient retrieval.
- LLM for Generation: Utilizes a large language model (LLM) to generate responses to user queries.
- Query & Answer: The user can query the document content, and the application provides answers.