pdf-document-processor

Meet MultiPDF 📚 Chat AI App! 🚀 Chat seamlessly with Multiple PDFs using Langchain, Google Gemini Pro & FAISS Vector DB with Seamless Streamlit Deployment. Get instant, accurate responses from Awesome Google Gemini OpenSource language Model. 📚💬 Transform your PDF experience now! 🔥✨

open-source google gemini openai chatbot-application python-3 chat-application gemini-api pdf-document-processor streamlit-application large-language-models llm generative-ai chatgpt langchain instructor-embeddings langchain-python gemini-pro

Updated Apr 23, 2024
Python

naiveHobo / pdfviewer

Star

PDFViewer is a GUI tool, written using python3 and tkinter, which lets you view PDF documents.

pdf tkinter pdf-viewer pdf-files pdf-document tkinter-graphic-interface tkinter-gui pdf-document-processor tkinter-python tkinter-library

Updated Jul 4, 2021
Python

lovasoa / pagelabels-py

Sponsor

Star

Python library to manipulate PDF page labels

pdf labels page pdf-document-processor

Updated Aug 3, 2024
Python

OnedocLabs / onedoc

Star

The first developer-oriented document platform. Generate, host and track PDFs with a single API, beautifully.

react nodejs html api pdf sdk ycombinator pdf-viewer document pdf-generation pdf-reader pdf-library document-generator pdf-document-processor pdf-reports react-print-pdf

Updated May 21, 2024
Python

liunian-Jay / MU-GOT

Star

PDF解析工具：GOT的vLLM加速实现，MinerU做布局识别裁剪、GOT做表格公式解析，实现RAG中的pdf解析

pdf-document-processor rag vllm retrieval-augmented-generation mllms mineru

Updated Nov 7, 2024
Python

KalyanM45 / DocGenius-Revolutionizing-PDFs-with-AI

Sponsor

Star

This is a Python application that allows you to load a PDF and ask questions about it using natural language. The application uses a LLM to generate a response about your PDF. The LLM will not answer questions unrelated to the document.

python openai pdf-reader pdf-document-processor langchain chat-with-pdf

Updated Dec 10, 2024
Python

sidphbot / Auto-Research

Star

Generate custom detailed survey paper with topic clustered sections and proper citations, from just a single query in just under 30 mins !!

python nlp ocr text-similarity text-generation pytorch topic-modeling summarization research-tool arxiv research-data-management scientific-publications research-and-development research-software-engineering scientific-research text-clustering arxiv-api pdf-document-processor title-generation

Updated Dec 10, 2023
Python

SiddhantSadangi / pdf-workdesk

Sponsor

Star

A Streamlit-powered application that provides a user-friendly interface for editing PDF documents.

python pdf webapp pdf-viewer pdfkit pdf-files pdf-document pdf-document-processor streamlit

Updated Aug 13, 2025
Python

PSPDFKit / nutrient-dws-client-python

Star

Official Python client library for Nutrient Document Web Services API - PDF processing, OCR, watermarking, and document manipulation with automatic Office format conversion

python pdf-converter pdf-generation pdf-document-processor ocr-python pdf-processing

Updated Aug 14, 2025
Python

papercast-dev / papercast

Star

A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GROBID, LangChain, listen as podcast. Customize your own pipelines.