🔥 The Python library for PDF forms.
-
Updated
Aug 12, 2025 - Python
🔥 The Python library for PDF forms.
pdfCropMargins -- a program to crop the margins of PDF files
A small utility making use of the pypdf library to provide a (somewhat) lighter alternative to pdftk
A tool to sign PDF files. With Linux support.
CCKS2019评测任务五-公众公司公告信息抽取,第3名
Meet MultiPDF 📚 Chat AI App! 🚀 Chat seamlessly with Multiple PDFs using Langchain, Google Gemini Pro & FAISS Vector DB with Seamless Streamlit Deployment. Get instant, accurate responses from Awesome Google Gemini OpenSource language Model. 📚💬 Transform your PDF experience now! 🔥✨
PDFViewer is a GUI tool, written using python3 and tkinter, which lets you view PDF documents.
Python library to manipulate PDF page labels
The first developer-oriented document platform. Generate, host and track PDFs with a single API, beautifully.
PDF解析工具:GOT的vLLM加速实现,MinerU做布局识别裁剪、GOT做表格公式解析,实现RAG中的pdf解析
This is a Python application that allows you to load a PDF and ask questions about it using natural language. The application uses a LLM to generate a response about your PDF. The LLM will not answer questions unrelated to the document.
Generate custom detailed survey paper with topic clustered sections and proper citations, from just a single query in just under 30 mins !!
A Streamlit-powered application that provides a user-friendly interface for editing PDF documents.
Official Python client library for Nutrient Document Web Services API - PDF processing, OCR, watermarking, and document manipulation with automatic Office format conversion
A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GROBID, LangChain, listen as podcast. Customize your own pipelines.
Search and replace text in PDF files with PyPDF.
✨ A batch of useful code/scripts: run commands automatically, finish repetitive stupid operations, perform format conversions, etc.
Prepare documents for distribution
Create a ChatGPT for uploaded pdf using Langchain
This repo contains script using Tesseract OCR to digitize pdf ebooks to text format.
Add a description, image, and links to the pdf-document-processor topic page so that developers can more easily learn about it.
To associate your repository with the pdf-document-processor topic, visit your repo's landing page and select "manage topics."