pdf-search

Here are 28 public repositories matching this topic...

Bklieger / Semantic

SemanticPDF: Drag, Drop, Semantic Search - SemanticPDF is a simple, privacy-focused application that makes it easy to upload a PDF file and run a semantic search on its contents.

pdf embeddings semantic-search pdf-search

Updated Apr 4, 2024
TypeScript

jina-ai / jina-vdr

Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval

embeddings multi-modal pdf-search visual-document-retrieval vidore

Updated Aug 4, 2025
Python

njmarko / googolplex-pdf-search

Python program for searching PDF text, ranking the results, and exporting highlighted search results as a PDF. Uses a trie, stack, heap, and page graph. Converts queries to postfix notation, which allows logical expressions and phrases. Offers "did you mean" suggestions.

autocomplete stack graph trie heap pdf-generation didyoumean datastructures-algorithms postfix-evaluation pdf-highlighter pdf-search

Updated Aug 28, 2024
Python
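
The query-to-postfix step mentioned in the description above can be illustrated with a small shunting-yard sketch. This is not the repository's code: the operator names `AND`, `OR`, `NOT`, their precedence, and the function name `to_postfix` are assumptions chosen for illustration.

```python
# Hypothetical operator set and precedence (assumed, not taken from the repository).
PRECEDENCE = {"NOT": 3, "AND": 2, "OR": 1}


def to_postfix(tokens):
    """Convert an infix boolean query (list of tokens) to postfix order."""
    output, stack = [], []
    for tok in tokens:
        if tok == "(":
            stack.append(tok)
        elif tok == ")":
            # Pop operators until the matching opening parenthesis.
            while stack and stack[-1] != "(":
                output.append(stack.pop())
            if not stack:
                raise ValueError("unbalanced parentheses")
            stack.pop()  # discard "("
        elif tok in PRECEDENCE:
            # AND/OR are left-associative; NOT is a unary prefix operator,
            # so an incoming NOT never pops another NOT.
            while (
                stack
                and stack[-1] in PRECEDENCE
                and (
                    PRECEDENCE[stack[-1]] > PRECEDENCE[tok]
                    or (PRECEDENCE[stack[-1]] == PRECEDENCE[tok] and tok != "NOT")
                )
            ):
                output.append(stack.pop())
            stack.append(tok)
        else:
            output.append(tok)  # plain search term
    while stack:
        op = stack.pop()
        if op == "(":
            raise ValueError("unbalanced parentheses")
        output.append(op)
    return output


postfix = to_postfix("python AND ( trie OR NOT heap )".split())
print(postfix)
```

A postfix query can then be evaluated with a single stack: push the set of matching pages for each term, and combine sets whenever an operator appears.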

herohql / pdf-master

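The most full-featured PDF component for Vue: supports rendering, page-number extraction and navigation, a file-load-complete listener, a page-change listener, text search, keyword highlighting, and table-of-contents extraction.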

pdf vue pdf-search pdf-highlight pdf-directory

Updated Mar 14, 2025
HTML

raisultan / hermes

Use semantic search on PDFs locally

embeddings semantic-search pdf-search

Updated Mar 30, 2024
Python

ai-naymul / DocuVisQA

DocuVisQA (Document Visual Question Answering) is a Python project that uses Google's Generative AI and LangChain for document processing, text splitting, and question answering. It also supports image processing, with Streamlit providing an interactive UI.

python open-source pdf chatbot document image-recognition streamlit pdf-search documentretrieval-exe streamlit-application langchain langchain-python

Updated Apr 8, 2024
Python

eli64s / pdflex

CLI for merging PDF contexts.

pdf-converter pdf-document pdf-generator pdf-manipulation pdf-extractor pdf-library pdf-parser pdf-data-extraction pdf-processor pdf-tools pdf-document-processor python-pdf pdf-search pdf-text-extraction pdf-python pdf-automation python-pdf-tools pdf-document-parser pdf-regex

Updated Mar 20, 2025
Python

Ashad001 / UltimateRAG

In Development

machine-learning ai embeddings gemini openai web-search rag pdf-search jina llms llama-index vector-store

Updated Jul 30, 2024
Python

FelixKohlhas / pdf_search

A web interface that allows searching for PDFs by their content

pdf flask sqlite pdf-search

Updated Nov 30, 2023
Python

yvnggodemis / pdf-parse

PDF Parser built in Rust

rust pdf pdf-reader pdf-parser pdfparser pdfsearch pdfreader pdf-parse pdf-search pdfparse

Updated Dec 19, 2024
Rust

deckerego / docidx

A document indexing daemon that populates Elasticsearch indexes with the contents and metadata of several document types, including PDFs and image scans. It powers Facile Search but can be reused for anything that requires search indexing of scanned documents.

search-engine elasticsearch full-text-search scanned-documents pdf-search

Updated Mar 5, 2025
Java

shreyansh-kothari / PDF-Querying-using-TF-IDF-from-Scratch

Given a set of PDFs and a query, finds the most relevant PDF using TF-IDF. TF-IDF is implemented from scratch, without any library.

python glob pdf-converter python3 tf-idf querying pdfminer document-search pdf-search

Updated Oct 15, 2019
Python
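
As a rough illustration of library-free TF-IDF ranking like that described above, here is a minimal sketch. It is not the repository's code: it assumes the PDF text has already been extracted into plain strings, and the names `tokenize` and `rank_documents`, the whitespace tokenizer, and the unsmoothed `log(N / df)` weighting are all illustrative choices.

```python
import math
from collections import Counter


def tokenize(text):
    # Naive tokenizer (an assumption): lowercase and split on whitespace.
    return text.lower().split()


def rank_documents(docs, query):
    """Return document names ordered by TF-IDF score for the query, best first."""
    tokenized = {name: tokenize(text) for name, text in docs.items()}
    n_docs = len(tokenized)

    # Document frequency: in how many documents each term occurs.
    df = Counter()
    for tokens in tokenized.values():
        df.update(set(tokens))

    scores = {}
    for name, tokens in tokenized.items():
        counts = Counter(tokens)
        score = 0.0
        for term in tokenize(query):
            if term not in counts:
                continue
            tf = counts[term] / len(tokens)      # term frequency in this document
            idf = math.log(n_docs / df[term])    # rarer terms weigh more
            score += tf * idf
        scores[name] = score
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical extracted text standing in for real PDF contents.
docs = {
    "a.pdf": "tries and heaps for search",
    "b.pdf": "cooking pasta at home",
    "c.pdf": "search ranking with heaps and more heaps",
}
ranking = rank_documents(docs, "heaps ranking")
print(ranking)
```

In a real pipeline the strings would come from a PDF text extractor (the repository's tags mention pdfminer), and a term that occurs in every document scores zero under this weighting.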

tanmaypatil / resume-intelligence

Resume search application using OpenAI RAG and file search. A demo application that shows how RAG from OpenAI can simplify resume screening. An open-source VLM example is to follow.

nlp gradio pdf-search vector-database openai-api retrieval-augmented-generation

Updated Dec 28, 2024
Jupyter Notebook

Sazizi2025 / PDF-Founder

Short on time? Can't search every PDF one by one for the content you want? Well, PDF-Founder is here...

python pdf gui image tesseract rgb graphical tesseract-ocr easy-to-use image-generator snipping pdf-search-engine pymupdf pysimplegui pdf-search ptl pymupdf-fitz

Updated Jan 8, 2024
Python

aemal / pdf-finder

A tool to search for text in PDF files using multiple methods, including OCR (Optical Character Recognition).

ocr pdf-search pdf-finder search-in-pdf

Updated Apr 23, 2025
Python

M-Husnain-Ali / AI-PDF-Search-Engine

A powerful AI-powered PDF search and question-answering system built with LangChain, Pinecone Vector Store, OpenAI, and Supabase. Upload PDFs, ask questions, and get intelligent answers with persistent conversation memory.

embedded-systems openai question-answering semantic-search text-embedding pinecone streamli rag pdf-search vector-database ai-chatbot supbase langchain intelligent-document-processing chat-with-pdf pdf-rag

Updated Jul 26, 2025
Python

sampconrad / busca-diario

Program that searches the PDFs of the Diário Oficial (official gazette) for a list of names of parties to legal proceedings.

python law brasil pdf-search

Updated Dec 19, 2023
Python

logxdx / contextualized-late-interation-with-pdfs

A high-performance RAG system for PDFs using multi-vector embeddings (ColPali / ColQwen / ColSmol) with vector search in Qdrant, prefetch optimization, and reranking for improved relevance. Designed for speed, accuracy, and scalability, this system is ideal for building intelligent search, document understanding, and QA applications.

rag pdf-search colpali pdf-rag colqwen2 colsmol

Updated Aug 9, 2025
Python

domwal / acervo-digital-pessoal

PHP website that indexes all PDF content and provides an easy way to find any text.

javascript mysql css html bootstrap jquery php pdf ajax windows-10 indexing full-text-search document-search php73 linux-debian pdf-search

Updated Apr 15, 2024
PHP

ad4529 / unichemfinder

Repository for the Indexing, Search and Evaluation of UniChemFinder

pdf-search granular-search chemical-ir

Updated Apr 21, 2025
Python
