web-scraping

Here are 301 public repositories matching this topic...

codingforentrepreneurs / 30-Days-of-Python

Learn Python for the next 30 (or so) Days.

python api flask automation tutorial csv jupyter rest-api selenium pandas python3 web-scraping selenium-webdriver fastapi

Updated Feb 27, 2024
HTML

programminghistorian / jekyll

Star

Jekyll-based static site for The Programming Historian

Updated Aug 15, 2025
HTML

jaebradley / basketball_reference_web_scraper

Star

NBA Stats API via Basketball Reference

python nba web-scraper web-scraping basketball-reference

Updated Aug 16, 2025
HTML

austinoboyle / scrape-linkedin-selenium

Star

`scrape_linkedin` is a python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into structured json.

python scraper linkedin scraping selenium web-scraper web-scraping scrape selenium-webdriver

Updated Oct 16, 2022
HTML

davidteather / everything-web-scraping

Sponsor

Star

Learn everything web scraping with David Teather Codes on YouTube

python course everything reverse-engineering python3 web-scraping courses webscraping hacktoberfest youtube-series python-web-scraper project-based-learning web-scraping-tutorial project-based-learning-courses hacktoerfest web-scraping-python project-based-tutorials

Updated Jul 31, 2023
HTML

City-Bureau / city-scrapers

Star

Scrape, standardize and share public meetings from local government websites

python open-data web-scraping scrapy city-scrapers

Updated Jun 19, 2025
HTML

currentslab / extractnet

Star

A fork of Dragnet that also extract author, headline, date, keywords from context, as well as built in metadata extraction all in one package

python machine-learning text-mining news web-scraping webscraping news-articles news-extractor content-extraction news-extraction text-cleaning date-extraction author-extraction

Updated May 19, 2025
HTML

programminghistorian / ph-submissions

Star

The repository and website hosting the peer review process for new Programming Historian lessons

python api open-source mapping multi-lingual web-scraping digital-humanities data-management pedagogy web-archiving network-analysis linked-open-data programming-historian dh open-educational-resources r-studio digital-history distant-reading

Updated Aug 15, 2025
HTML

khuyentran1401 / top-github-scraper

Sponsor

Star

Scape top GitHub repositories and users based on keywords

github python github-api scraping web-scraper web-scraping

Updated Jun 27, 2023
HTML

scrapehero / selectorlib

Star

A library to read a YML file with Xpath or CSS Selectors and extract data from HTML pages using them

python scraping web-scraping selectors xpath

Updated Jan 30, 2023
HTML

LexiestLeszek / sova_ollama

Star

Open source implementation of Sova - RAG-based Web search engine using power of LLMs. Using Langchain, Ollama, HuggingFace Embeddings and scraping google search results.

web-scraping large-language-models llm retrieval-augmented-generation rag-implementation

Updated Feb 12, 2024
HTML

the-markup / investigation-google-search-audit

Star

Materials to reproduce findings in our story, "Google’s Top Search Result? Surprise! It’s Google"

search-engine web-scraping google-search algorithm-auditing

Updated Jul 28, 2020
HTML

Data-on-the-Mind / 2017-summer-workshop

Star

Exercises, data, and more for our 2017 summer workshop (funded by the Estes Fund and in partnership with Project Jupyter and Berkeley's D-Lab)

Updated Mar 21, 2019
HTML

testdrivenio / concurrent-web-scraping

Star

Building a Concurrent Web Scraper with Python and Selenium

web-scraping python-selenium python-concurrency python-web-scraping

Updated Dec 22, 2021
HTML

yusuzech / web-scraping-projects

Star

List of my scraping projects

r web-scraping rvest httr

Updated Apr 19, 2019
HTML

tashapiro / drag-race

Star

Project dedicated to collecting, organizing, and analyzing information about RuPaul's Drag Race and related franchises.

spotify r data-visualization web-scraping rupaul drag-race

Updated Aug 24, 2023
HTML

maxhumber / scrape.world

Sponsor

Star

The Web Scraping Sandbox

selenium web-scraping gazpacho

Updated Dec 31, 2024
HTML

KenzoBH / Web-Scraping-and-EDA-iFood

Star

Web Scraping and EDA from iFood website data.

python jupyter jupyter-notebook eda web-scraper web-scraping scraped-data data-cleaning cleaning

Updated Jun 20, 2021
HTML

rohit-yadav / scraping-news-articles

Star

Create a web crawler that goes through the section of a newspaper website and extracts unique articles from different pages of sections.

crawler web-scraping data-analysis screen-scraping

Updated Sep 3, 2018
HTML

ricardo-mattoss / Brazilian-Soccer-Data

Star

Scraping and updating of data from the championships that Brazilian soccer teams participate in

data-science r web-scraping soccer-data gh-actions

Updated Oct 9, 2022
HTML

Improve this page

Add a description, image, and links to the web-scraping topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the web-scraping topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

web-scraping

Here are 301 public repositories matching this topic...

codingforentrepreneurs / 30-Days-of-Python

programminghistorian / jekyll

jaebradley / basketball_reference_web_scraper

austinoboyle / scrape-linkedin-selenium

davidteather / everything-web-scraping

City-Bureau / city-scrapers

currentslab / extractnet

programminghistorian / ph-submissions

khuyentran1401 / top-github-scraper

scrapehero / selectorlib

LexiestLeszek / sova_ollama

the-markup / investigation-google-search-audit

Data-on-the-Mind / 2017-summer-workshop

testdrivenio / concurrent-web-scraping

yusuzech / web-scraping-projects

tashapiro / drag-race

maxhumber / scrape.world

KenzoBH / Web-Scraping-and-EDA-iFood

rohit-yadav / scraping-news-articles

ricardo-mattoss / Brazilian-Soccer-Data

Improve this page

Add this topic to your repo