Scrapy, a fast high-level web crawling & scraping framework for Python.
-
Updated
Apr 9, 2025 - Python
Scrapy, a fast high-level web crawling & scraping framework for Python.
Python APIs for web automation, testing, and bypassing bot-detection.
🕷️ An undetectable, powerful, flexible, high-performance Python library that makes Web Scraping simple and easy again!
The All in One Framework to Build Undefeatable Scrapers
AgentQL is a suite of tools for connecting your AI to the web. Featuring a query language and Playwright integrations for interacting with elements and extracting data quickly, precisely, and at scale. Includes REST API, Python and JavaScript SDKs, browser debugger.
Scalable Python web scraping scripts for +40 popular domains
In this Python Web Scraping Tutorial, we will outline everything needed to get started with web scraping. We will begin with simple examples and move on to relatively more complex.
A simple and easy to use web crawler for Python
Web scraping framework built for AI applications. Extract clean, structured content from any website with dynamic content handling, markdown conversion, and intelligent crawling capabilities. Perfect for RAG applications and AI training data pipelines. Features async processing, browser management, and Prometheus monitoring.
TLS Requests is a powerful Python library for secure HTTP requests, offering browser-like TLS client, fingerprinting, anti-bot page bypass, and high performance.
Web scraping API for building AI applications.
CobWeb is a Python library for web scraping. The library consists of two classes: Spider and Scraper.
A short introduction to scraping with Python with given steps and an example scraper script.
Builds a Tagalog dictionary by collecting Tagalog words from tagalog.pinoydictionary.com
WatchTower - A platform to save your valuable time while staying updated in the Cyber realm.
The News Headlines Tracker application collects the latest news headlines from major news sources such as CNN, BBC, and The New York Times.
The Facebook Comments Crawler is an unofficial tool for extracting comments from Facebook posts using Selenium in Python. It's designed to aid in academic and personal research. #Facebook comments scaper #Facebook comments crawler
"instagram-post-fetcher" is a Python module leveraging Selenium to extract Instagram post details, including account username, descriptions, media URLs, and post timestamps. Simplifying access to Instagram data for analytics and research.
Repository designed to help freshers easily grasp the basics of web scripting, offering simple guides and examples to build a strong foundation.
Add a description, image, and links to the web-scraping-python topic page so that developers can more easily learn about it.
To associate your repository with the web-scraping-python topic, visit your repo's landing page and select "manage topics."