
DEV Community

# webscraping

Posts

How We Optimized a Django Playwright Scraper to Save 60% on Rotating Proxy Bandwidth

Comments
4 min read
Building a Lean, Single-Worker Broken URL Monitor for Data Pipelines

Comments
6 min read
How Paywalls Actually Work: The Engineering Behind Them

2
Comments
14 min read
Why Cloudflare Breaks Proxy-Only Scrapers

Comments
4 min read
Give Your AI Agent a Web-Fetch Tool: a 60-Line MCP Server (Free, Self-Hosted)

Comments 1
10 min read
How to track Weibo hot-search velocity with Python in 2026 — the trending-delta problem and how to handle it

Comments
4 min read
How to Scrape E-Commerce Sites for AI Agents Using Playwright and LLMs

Comments
6 min read
Your AI Agent Is Paying for HTML It Never Reads — I Measured the 7x Token Tax

Comments
5 min read
What is Web Scraping? A Beginner's Guide with Real Python Code

2
Comments
2 min read
Your scraper says 200 OK. I measured how often it's lying.

1
Comments
7 min read
How Scraping API Pricing Changes Once You Need Browser Sessions

Comments
3 min read
How to Get Google Search Results in JSON for an AI Agent

3
Comments
6 min read
Building Anti-Bot Detection Systems: How RepoHunter Scrapes GitHub Trending Repositories

Comments
4 min read
Get any Instagram profile data in 10 lines of Python

1
Comments
2 min read
How I built a self-healing, robots-respecting web scraper (and put it on the Apify Store)

2
Comments
3 min read