DEV Community

Scraping

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Optimizing Chunking and Data Extraction for Zero-Hallucination RAG

Optimizing Chunking and Data Extraction for Zero-Hallucination RAG

Comments
4 min read
When web scraping breaks: using AI to extract messy data

When web scraping breaks: using AI to extract messy data

Comments
5 min read
Why I Stopped Writing CSS Selectors for Web Scraping

Why I Stopped Writing CSS Selectors for Web Scraping

Comments
4 min read
Track YC Demo Day Companies in Real Time (with code)

Track YC Demo Day Companies in Real Time (with code)

Comments
5 min read
A Self-Hosted Web Content Extraction API

A Self-Hosted Web Content Extraction API

9
Comments 1
5 min read
Scraping dynamic pages with Python, Playwright and AWS Lambda

Scraping dynamic pages with Python, Playwright and AWS Lambda

Comments
4 min read
Architecture of a Rental Aggregator: Scraping and Normalizing 90+ Sources

Architecture of a Rental Aggregator: Scraping and Normalizing 90+ Sources

Comments
4 min read
How I scraped 50k YouTube subtitles in 2 weeks for $7 (and the legal gray zones)

How I scraped 50k YouTube subtitles in 2 weeks for $7 (and the legal gray zones)

Comments
4 min read
API or browser agent? We picked yes.

API or browser agent? We picked yes.

Comments
7 min read
ISP proxies, AI crawlers, and the slow death of datacenter IPs: 2026 in numbers

ISP proxies, AI crawlers, and the slow death of datacenter IPs: 2026 in numbers

Comments
8 min read
I Tested 15 LLMs for Web Scraping and Built Heuristics Instead

I Tested 15 LLMs for Web Scraping and Built Heuristics Instead

Comments
3 min read
How I Sniffed Xiaohongshu's Collection API in 90 Seconds — and Why CORS Made Me Rewrite the Whole Approach

How I Sniffed Xiaohongshu's Collection API in 90 Seconds — and Why CORS Made Me Rewrite the Whole Approach

Comments
6 min read
6 Apify actors I actually use myself

6 Apify actors I actually use myself

Comments
3 min read
Anti-bot without the arms race: what Camoufox does differently

Anti-bot without the arms race: what Camoufox does differently

1
Comments
4 min read
Web Crawling e Web Scraping

Web Crawling e Web Scraping

Comments
3 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.