[GitHub Trending] D4Vinci/Scrapling

7.6 relevance

Adaptive web scraping framework, useful for data engineering but not novel.

2026-06-01 Open Source github.com

🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl! - D4Vinci/Scrapling

Summary

Scrapling is an adaptive Python web scraping framework whose parser automatically relocates elements when websites change (via auto_save) and whose fetchers bypass anti-bot systems like Cloudflare Turnstile. It includes a spider framework for concurrent, multi-session crawls with pause/resume and automatic proxy rotation, supporting both simple requests and large-scale crawling.

Key Takeaways

Evaluate Scrapling for your data extraction workflows to reduce breakage from site updates and avoid manual anti-bot workarounds.

Why it matters

For a solutions architect focused on data engineering and automation, Scrapling reduces maintenance overhead from website changes and simplifies bypassing anti-bot protections, making it a strong candidate for production scraping pipelines.

Author

D4Vinci