anti-scraping

2 articles
sort: new top best
clear filter
0 6/10

A detailed retrospective on why Mercado Libre's in-house Amazon scraper required constant maintenance (every 14 days) due to DOM selector instability, layout variants, anti-bot escalation, and JS-rendering dependencies—leading them to adopt a managed API solution with auto-healing extraction logic instead of maintaining 23 custom spiders.

Mercado Libre Amazon Scrapy Selenium Chromium ScraperAPI Oxylabs Bright Data AWS Data Exchange Isabela Rodriguez
infosecwriteups.com · Isabela Rodriguez · 12 hours ago · details
0 3/10

An open-source SDK that obfuscates HTML content using CSS reordering techniques (flexbox, RTL, unicode-bidi) to render correctly in browsers while returning garbage to scrapers; includes honeypots, email obfuscation, and robots.txt AI crawler blocking.

obscrd Bun tsup React 18 TypeScript
obscrd.dev · larsmosr · 2 days ago · details · hn