WeSearch

Open-source Playwright wrapper that passes bot.sannysoft.com, pixelscan, and CreepJS in headless mode

·1 min read · 0 reactions · 0 comments · 10 views
#automation#opensource#python#webscraping
Open-source Playwright wrapper that passes bot.sannysoft.com, pixelscan, and CreepJS in headless mode
⚡ TL;DR · AI summary

An open-source Playwright wrapper has been developed to enhance web scraping capabilities. This library addresses common issues such as being blocked by websites by implementing various techniques to mimic human behavior. It has been tested successfully against multiple anti-bot services in headless mode.

Key facts
Original article
DEV.to (Top)
Read full at DEV.to (Top) →
Opening excerpt (first ~120 words) tap to expand

try { if(localStorage) { let currentUser = localStorage.getItem('current_user'); if (currentUser) { currentUser = JSON.parse(currentUser); if (currentUser.id === 3950052) { document.getElementById('article-show-container').classList.add('current-user-is-article-author'); } } } } catch (e) { console.error(e); } Shanewas Ahmed Posted on May 25 Open-source Playwright wrapper that passes bot.sannysoft.com, pixelscan, and CreepJS in headless mode #automation #opensource #python #webscraping Been scraping for a while and got tired of getting blocked the moment a page loads. Standard Playwright leaks everywhere — TLS fingerprint, navigator.webdriver, WebGL renderer differences. Built a library that handles all of that before the page even loads: 1.

Excerpt limited to ~120 words for fair-use compliance. The full article is at DEV.to (Top).

Anonymous · no account needed
Share 𝕏 Facebook Reddit LinkedIn Threads WhatsApp Bluesky Mastodon Email

Discussion

0 comments

More from DEV.to (Top)