Is using beautiful soup, selenium headless, and/or scraping against the TOS?
If not I can’t fathom why I keep getting the browser messaage about not being able to use this Replt.
In a bit of a fit I started hooking up brew (osx), vscode & it’s various extensions and tools on my local machine & Replit is far easier to get up and running etc etc…
Hi @SteveMallett !
What website are you trying to scrape?
If you are trying to scrape secure sites like government sites etc, I think that they could be blocked by Replit.
I used Beautiful Soup once to scrape my own website (for testing purposes). I got no such error. To my knowledge (which could be wrong), web scrapers are not inherently against the Replit ToS.
If there is a visible error, can you please show it? If you’re just asking about the nature of scraping, it should be allowed. Just remember to not scrape websites that Replit banned or your repl will go down.
IIRC (probably wrong) scraping is in the 100 days of Python thing? IDK, just remember seeing it somewhere on Replit.
Hello I took a quick look at your Repl and it seems to run fine, but I do have a theory on why you would be running into errors
We have a tool to track outgoing network requests from the machine you are using and attempting to scrape Reddit (especially if you run the Repl multiple times) would cause a lot of requests to be sent and to us it would appear as patterns similar to a DoS attack, resulting in temporary cooldowns.
My suggestion would be to use the Reddit API, but that doesn’t seem likely given the recent controversy Alternatively try loading the webpage with Javascript disabled/through some web caching service like Wayback? Sorry you are running into this issue, we are actively tuning our anti-abuse tools