Many (most?) "big content" sites let Google and Bing spiders scrape the contents...

Atlas22 · on May 24, 2023

Just FYI, Google and Bing publish the user agent strings[1][2] for their crawlers. In my experience, most of the typical ad-infested and paywalled news sites won't display the paywall if you change your user agent to that of a crawler they prefer.

[1] https://developers.google.com/search/docs/crawling-indexing/...

[2] https://www.bing.com/webmasters/help/which-crawlers-does-bin...
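
For anyone who wants to see how a server responds to a crawler user agent, here is a minimal sketch using only the Python standard library. The `build_request` helper is a hypothetical name for illustration, and the user agent shown is one of the Googlebot strings Google documents; check the published list for the current ones. A site that verifies crawler IP addresses will not be fooled by the header alone.

```python
import urllib.request

# One of the Googlebot user agent strings that Google documents.
# Treat it as an example; the published list is the source of truth.
GOOGLEBOT_UA = (
    "Mozilla/5.0 (compatible; Googlebot/2.1; "
    "+http://www.google.com/bot.html)"
)


def build_request(url: str, user_agent: str) -> urllib.request.Request:
    """Build a GET request that sends the given User-Agent header.

    Hypothetical helper for illustration; it only constructs the request
    and performs no network access.
    """
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


if __name__ == "__main__":
    req = build_request("https://example.com/", GOOGLEBOT_UA)
    # urlopen sends the request with the overridden header.
    with urllib.request.urlopen(req, timeout=10) as resp:
        print(resp.status, len(resp.read()))
```

Fetching the same URL once with this header and once with a normal browser string, then comparing the responses, shows whether the site treats the two differently.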

wolverine876 · on May 24, 2023

Doesn't almost every site on the web know exactly what the Google bot looks like?

peter422 · on May 24, 2023

Google gives precise details about how to verify their bot is crawling your site and how to denote what content is paywalled and what isn’t.
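
The verification Google describes is a reverse DNS lookup on the requesting IP, a check that the hostname is in a Google crawler domain, and then a forward lookup to confirm the hostname resolves back to the same IP. A rough sketch of that check follows. The function names and the injectable `reverse`/`forward` parameters are my own assumptions for illustration, and the domain list is a simplification, so check Google's documentation for the full set.

```python
import socket
from typing import Callable, List

# Assumed, simplified list of domains that Google's crawler hostnames end with.
GOOGLE_CRAWLER_DOMAINS = (".googlebot.com", ".google.com")


def reverse_lookup(ip: str) -> str:
    """Return the primary hostname for an IP (PTR lookup)."""
    return socket.gethostbyaddr(ip)[0]


def forward_lookup(hostname: str) -> List[str]:
    """Return every address the hostname resolves to."""
    return [info[4][0] for info in socket.getaddrinfo(hostname, None)]


def is_verified_googlebot(
    ip: str,
    reverse: Callable[[str], str] = reverse_lookup,
    forward: Callable[[str], List[str]] = forward_lookup,
) -> bool:
    """Reverse-then-forward DNS check for a claimed Googlebot IP.

    Hypothetical helper; the resolvers are injectable so the logic can be
    exercised without network access.
    """
    try:
        hostname = reverse(ip).rstrip(".").lower()
    except OSError:  # socket.herror and socket.gaierror subclass OSError
        return False
    # The leading dot in each suffix rejects names like "fakegooglebot.com".
    if not hostname.endswith(GOOGLE_CRAWLER_DOMAINS):
        return False
    try:
        # Forward-confirm: the hostname must resolve back to the same IP.
        return ip in forward(hostname)
    except OSError:
        return False
```

A server would call `is_verified_googlebot(client_ip)` only for requests whose user agent claims to be Googlebot, and would probably cache the result, since DNS lookups on every request get slow.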

Aachen · on May 24, 2023

Bingo. This is what I use to incentivize using a nonmonopolistic search engine to find the few sites I run.