February 2024 Crawl
This is largely the same as the December 2023 crawl code.
Differences:
- well-known data is collected by the crawler
- column values in the debugging table are capped at 4,000 characters, as this is what is specified in our table
- one new human check regular expression