What it checks
- HTTP status, redirects, final URL, and content type
- CDN and anti-bot hints visible in response headers and page samples
- Robots.txt and sitemap clues that shape crawl planning
- Whether direct HTTP looks enough or browser rendering is likely needed