Question 1

What is an HTTP header inspector used for?

Accepted Answer

An HTTP header inspector fetches the response headers from a web server and displays them in a readable format. Web developers and data engineers use it to understand caching behavior, detect CDN or WAF protection, troubleshoot CORS errors, and assess how accessible a site is for automated scraping or API requests.

Question 2

How can I tell if a site uses Cloudflare from its headers?

Accepted Answer

The clearest signal is the cf-ray header, which is added to every response Cloudflare proxies. You may also see server: cloudflare or cf-cache-status headers. Cloudflare's presence doesn't automatically mean scraping is blocked — it depends on the site's security rules — but it does mean bot-detection and potential rate limiting are in play.

Question 3

What does a 403 response header mean for web scraping?

Accepted Answer

A 403 Forbidden response means the server recognized your request but refused to fulfill it. For scrapers, this usually means your IP, user-agent, or request pattern has been identified and blocked. Common causes include missing authentication cookies, a blocked IP range, or a WAF rule triggered by your headers.

Question 4

What are rate limit headers and how should I respect them?

Accepted Answer

Rate limit headers (x-ratelimit-remaining, x-ratelimit-reset, retry-after) tell you how many requests you have left and when your quota resets. Responsible scrapers read these headers and back off accordingly. Ignoring rate limits leads to 429 errors, IP bans, and potential legal issues. Aim to stay well below the limit — around 50-60% of the stated maximum.

Question 5

What is the difference between HEAD and GET for header inspection?

Accepted Answer

A HEAD request asks the server for headers only, without sending the response body. This is faster and more efficient than GET. However, some servers do not support HEAD and return 405 Method Not Allowed. This tool automatically falls back to GET if HEAD is not supported, so you always get accurate header data.

Question 6

Why do response headers matter for CORS and API access?

Accepted Answer

The access-control-allow-origin header controls which origins browsers allow to make cross-origin requests. If it is set to "*", the API endpoint accepts requests from any domain — making it straightforward to query from scripts. If it is restricted to a specific origin, direct browser-based access from your domain will be blocked, though server-side scraping (bypassing the browser) is unaffected by CORS.

Question 7

Can I scrape a site protected by Cloudflare?

Accepted Answer

It depends on the site's Cloudflare configuration. Many sites use Cloudflare purely as a CDN for performance, with no aggressive bot rules, and are perfectly scrapable with standard HTTP requests. Others use Cloudflare's bot management product, which can require JavaScript challenge completion, fingerprint browsers, or block datacenter IP ranges. Tools like Lection run inside a real Chrome browser, which means Cloudflare sees a legitimate browser fingerprint rather than a raw HTTP client.

Header	What it means for scraping
cf-ray	Cloudflare is proxying this site. Scraping may work, but Cloudflare can add JS challenges, CAPTCHAs, or IP bans.
x-ratelimit-remaining	The number of requests you have left before being rate-limited. Back off as this approaches zero.
retry-after	Seconds to wait after a 429 response. Ignoring this will get you blocked faster.
access-control-allow-origin	If set to "*", the API is publicly accessible from any origin — great for scraping JSON APIs.
content-type	Tells you whether you're getting HTML, JSON, XML, or binary data — so you know how to parse it.
content-encoding	If set to gzip or br, the body is compressed. Most HTTP libraries handle decompression automatically.
cache-control: no-store	Content changes frequently and shouldn't be cached — useful for knowing when to re-scrape.
set-cookie	Session cookies may be required for subsequent requests. Some anti-bot systems fingerprint cookie handling.
x-robots-tag	The server-side equivalent of the HTML robots meta tag. Noindex or nofollow instructions for crawlers.

HTTP Header Inspector

What are HTTP headers?

Which headers matter most for web scrapers?

Frequently asked questions

What is an HTTP header inspector used for?

How can I tell if a site uses Cloudflare from its headers?

What does a 403 response header mean for web scraping?

What are rate limit headers and how should I respect them?

What is the difference between HEAD and GET for header inspection?

Why do response headers matter for CORS and API access?

Can I scrape a site protected by Cloudflare?

Common use cases

Related resources

Robots.txt Checker

Meta Tag Checker

Sitemap Viewer

Web Scraping Legality by Country