🤖 Free SEO Tool

Robots.txt URL Tester

Test any URL against your robots.txt rules. See instantly if Googlebot, Bingbot, or any crawler is allowed or blocked — and exactly which rule triggered it.


How Robots.txt Rules Work

Matching Logic

Prefix Matching

Robots.txt uses prefix matching. Disallow: /blog/ blocks any URL starting with /blog/ — including /blog/post-1, /blog/category/seo, etc.
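A minimal sketch of that prefix check, assuming a rule path has already been extracted from the file (the function name is illustrative, not part of any real parser):

```python
def rule_matches(rule_path: str, url_path: str) -> bool:
    """A plain Allow/Disallow path matches any URL path it prefixes."""
    return url_path.startswith(rule_path)

# Disallow: /blog/ matches everything under /blog/
assert rule_matches("/blog/", "/blog/post-1")
assert rule_matches("/blog/", "/blog/category/seo")
assert not rule_matches("/blog/", "/about")
```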

Rule Priority

Specificity Wins

Google uses the most specific (longest) matching path. If Allow: /admin/public/ and Disallow: /admin/ both match, the longer rule wins — allowing /admin/public/.
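The longest-match priority can be sketched in a few lines. This is a simplified model of the behavior described above, not Google's actual implementation; rules are assumed to be (directive, path) pairs, and a tie between equally long Allow and Disallow paths goes to Allow:

```python
def is_allowed(rules, url_path):
    """rules: list of (directive, path) tuples, e.g. ("Disallow", "/admin/").
    The longest matching path wins; Allow breaks ties; no match means allowed."""
    best = None  # (path_length, is_allow)
    for directive, path in rules:
        if url_path.startswith(path):
            candidate = (len(path), directive == "Allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]

rules = [("Disallow", "/admin/"), ("Allow", "/admin/public/")]
assert is_allowed(rules, "/admin/public/page")   # longer Allow wins
assert not is_allowed(rules, "/admin/secret")    # Disallow applies
```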

Wildcards

* and $ Patterns

* matches any sequence of characters. $ anchors to end of URL. Example: Disallow: /*.pdf$ blocks all PDF files.
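One common way to evaluate these patterns is to translate them into regular expressions. A hedged sketch of that translation (the helper name is illustrative):

```python
import re

def pattern_to_regex(pattern: str) -> "re.Pattern":
    # Escape everything, then restore the two supported wildcards.
    regex = re.escape(pattern).replace(r"\*", ".*")
    if regex.endswith(r"\$"):
        regex = regex[:-2] + "$"   # a trailing $ anchors the end of the URL
    return re.compile(regex)

pdf_rule = pattern_to_regex("/*.pdf$")
assert pdf_rule.match("/files/report.pdf")
assert not pdf_rule.match("/files/report.pdf?download=1")
```

Note that without the `$` anchor, plain prefix matching applies, so `Disallow: /search` already covers `/search/results`; a trailing `*` is redundant.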

Important

Crawl ≠ Index

Robots.txt blocks crawling, not indexing. Google can still index a blocked page if other sites link to it. Use a noindex meta tag or an X-Robots-Tag HTTP header to prevent indexing.
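A quick sketch of how a tool might check for the noindex directive in either place, the response header or a robots meta tag, using only the standard library (the class and function names are illustrative):

```python
from html.parser import HTMLParser

class RobotsMetaParser(HTMLParser):
    """Looks for <meta name="robots" content="...noindex..."> in the HTML."""
    def __init__(self):
        super().__init__()
        self.noindex = False

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        if tag == "meta" and a.get("name", "").lower() == "robots":
            if "noindex" in (a.get("content") or "").lower():
                self.noindex = True

def is_noindexed(headers: dict, html: str) -> bool:
    # Header check first: X-Robots-Tag applies to any file type.
    if "noindex" in headers.get("X-Robots-Tag", "").lower():
        return True
    parser = RobotsMetaParser()
    parser.feed(html)
    return parser.noindex

assert is_noindexed({}, '<meta name="robots" content="noindex, nofollow">')
assert is_noindexed({"X-Robots-Tag": "noindex"}, "<html></html>")
assert not is_noindexed({}, "<html><body>hello</body></html>")
```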

Frequently Asked Questions

How does a robots.txt URL tester work?

A robots.txt URL tester parses your robots.txt file, identifies all user-agent blocks, and evaluates each rule against your target URL. It finds the most specific matching rule (by path length) for the given user-agent and reports whether crawling is allowed or blocked. If no rule matches, crawling is allowed by default.
How does Google decide which rule takes priority?

Google uses the most specific (longest) matching path to determine priority. If an Allow and a Disallow rule match with equal specificity, Allow takes precedence. For example, "Allow: /page" overrides "Disallow: /" for that URL, because /page is longer (more specific) than /.
Do robots.txt rules apply to all crawlers equally?

No. Robots.txt rules are per user-agent. You can allow Googlebot to crawl a URL while blocking other bots, or vice versa. User-agent: * rules apply to every bot not specifically addressed: a bot first looks for a group matching its own user-agent, then falls back to the wildcard group.
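That lookup order can be sketched as a simple dictionary fallback (the group data here is illustrative):

```python
# Rule groups keyed by user-agent token; "*" is the wildcard group.
groups = {
    "googlebot": [("Disallow", "/private/")],
    "*": [("Disallow", "/")],
}

def rules_for(user_agent: str):
    """A bot uses its own group if one exists, else the * group."""
    return groups.get(user_agent.lower(), groups.get("*", []))

assert rules_for("Googlebot") == [("Disallow", "/private/")]
assert rules_for("Bingbot") == [("Disallow", "/")]   # falls back to *
```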
What does "Disallow: /" do?

"Disallow: /" blocks the entire website for the specified user-agent. It matches every URL that starts with "/" — which is all of them. To unblock specific pages, add "Allow: /specific-page"; for Google, the order of the rules does not matter, because the more specific (longer) path takes precedence.
Which wildcards does robots.txt support?

Google and Bing support two wildcards: * matches any sequence of characters, and $ anchors to the end of a URL. For example, "Disallow: /*.pdf$" blocks all URLs ending in .pdf, and "Disallow: /search" blocks all URLs starting with /search.
Does blocking a URL in robots.txt prevent it from being indexed?

No: robots.txt blocks crawling, not indexing. Google can still index a page it has never crawled if other pages link to it. To prevent indexing, use a "noindex" meta tag or an X-Robots-Tag HTTP header on the page itself.

Is Your robots.txt Accidentally Blocking Google?

We've seen SaaS companies shipping with noindex on their homepage. A 10-minute automated audit finds every crawlability issue.

Get Free SEO Audit →