robots.txt
A plain-text file that tells search engine crawlers which URLs on a site they may or may not crawl.
Definition
The robots.txt file, placed at the root of a domain (e.g. https://www.example.com/robots.txt), provides crawl directives to search engine bots. It does not enforce indexing: a disallowed URL can still appear in search results if other pages link to it. Its main use is conserving crawl budget by disallowing low-value or sensitive paths, and it can also point crawlers to sitemaps via a Sitemap directive. Misconfigurations can block critical content from being crawled, so changes should be tested before deployment. It complements, but does not replace, other controls such as meta robots tags, canonical tags, and server-side redirects.
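As a sketch of how these directives behave, the snippet below parses a hypothetical robots.txt (the paths and domain are illustrative assumptions, not from any real site) with Python's standard-library urllib.robotparser and checks whether a crawler may fetch two URLs:

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content for illustration only;
# the disallowed paths and sitemap URL are assumptions.
robots_txt = """\
User-agent: *
Disallow: /admin/
Disallow: /tmp/
Allow: /

Sitemap: https://www.example.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(robots_txt.splitlines())

# A path under a Disallow rule is blocked for all user agents ("*").
print(parser.can_fetch("*", "https://www.example.com/admin/login"))  # False

# Paths not matched by a Disallow rule remain crawlable.
print(parser.can_fetch("*", "https://www.example.com/blog/post"))    # True
```

The same check is what well-behaved crawlers perform before requesting a URL; note that it governs crawling only, not whether the URL gets indexed.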