Where does robots.txt go?

At the root of your domain - for example example.com/robots.txt. Crawlers will not find it in a subfolder.

Does robots.txt stop a page from being indexed?

No. It blocks crawling, not indexing. Use a noindex meta tag to prevent indexing.

What Is Robots.txt? Syntax, Examples & SEO Guide

Robots.txt does not control indexing

A crucial nuance: Disallow blocks crawling, not indexing. A blocked URL can still appear in Google (without a description) if other pages link to it - the "Indexed, though blocked by robots.txt" warning. To keep a page out of the index, allow crawling and use a noindex meta tag instead.

Pro tip

A blocked URL can still appear in Google without a description if other pages link to it. Use noindex - not robots.txt - to truly remove a page.

Key takeaways

Robots.txt tells crawlers which URLs they may request.

It lives at your domain root and is the first file bots check.

Disallow blocks crawling, not indexing.

To keep a page out of the index, use a noindex tag instead.

Put it into practice with Soro

Understanding robots.txt is one thing - applying it across every page is another. Soro automates SEO content end to end, researching keywords and publishing optimised articles so your site ranks on Google and gets cited by AI. See how Soro works.

What Is Robots.txt?

Robots.txt does not control indexing

Put it into practice with Soro

Frequently asked questions

Keep learning