Question 1

Do noindex pages waste crawl budget?

Accepted Answer

Google must crawl a page to see its noindex directive, so a noindexed page still gets crawled, though less often over time. The important rule: never disallow a noindexed page in robots.txt. If crawling is blocked, Google cannot read the noindex, and the URL can stay indexed on the strength of external links alone.
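To make the rule concrete, here is a minimal sketch that checks whether a crawler is even allowed to fetch a URL, and therefore whether a noindex on that page could be read at all. It uses Python's standard urllib.robotparser, which follows the basic robots.txt rules and is not guaranteed to match Googlebot's own matching in every case. The function name noindex_can_be_seen, the example.com URLs, and the sample robots.txt are all made up for illustration.

```python
from urllib.robotparser import RobotFileParser


def noindex_can_be_seen(robots_txt: str, url: str, user_agent: str = "Googlebot") -> bool:
    """Return True if robots.txt lets the crawler fetch `url`.

    A noindex on the page only has a chance of being read when this is True.
    (Hypothetical helper; approximates the check with the stdlib parser.)
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Illustrative robots.txt: everything under /private/ is blocked for all bots.
robots_txt = """\
User-agent: *
Disallow: /private/
"""

# Blocked: a noindex placed on this page would never be fetched or read.
blocked = noindex_can_be_seen(robots_txt, "https://example.com/private/report.html")

# Crawlable: a noindex placed on this page can be read and honored.
allowed = noindex_can_be_seen(robots_txt, "https://example.com/public/report.html")
```

If a page you have noindexed comes back as not fetchable, the usual fix is to remove the disallow rule so the noindex can be read.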

Question 2

Should I use nofollow on internal links?

Accepted Answer

Almost never. Nofollow on internal links throws away link equity rather than redirecting it, and Google treats nofollow as a hint anyway. If you do not want a page indexed, noindex it. If you do not want it crawled, use robots.txt. Internal nofollow solves neither problem cleanly.
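If you want to find out whether a template is already adding nofollow to internal links, a rough audit along these lines may help. It is only a sketch: it treats a link as internal when its host matches the page's host exactly (subdomains are not handled), and the class name, function name, and sample markup are invented for the example.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class InternalNofollowFinder(HTMLParser):
    """Collects links on the same host as the page that carry rel="nofollow"."""

    def __init__(self, page_url: str):
        super().__init__()
        self.page_url = page_url
        self.site_host = urlparse(page_url).netloc
        self.flagged = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        attr = dict(attrs)
        href = attr.get("href")
        # rel is a space-separated token list, e.g. "nofollow noopener".
        rel_tokens = (attr.get("rel") or "").lower().split()
        if not href or "nofollow" not in rel_tokens:
            return
        # Resolve relative hrefs against the page URL before comparing hosts.
        target = urljoin(self.page_url, href)
        if urlparse(target).netloc == self.site_host:
            self.flagged.append(target)


def find_internal_nofollow(html: str, page_url: str) -> list[str]:
    finder = InternalNofollowFinder(page_url)
    finder.feed(html)
    return finder.flagged


sample_html = """
<a href="/pricing" rel="nofollow">Pricing</a>
<a href="https://other.example.org/" rel="nofollow noopener">Partner</a>
<a href="/about">About</a>
"""

flagged = find_internal_nofollow(sample_html, "https://example.com/blog/post")
```

Only the internal /pricing link is flagged here; the external nofollow link and the plain internal link are left alone.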

Question 3

What is the difference between meta robots and robots.txt?

Accepted Answer

Robots.txt controls crawling: it tells bots which URLs they may fetch. Meta robots controls indexing and presentation: what Google may do with a page it has already fetched. They are not interchangeable, and combining disallow with noindex on the same URL is the classic mistake, because the noindex never gets seen.

Question 4

What happens if a page has no robots tag at all?

Accepted Answer

Google assumes index, follow. The defaults are permissive, so you only need the tag when you want to restrict something. A tag reading index, follow is harmless but redundant, and you can omit it entirely.
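The default behavior can be sketched as a small function that starts from index, follow and only switches a setting off when a restricting directive is present. This is an illustration of the rule, not how any search engine is implemented: it reads only the generic name="robots" meta tag, ignores bot-specific tags and the X-Robots-Tag header, and the names MetaRobotsReader and effective_robots are made up.

```python
from html.parser import HTMLParser


class MetaRobotsReader(HTMLParser):
    """Collects directives from <meta name="robots" content="..."> tags."""

    def __init__(self):
        super().__init__()
        self.directives = []

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        if tag == "meta" and (attr.get("name") or "").lower() == "robots":
            content = attr.get("content") or ""
            # Directives are comma-separated and case-insensitive.
            self.directives += [d.strip().lower() for d in content.split(",") if d.strip()]


def effective_robots(html: str) -> dict:
    """Return the effective index/follow settings, defaulting to permissive."""
    reader = MetaRobotsReader()
    reader.feed(html)
    found = set(reader.directives)
    if "none" in found:  # "none" is shorthand for noindex, nofollow
        found |= {"noindex", "nofollow"}
    return {
        "index": "noindex" not in found,
        "follow": "nofollow" not in found,
    }


no_tag = effective_robots("<html><head><title>Page</title></head></html>")
redundant = effective_robots('<meta name="robots" content="index, follow">')
restricted = effective_robots('<meta name="robots" content="noindex">')
```

A page with no tag and a page with an explicit index, follow tag resolve to the same result, which is why the explicit tag is redundant.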

Meta Robots Tag Generator

X-Robots-Tag HTTP header (for PDFs, images, and other non-HTML files)

Every robots directive, in plain English

Meta tag or HTTP header?

The one combination to avoid

Frequently asked questions