Glossary

What is robots.txt?

robots.txt

Definition, formula, India benchmarks, and the operator-grade nuance behind it.

Definition

robots.txt is a plain-text file at the root of a domain that tells web crawlers which paths they can access. It's the first request crawlers make. robots.txt does not prevent indexing (use noindex meta for that) — it controls crawl behavior.

  1. robots.txt controls crawler access.

  2. Doesn't prevent indexing — use noindex meta tag for that.

  3. Frameleads robots.txt explicitly allows 21 LLM bots (already shipped).

Formula

robots.txt is a text file at /robots.txt with User-agent and Allow/Disallow directives controlling crawler access.

robots.txt: User-agent: <bot> + Allow/Disallow: <path>
Example
Input: User-agent: GPTBot\nAllow: /\nUser-agent: *\nDisallow: /admin/
Result: GPTBot can crawl all; others blocked from /admin/

The operator's read on robots.txt

robots.txt is the gatekeeper for crawler access. Common pattern: Disallow /api/ and /_next/ to prevent bot waste; Allow / for everything else. Per-bot rules let you allow LLM crawlers (GPTBot, ClaudeBot, PerplexityBot) while controlling lower-value bots. Important: robots.txt is publicly visible — anyone can read it. Don't put sensitive paths there (use auth + noindex instead).

India 2026 benchmarks — robots.txt

Common mistakes to avoid

FAQ

Frequently asked questions

What's a typical robots.txt value in India?

India 2026 benchmarks vary by category: Frameleads robots.txt allows: 21 LLM/AI crawlers explicitly; Disallow patterns: /api/, /_next/ (build artifacts); Sitemap reference: required (helps crawlers find sitemap). Bands compress in saturated CPM regimes and widen as products move from impulse to considered. The right benchmark for your business depends on stage, gross margin, and channel mix.

What are the most common mistakes when tracking robots.txt?

Three mistakes recur most often: Putting sensitive paths in robots.txt (publicly visible).; Confusing robots.txt with noindex (different mechanisms).; Disallow / accidentally (kills entire site indexing).. The simplest defense is to define each metric explicitly in your reporting playbook and avoid mixing definitions across teams.

How does robots.txt relate to other unit-economics metrics?

robots.txt is most useful in context. Pair it with SITEMAP and NOINDEX to build a complete picture. robots.txt alone can mislead — the relationship between metrics matters more than any single number.

Should I optimize robots.txt or accept industry-standard values?

Optimization depends on your stage. Early-stage businesses often have robots.txt values outside healthy bands and need to fix structural issues (audience, creative, retention) before chasing the metric. Established businesses can compound through marginal improvements. Frameleads' Growth System maps which lever moves which metric in your specific category.

Industry adaptations

How robots.txt behaves per industry

robots.txt is a universal metric, but its band, drivers, and optimisation levers vary by category. Drill into the industry-specific version below for the deep view.

Adjacent questions

Questions about robots.txt

Deeper reading

Long-form guides on related topics

Related terms

Pair this with

Sources & references

Cited primary and analyst sources. Independent of Frameleads' own data.

  1. IBEF — India Brand Equity Foundation: Indian Industry ReportsIBEF (Ministry of Commerce & Industry)

    Sector-level market size, growth, and policy context for Indian industries.

  2. IAMAI — Internet & Mobile Association of IndiaIAMAI

    Digital advertising industry body; reports on India internet user base, ad spend, and platform shares.

  3. MoSPI — Ministry of Statistics and Programme ImplementationGovernment of India

    Primary source for India macro-economic indicators (CPI, GDP, household consumption).

  4. ASCI Code for Self-Regulation of Advertising in IndiaAdvertising Standards Council of India

    Mandatory baseline for all advertising claims in India — including digital, influencer, and comparative ads.

Last reviewed: by Frameleads Editorial TeamRefreshed quarterly from live client data
30-min audit

Want this audited for your business?

Get a free 30-minute marketing audit from a senior Frameleads consultant. No slides — just an honest read of where your funnel leaks, with the three highest-leverage moves we'd make first.