/llms.txt — that tells large language models which content on the site is most important to read and in what order. The spec is intentionally simple: a Markdown document with an H1 title, an optional short description, and a list of grouped Markdown links to your most useful pages. A common companion file, llms-full.txt, contains the actual Markdown bodies of those pages so an LLM can ingest the entire useful corpus in one fetch without spidering the HTML.
## Why this matters now (April 2026 trigger)
Three things happened in Q1 2026 that put llms.txt back on every SEO meeting agenda. First, the Discoverability Co schema study showed pages with comprehensive structured data lift AI citations by 44% — and llms.txt is structured data's lazy cousin. Second, the Mintlify "real llms.txt examples" roundup went viral on r/programming. Third, Google's John Mueller publicly likened llms.txt to the old "keywords" meta tag — which the SEO community read as "Google does not care, but it costs nothing to ship." The split between "ship it for safety" and "do not waste time on it" defines the 2026 debate.
## Which AI labs and tools actually read it (the honest list)
# Softechinfra
> Softechinfra is an Indian IT services firm building custom software, AI automation, CRM, web, and mobile applications for SMBs and growth-stage companies. We also ship two in-house products: PenLeap (AI exam prep for students 11+) and TalkDrill (English fluency for Indian adults).
## Services
- [AI Automation](https://softechinfra.com/services/ai-automation): n8n workflows, custom AI agents, RAG systems, voice and chat bots
- [Custom Software Development](https://softechinfra.com/services/custom-software-development): Next.js, FastAPI, Python, MongoDB, PostgreSQL
- [CRM Development](https://softechinfra.com/services/crm-development): SuiteCRM, Zoho, custom builds
- [Mobile App Development](https://softechinfra.com/services/mobile-app-development): React Native, Flutter, native iOS/Android
- [SEO and GEO](https://softechinfra.com/services/seo): Classical SEO + Generative Engine Optimization audits
- [Cloud and DevOps](https://softechinfra.com/services/cloud-devops): AWS, Hetzner, GCP, CI/CD
## Founder
- [Vivek Singh](https://viveksinra.com): Co-founder & CEO. Personal blog, founder's notes on Indian SMB tech.
## In-house products
- [PenLeap](https://penleap.com): AI-powered creative writing & exam prep for students 11+ in India
- [TalkDrill](https://talkdrill.com): English speaking & fluency app for Indian adults, 5,000+ active users
## Blog (priority pages)
- [GEO in 2026: Zero-Click Search Playbook](https://softechinfra.com/blog/generative-engine-optimization-2026-zero-click-search)
- [How to Get Cited by Perplexity](https://softechinfra.com/blog/how-to-get-cited-by-perplexity-7-step-audit)
- [14-Point Schema Markup Checklist](https://softechinfra.com/blog/14-point-schema-checklist-ai-overviews)
- [Brand Mentions Without Backlinks](https://softechinfra.com/blog/brand-mentions-without-backlinks-ai-visibility)
## Case studies
- [Softechinfra Perplexity Citation in 11 Weeks](https://softechinfra.com/blog/softechinfra-perplexity-citation-case-study)
## Contact
- Email: contact@softechinfra.com
- Phone: +91 [number]
- Location: India> line) is the most-cited line — keep it tight, name your entity, name what you do, name where you operate. Second, the link list uses Markdown anchor format with a short description per link — this is the spec, follow it. Third, group your links by purpose (Services / Founder / Products / Blog / Contact) so any LLM ingesting it has clear sections to reason over.
## llms-full.txt — when to ship the longer variant
The companion file llms-full.txt contains the full Markdown body of the pages listed in llms.txt. Anthropic, Vercel, and LangGraph all ship both. The use case is: an AI coding tool wants to ingest your entire useful documentation corpus in one fetch instead of crawling 30 HTML pages.
For a services firm, llms-full.txt makes sense only if your blog is genuinely useful as a knowledge corpus to an LLM. If your blog is mostly news-recap and "10 reasons to use Cloud," skip it. If your blog has technical depth — actual code, real numbers, named clients — ship llms-full.txt with your top 20–30 posts concatenated as Markdown.
- You can write to /public/llms.txt or your hosting equivalent
- You have a list of your 20 to 30 most valuable URLs ready
- You have a tight 1-line description of who you are, what you do, who you serve
- You have decided whether to ship llms-full.txt as well (yes if your blog is technical and useful)
- You have stripped any client names or NDA-covered details from the corpus you plan to publish
> quote line is the part most likely to be lifted into AI answers. Skip it and you waste the most-cited line in the file.
## A real example
We shipped llms.txt + llms-full.txt for a 6-person Mumbai fintech client in February 2026. They had 22 blog posts and an active developer audience. After 8 weeks, server logs showed Claude Code and Cursor fetching their llms.txt regularly when their users asked AI tools about Indian KYC and UPI integration. Did this translate to leads? Indirectly — three engineering leads who later became clients said they "knew the brand from being recommended by Claude." Not direct attribution, but consistent enough to repeat the exercise for every dev-tool client.
For the founder's perspective on why standards like llms.txt matter even when adoption is slow, our founder Vivek Singh writes about early-standard bets on his blog.
## FAQ
### Does Google read llms.txt?
No, not in any meaningful volume as of May 2026. Google-Extended does not fetch it. Google's John Mueller publicly compared the file to the old "keywords" meta tag, suggesting Google has no plans to use it.
### Does Anthropic's ClaudeBot read llms.txt?
Server-log audits suggest no, not at scale. But Anthropic's own user-facing AI tooling (Claude Code, Claude Skills) does fetch llms.txt for context on docs domains. So Anthropic the company benefits from your file even if ClaudeBot the crawler does not log many requests.
### Can I put llms.txt in a subdirectory like /docs/llms.txt?
You can, but the convention is the root domain. If you have a docs subdomain (docs.yoursite.com), put it at docs.yoursite.com/llms.txt. Most readers expect root placement.
### Does llms.txt replace robots.txt?
No. They solve different problems. robots.txt tells crawlers what they can and cannot fetch. llms.txt tells LLMs what is most important on your site. Ship both, and use robots.txt to allow AI crawlers (GPTBot, ClaudeBot, PerplexityBot, Google-Extended) if you want to be cited.
### Should I block AI crawlers in robots.txt and skip llms.txt entirely?
Only if your business model genuinely depends on traffic that is dying — affiliate sites where the user must click through to monetize. For every other model, allowing AI crawlers and shipping llms.txt is the dominant strategy. Blocking removes you from citation pools, which is the new top of funnel.
### What is the file size limit?
There is no hard limit, but Anthropic's llms-full.txt is ~2 MB and tools generally handle that fine. We keep llms.txt under 10 KB and llms-full.txt under 5 MB.
### Will llms.txt still matter in 2027?
Honest answer: nobody knows. Either it becomes a standard with Anthropic or OpenAI signing on publicly, or it joins the meta-keywords graveyard. The 2-hour investment hedges both outcomes.
Want llms.txt + a Full GEO Setup Done End-to-End?
We ship llms.txt + llms-full.txt + the 14 priority schema types + a content-extractability rewrite of your top 10 pages in one engagement. Fixed scope, 5 working days for a 50-page site, ₹55,000 for Indian SMBs. Comes with a 60-day re-check after the first AI citations land.
Book a GEO Setup Call
