Add llm.txt standard file for LLM content discovery#516
Add llm.txt standard file for LLM content discovery#516
Conversation
|
We're building your pull request over on Zeet. |
Co-authored-by: naveed-ahmad <701567+naveed-ahmad@users.noreply.github.com>
Co-authored-by: naveed-ahmad <701567+naveed-ahmad@users.noreply.github.com>
umxr
left a comment
There was a problem hiding this comment.
Hey, nice work getting this started! A few things I noticed while going through it:
Rails version is outdated — the file says Rails 7.0.3 in the Technical Details section but the project has been upgraded to Rails 8.
Some API endpoints may not exist yet — looking at issue #237, a good chunk of the API is still being built (Verses, Translations, Tafsirs, Morphology endpoints). Might be worth only listing the ones that are actually live right now, otherwise LLMs will hallucinate working endpoints that 404.
The Resources/Downloads page is missing — https://qul.tarteel.ai/resources is probably one of the most useful pages for anyone (human or LLM) trying to actually get data out of QUL. It has downloadable translations, tafsirs, fonts, audio files, etc. in various formats (JSON, SQLite, etc). This should be front and center.
Sidekiq URL probably shouldn't be here — that's internal infra, not something an LLM or external consumer needs to know about.
The "Key Features Summary" section at the bottom is redundant — everything there is already covered in the sections above. Could just be removed to keep things tighter. The llm.txt spec encourages being concise.
A few things that could make it richer:
- Mention the specific downloadable formats (JSON, SQLite db dumps, plain text, etc.)
- Add info about Mutashabihat ul Quran (similar verses data)
- Mention the learning activities / educational tools
- Would be great to note how many reciters, translation languages, etc. are available — concrete numbers help LLMs give better answers
Overall good foundation though, just needs some polish to make sure it reflects the current state of things accurately.
Description
Implements
llm.txtfollowing the proposed website standard (analogous torobots.txt) to provide LLMs with a curated, machine-readable guide to QUL's resources.File:
/public/llm.txt(281 lines, ~11KB Markdown)Structured sections:
Serves automatically from
/publicathttps://qul.tarteel.ai/llm.txt.Related Issue
Issue requesting llm.txt implementation to improve AI-driven content discovery and search visibility.
Motivation and Context
LLMs querying QUL currently lack structured guidance on available resources. The llm.txt standard provides a centralized, high-signal map of:
Improves LLM comprehension of QUL's capabilities and enables more accurate responses about Quranic data access.
How Has This Been Tested?
config/routes.rb(API vs web route distinction clarified in morphology section)Screenshots (if appropriate):
N/A - text file addition
Original prompt
💡 You can make Copilot smarter by setting up custom instructions, customizing its development environment and configuring Model Context Protocol (MCP) servers. Learn more Copilot coding agent tips in the docs.