AI-friendly llms.txt & sitemap generator for websites. Crawl or import sitemaps, filter pages, and generate curated indexes for LLMs.

Thordata/llms-sitemap-generator

LLMS Sitemap Generator

License: MIT · Python 3.9+

AI-friendly llms.txt & sitemap generator

Automatically collect, filter, and organize website URLs into curated indexes for LLMs and search engines.

✨ Features

  • 🤖 LLM Training: Generate llms.txt files for AI model training
  • 🔍 SEO Optimization: Create sitemap.xml files for search engines
  • 📊 Content Discovery: Automatically discover and catalog a site's pages by crawling or importing sitemaps
  • 🌐 Multi-language: Filter pages by language
  • 🏢 Enterprise Sites: Handle complex B2B SaaS sites with multiple subdomains

📦 Installation

# Basic install
pip install llms-sitemap-generator

# With GUI support (quoted so shells such as zsh do not expand the brackets)
pip install "llms-sitemap-generator[gui]"

🎯 Quick Start

1. Analyze site

llms-sitemap-generator analyze https://example.com

2. Generate

# Generate llms.txt
llms-sitemap-generator generate

# Dry run: preview the result, limited to 100 pages
llms-sitemap-generator generate --dry-run --max-pages 100

3. GUI

llms-sitemap-generator gui
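
For orientation, the generate step produces an llms.txt file. The following is a minimal sketch of the general llms.txt layout (an H1 title, a blockquote summary, then H2 sections of Markdown links), not this tool's actual implementation. The function name `render_llms_txt` and the sample title, summary, and page are hypothetical, and the real output may include more detail.

```python
def render_llms_txt(title, summary, groups):
    """Render grouped pages as llms.txt-style Markdown.

    `groups` maps a section name to a list of (page_name, url) pairs.
    This is an illustrative helper, not part of llms-sitemap-generator.
    """
    lines = [f"# {title}", "", f"> {summary}", ""]
    for group, pages in groups.items():
        lines.append(f"## {group}")
        lines.append("")
        for name, url in pages:
            lines.append(f"- [{name}]({url})")
        lines.append("")
    # Collapse trailing blank lines into a single final newline.
    return "\n".join(lines).rstrip() + "\n"


text = render_llms_txt(
    "Example",
    "Example site summary.",
    {"Products": [("Widget", "https://example.com/products/widget")]},
)
print(text)
```

Each section heading corresponds to a group, so the grouping chosen in the configuration determines how the index is organized.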

📝 Configuration

Minimal llmstxt.config.yml:

site:
  base_url: "https://example.com"
  default_language: "en"

sources:
  - type: "sitemap"
    url: "https://example.com/sitemap.xml"

filters:
  include:
    - pattern: "^/products"
      group: "Products"
  exclude:
    - pattern: "blog"
  max_urls: 1000
  auto_group: true

output:
  llms_txt: "llms.txt"
  sitemap_xml: "sitemap.xml"
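
To make the `filters` section more concrete, here is a minimal sketch of how include and exclude rules like these could be applied. It rests on several assumptions that the README does not confirm: patterns are treated as regular expressions searched against the URL path, exclude rules take precedence, URLs matching no include rule are dropped (the real `auto_group` option may instead place them in automatic groups), and `max_urls` caps the number of kept URLs. The function `filter_and_group` is hypothetical and is not the tool's API.

```python
import re
from urllib.parse import urlparse

# Mirrors the `filters` section of the sample config as a plain dict.
FILTERS = {
    "include": [{"pattern": "^/products", "group": "Products"}],
    "exclude": [{"pattern": "blog"}],
    "max_urls": 1000,
}


def filter_and_group(urls, filters):
    """Return {group: [urls]} under the assumed semantics described above."""
    include = [
        (re.compile(rule["pattern"]), rule.get("group", "Other"))
        for rule in filters.get("include", [])
    ]
    exclude = [re.compile(rule["pattern"]) for rule in filters.get("exclude", [])]
    limit = filters.get("max_urls")

    groups = {}
    kept = 0
    for url in urls:
        if limit is not None and kept >= limit:
            break  # assumed meaning of max_urls: stop once the cap is reached
        path = urlparse(url).path or "/"
        if any(pattern.search(path) for pattern in exclude):
            continue  # assumption: exclude rules win over include rules
        for pattern, group in include:
            if pattern.search(path):
                groups.setdefault(group, []).append(url)
                kept += 1
                break  # first matching include rule decides the group
    return groups


urls = [
    "https://example.com/products/widget",
    "https://example.com/products/blog-tools",
    "https://example.com/blog/launch",
    "https://example.com/about",
]
grouped = filter_and_group(urls, FILTERS)
print(grouped)
```

Under these assumptions, the unanchored `blog` pattern also removes `/products/blog-tools`, because it matches anywhere in the path. Anchoring it as `^/blog` would restrict the exclusion to the blog section.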

🛠️ Build Windows EXE

python build_exe.py

Output: dist/llms-sitemap-generator-gui.exe

📄 License

MIT License - see LICENSE


Made with ❤️ by Thordata
