Skip to content

Commit 55a03dc

Browse files
committed
Tell the bots to go away
The site is having issues at the moment, if it's not there for humans then the bots shouldn't be taking up resources. We can decide if someone these should be allowed in future
1 parent 7d6b57e commit 55a03dc

File tree

1 file changed

+68
-2
lines changed

1 file changed

+68
-2
lines changed

root/robots.txt

Lines changed: 68 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,4 @@
1-
# http://www.robotstxt.org/wc/norobots.html
2-
1+
# Stop anything going into these locations
32
User-agent: *
43
Disallow: /login/
54
Disallow: */diff/
@@ -14,3 +13,70 @@ Disallow: /*?*size=*
1413

1514
Sitemap: https://metacpan.org/sitemap-authors.xml.gz
1615
Sitemap: https://metacpan.org/sitemap-releases.xml.gz
16+
17+
# Stop the bots, using list from:
18+
# https://github.com/ai-robots-txt/ai.robots.txt/blob/main/robots.txt
19+
User-agent: AI2Bot
20+
User-agent: Ai2Bot-Dolma
21+
User-agent: aiHitBot
22+
User-agent: Amazonbot
23+
User-agent: anthropic-ai
24+
User-agent: Applebot
25+
User-agent: Applebot-Extended
26+
User-agent: Brightbot 1.0
27+
User-agent: Bytespider
28+
User-agent: CCBot
29+
User-agent: ChatGPT-User
30+
User-agent: Claude-SearchBot
31+
User-agent: Claude-User
32+
User-agent: Claude-Web
33+
User-agent: ClaudeBot
34+
User-agent: cohere-ai
35+
User-agent: cohere-training-data-crawler
36+
User-agent: Cotoyogi
37+
User-agent: Crawlspace
38+
User-agent: Diffbot
39+
User-agent: DuckAssistBot
40+
User-agent: FacebookBot
41+
User-agent: Factset_spyderbot
42+
User-agent: FirecrawlAgent
43+
User-agent: FriendlyCrawler
44+
User-agent: Google-CloudVertexBot
45+
User-agent: Google-Extended
46+
User-agent: GoogleOther
47+
User-agent: GoogleOther-Image
48+
User-agent: GoogleOther-Video
49+
User-agent: GPTBot
50+
User-agent: iaskspider/2.0
51+
User-agent: ICC-Crawler
52+
User-agent: ImagesiftBot
53+
User-agent: img2dataset
54+
User-agent: imgproxy
55+
User-agent: ISSCyberRiskCrawler
56+
User-agent: Kangaroo Bot
57+
User-agent: meta-externalagent
58+
User-agent: Meta-ExternalAgent
59+
User-agent: meta-externalfetcher
60+
User-agent: Meta-ExternalFetcher
61+
User-agent: MistralAI-User/1.0
62+
User-agent: NovaAct
63+
User-agent: OAI-SearchBot
64+
User-agent: omgili
65+
User-agent: omgilibot
66+
User-agent: Operator
67+
User-agent: PanguBot
68+
User-agent: Perplexity-User
69+
User-agent: PerplexityBot
70+
User-agent: PetalBot
71+
User-agent: QualifiedBot
72+
User-agent: Scrapy
73+
User-agent: SemrushBot-OCOB
74+
User-agent: SemrushBot-SWA
75+
User-agent: Sidetrade indexer bot
76+
User-agent: TikTokSpider
77+
User-agent: Timpibot
78+
User-agent: VelenPublicWebCrawler
79+
User-agent: Webzio-Extended
80+
User-agent: wpbot
81+
User-agent: YouBot
82+
Disallow: /

0 commit comments

Comments
 (0)