I emailed Yahoo asking for a list of egress IP addresses. Once I get that, I can allowlist Yahoo Slurp.
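As a rough illustration of why the IP list matters: a User-Agent string alone is easy to spoof, so an allowlist rule would normally require both a matching User-Agent and a source address inside a published range. The sketch below is not Anubis' actual policy format. `is_allowed_slurp` is a hypothetical helper, and the ranges are reserved documentation prefixes standing in for Yahoo's real egress addresses, which are not known here.

```python
import ipaddress

# Placeholder ranges reserved for documentation (RFC 5737, RFC 3849).
# These are NOT Yahoo's real egress addresses; swap in the published list.
SLURP_RANGES = [
    ipaddress.ip_network("192.0.2.0/24"),
    ipaddress.ip_network("2001:db8::/32"),
]


def is_allowed_slurp(user_agent: str, remote_addr: str) -> bool:
    """Return True only if the UA claims to be Slurp AND the IP is allowlisted."""
    # A request that does not even claim to be Slurp is out of scope.
    if "slurp" not in user_agent.lower():
        return False
    try:
        addr = ipaddress.ip_address(remote_addr)
    except ValueError:
        # Malformed address: treat as not allowlisted.
        return False
    # Membership is False when the address and network IP versions differ.
    return any(addr in net for net in SLURP_RANGES)
```

A request claiming to be Slurp from outside those ranges would fall through to the normal challenge rather than being waved through.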
I'm posting this under Discussions because I'm unsure whether it merits a full issue ticket, and I wanted to hear people's thoughts. Since Anubis is used by a growing number of websites (especially within the FOSS community), the crawlers it allows can have a sizable impact on what is reachable from search engines.
In addition to indexes provided by Bing, Yahoo Search also seems to use a separate crawler called Slurp, which is currently not on Anubis' list of allowed bots. The xeiaso.net webpage still appears on Yahoo Search, and I suspect this may be due to Yahoo's use of Bing indexes. However, I still wonder whether it would be a good idea to explicitly add Yahoo's crawler to the list, or whether there is a reason it hasn't been included.
On the subject, Yahoo writes that the crawler is used for Yahoo Mobile search, as well as the company's news, sports, and finance sites. Like Google, Yahoo Mobile does advertise its own set of AI features, and it wasn't clear to me whether this is accomplished through the Slurp crawler or not. (Dark Visitors does not currently label Slurp as being AI-related.) I imagine whether maintainers wish to allow Slurp may depend on (a) what purpose the bot is actually used for, (b) the ease of keeping associated IP ranges up to date, and (c) how nicely the bot plays with websites.