Skip to content

Releases: tomverran/robots

Fix order of implode arguments

03 Oct 08:20

Choose a tag to compare

Fix notice being emitted with malformed files

31 Jan 11:01
953b5c1

Choose a tag to compare

Also drops support for very old PHP versions

Skip records with an empty user agent

10 Mar 14:18
d6413b1

Choose a tag to compare

With thanks to David Goodwin

Fix case sensitivity

05 Jan 08:35

Choose a tag to compare

Merge pull request #15 from nickmoline/master

URL paths in robots.txt are case sensitive

1.13

31 Oct 21:11

Choose a tag to compare

Support files with only \n line separators

1.12

05 Jun 11:59

Choose a tag to compare

This fixes exact user-agent matches not being prioritised over partial matches.

1.11

06 Mar 20:56

Choose a tag to compare

1.10

06 Mar 15:58

Choose a tag to compare

See 1.1-beta

Renamed from 1.1 due to packagist getting the order wrong

1.10-beta

06 Mar 12:33

Choose a tag to compare

1.10-beta Pre-release
Pre-release

from PR #9

This update brings the library into line with the following two specs for Robots files:

A summary of the changes

  • The most specific matching User-agent is found in the file to determine the rules to apply
  • I took most specific to mean longest match, which is how allow / disallow rules are ordered.
  • Wildcards expand across directory boundaries and can be anchored to the end of the string with $
  • Encoded HTML characters are supported, with the exception of encoded slashes which are left as-is

The file is no longer modelled as a tree but as a series of Records, with User-Agent and AccessRules objects. The actual file parser uses arrays as an intermediate representation to cut down on the number of objects created.

I will mark this release as production ready after a couple of days of testing in the wild.

Version 1.02

07 Dec 21:34

Choose a tag to compare

Relative to 1.02-beta

  • Adds isDisallowed (thanks @waknauss-kingdom)

Relative to 1.01

  • Fixes multiple user agents
  • Fixes non lowercase user agents