Skip to content

Conversation

Cladis
Copy link

@Cladis Cladis commented Feb 10, 2018

It looks that although the matching regex in hashtags/utils.py support all unicode the SQL, regexes here do not and only [A-Za-z][A-Za-z0-9] hashtags are this supported by the tool. This attempts to fix it using Unicode character properties as per https://mariadb.com/kb/en/library/pcre/ . Support for this is since MariaDB 10.0.5, so 10.0.33 new replica servers should support it. Unfortunately I cannot test this though, as s52467__new_hashtags table is not public.

I was asked to figure out why https://tools.wmflabs.org/hashtags/search/спецпатруль?lang=uk request does not work. You can use it for testing the patch.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant