Skip to content

Clarify handling of invalid codepoint escape sequences #164

@kasei

Description

@kasei

I think the current spec text is ambiguous about how codepoint escape sequences should be handled if they are invalid. For example:

SELECT * WHERE { ?s ?p "\\u000Z" }

I think we might want to consider adding (either normative or best-practice) text about how this case should be handled. It seems like several systems (including my own, and Jena) ignore invalid sequences, causing the above query to have a literal that starts with an escaped backslash, followed by the four characters "000Z". Other systems might see the \u with invalid trailing characters and raise an error. Having clarity on the expected behavior here would be good.

Metadata

Metadata

Assignees

No one assigned

    Labels

    ErrataErrata management: confirmed erratum

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions