-
Notifications
You must be signed in to change notification settings - Fork 11
Collapse whitespace #12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Collapse whitespace #12
Conversation
the principle issue is that wilbur - in its inherited form, predates any reasonable support for utf encoding.
the whitespace predicate appears to reflect this.
i am not a user (see dydra.com) and set it up here so that it would not disappear.
if you intend to use it, would you consider integrating the capabilities which are now present in lisp implementations?
also, your whitespace characterization seems suspect.
|
Long time ago I faced similar problem. Better solution would be to remove from Wilbur all xml functions and delegate it to an external lib such as https://common-lisp.net/project/cxml/ |
as i noted earlier, my interest in wilbur is curatorial. it would be a significant compromise of its artifactual status to introduce a large dependency to replace an aspect which, in its state of incompleteness, expresses a judgement about the role of rdf-xml. i am open to being convinced otherwise. |
The character encoding should be handled by the reader, e.g. the web client - I'm using Drakma, which works well for me. Wilbur only has to deal with Lisp characters.
How do you imagine that? Shall I take over maintenance?
I don't think so. It's according to the ASCII definition. |
if the goal is to make this artefact behave better with unicode, then wrt whitespace the class of non-graphic characters does not coincide with that of whitespace characters. https://en.wikipedia.org/wiki/Whitespace_character |
in some sense that is true, but the change removes the primitive support which it had for utf decoding in that one function, which leaves one to wonder where else that situation applies. |
you are certainly free to fork it. |
I had a problem when parsing an UTF-8 encoded RDF/XML document.
Signals an error:
Here is a fix.