Specific meaning attached to the word 'characters' in Authority #277
-
Referring to the URI doc here: https://github.com/eclipse-uprotocol/up-spec/blob/main/basics/uri.adoc#data-model-definition In the section explaining Authority (2.1), the document states:
Does this assume that a character is an ASCII-sized character, in other words, an octet? I am asking because an UTF-8 encoded character stream may be longer in length, In that case, does the invariant hold? I may be missing something obvious; in that case, please guide me to the right place for filling the gap. |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments
-
Hi @nsengupta -- interesting question! I'd like to hear both what @stevenhartley thinks and some thoughts from @sophokles73. Input from others is also welcome. |
Beta Was this translation helpful? Give feedback.
-
@nsengupta in the UUri specification we have tried to define the allowed characters that an authority may consist of. My understanding of the RFC3986 is that these characters are single-octet ASCII characters. You can use '%' encoding to include other characters, though. |
Beta Was this translation helpful? Give feedback.
@nsengupta in the UUri specification we have tried to define the allowed characters that an authority may consist of. My understanding of the RFC3986 is that these characters are single-octet ASCII characters. You can use '%' encoding to include other characters, though.
Based on that, the limit on the authority's length defines the maximum number of octets (bytes).