-
Notifications
You must be signed in to change notification settings - Fork 1.8k
lib/tutf8e: refactoring for flb_encoding (WIP) #2287
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
@nigels-com would like to see some comments of my changes... |
6a84fd0
to
d40af29
Compare
I'll take a closer look. Seems like a significant rework, some further discussion of the goals and rationale would be helpful, I think. |
My first question from a |
@nigels-com Probably overkill having different modes. Anyway. Just failing with bad chars is not enough (for me). |
@bluebike You're saying that the advantage of REPLACEMENT or QUESTION modes are that they'll flag message streams that are not actually encoded as expected, but at least the ASCII parts will succeed. Do you feel that KEEP and IGNORE are also important modes? |
I'm not convinced the TUTF8E_SAME mode is worth the additional complexity. Have you been able to measure any actual throughput, memory or CPU improvements? |
As an aside, I filed some issues over on the tutf8e repo, as reminders to myself. |
* separate _encoding_* functions removed. * length_functions return TUTF8E_SAME if no changes needed in string * multiple modes modes for handlig invalid chars: KEEP(default),IGNORE,FAIL,REPLACEMENT,QUESTION Signed-off-by: Jukka Pihl <[email protected]>
d40af29
to
127111f
Compare
Ok.. .did some changes...
|
(this goes nowhere) |
WIP: in process of enable encoding first few changes to tutf8e encoding library.
Summary
Enter
[N/A]
in the box, if an item is not applicable to your change.Testing
Before we can approve your change; please submit the following in a comment:
Documentation
Fluent Bit is licensed under Apache 2.0, by submitting this pull request I understand that this code will be released under the terms of that license.