Keep original line endings when reading expectation files by xymaxim · Pull Request #25 · zaufi/pytest-matcher

xymaxim · 2024-03-05T16:37:26Z

Changes in this PR

Since Python by default opens files in the universal newlines mode (newline=None), line ending characters are translated. Which is not good for us. So, let's preserve original line endings while writing and reading expectation files.

zaufi · 2024-03-05T18:56:36Z

Could you please elaborate and gimme some more details and/or an example? I just can't understand what problem you're trying to solve %)

xymaxim · 2024-03-05T20:05:23Z

Could you please elaborate and gimme some more details and/or an example? I just can't understand what problem you're trying to solve %)

While I was working on #24, I noticed that regular tests (not from the previous issue) with \r\n would fail:

	def test_sample_out(capfd, expected_out):
    	print('Hello Africa!\r\n', end='')
    	stdout, stderr = capfd.readouterr()
>   	assert expected_out == stdout
E   	AssertionError: assert
E     	The test output doesn't equal to the expected
E     	(from `/tmp/pytest-of-ms/pytest-109/crlf_test0/crlf_test/test_sample_out.out`):
E     	---[BEGIN actual output]---
E     	Hello Africa!↵
E     	---[END actual output]---
E     	---[BEGIN expected output]---
E     	Hello Africa!↵
E     	---[END expected output]---

Here’s a quick demo of what’s happening:

# test: 00000000: 410d 0a42                            	A..B
#                   \r \n
with open("test", "w") as f:
    f.write("A\r\nB")

with open("test", "r") as f:
    print(repr(f.read()))  # “A\nB”

While this works as desired:

with open("test", "r", newline=””) as f:
    print(repr(f.read()))  # “A\r\nB”

xymaxim · 2024-03-05T20:07:02Z

src/matcher/plugin.py


        # Store!
-        self._pattern_filename.write_text(text)
+        with self._pattern_filename.open('w', newline='') as f:


Well, this one is unnecessary and needs to be removed: the conversion of newlines doesn't happen during writing (see https://peps.python.org/pep-0278/#specification).

zaufi · 2024-03-05T20:36:16Z

What makes me worried is that this will ruin (almost) everything! Indeed, for example, I have a trivial output matching test print('Hello Africa!') with the obviously trivial expectation. Whatever OS I use, thanks to Git and the auto CRLF option everything works fine on all platforms! -- cuz during checkout git will replace EOL(s) in the expectations file to the native format and Python's TextIO will handle \n properly...

If I needed smth system-specific, I add a f'-{platform.system()}' suffix to the expectation filename and told Git via .gitattributes to set whatever CRLF style I needed over system-dependent expectations.

Obviously, this patch will make this way broken...

I've taken a closed look at the test added by the #24 ... it looks incorrect to me %) makepatternfile() of the fixture behaves similarly to the pytester.makefile and does not write EOL at the last (and in this case the only) text line! So, strictly speaking, that test never checks for EOL styles...

xymaxim · 2024-03-06T09:28:11Z

What makes me worried is that this will ruin (almost) everything! Indeed, for example, I have a trivial output matching test print('Hello Africa!') with the obviously trivial expectation. Whatever OS I use, thanks to Git and the auto CRLF option everything works fine on all platforms! -- cuz during checkout git will replace EOL(s) in the expectations file to the native format and Python's TextIO will handle \n properly...

Hmm, indeed, that’s something I hadn’t considered. My initial thoughts were that expectation texts should be treated as immutable, without any changes, and files are just an intermediate state of them, but, yes, Git couldn't be excluded from the story.

xymaxim · 2024-03-06T09:30:01Z

If I needed smth system-specific, I add a f'-{platform.system()}' suffix to the expectation filename and told Git via .gitattributes to set whatever CRLF style I needed over system-dependent expectations.

I recently came across a binary file with an ASCII text header that needed to be parsed, and the header contains Windows-style line endings. The imaginary test case (just for an example) for some header extract function would be to output the header as is. This is not a platform-specific case.

As I understand, on Unix, it’s not possible to have a pattern with CRLF symbols right now (even if we add tests/data/expected/crlf_test.out -text to .gitattributes) because of how Python converts newlines during reading (CRLF -> LF).

So, the only way is to read it in a binary mode?

xymaxim · 2024-03-06T09:34:54Z

I've taken a closed look at the test added by the #24 ... it looks incorrect to me %) makepatternfile() of the fixture behaves similarly to the pytester.makefile and does not write EOL at the last (and in this case the only) text line! So, strictly speaking, that test never checks for EOL styles...

Let's discuss it in #24, I'll answer there.

zaufi · 2024-03-09T17:43:37Z

As I understand, on Unix, it’s not possible to have a pattern with CRLF symbols right now (even if we add tests/data/expected/crlf_test.out -text to .gitattributes) because of how Python converts newlines during reading (CRLF -> LF).

I didn't get it... why not? The other question is if you want to make this test run on all platforms (w/ different native EOLs) and a test's input data file is "static" (in terms of EOL style in its beginning (I'm thinking about some "self-extract" archive w/ a shell script at the beginning %) obviously u need to preserve EOLs in the pattern file added to VCS (git I guess. so .gitattributes could help) and during the test tell to Python to use specific EOL style on read from file...

What could go wrong here? %)

xymaxim · 2024-03-12T19:07:34Z

The problem is that the test from the PR doesn't pass without the proposed changes because of the missing CR symbol. This may look unexpected and confusing for users, since an expectation file actually contains the symbol, but not the fixture.

xymaxim requested a review from zaufi as a code owner March 5, 2024 16:37

xymaxim force-pushed the keep-original-eols branch from 0d31584 to 1d636b5 Compare March 5, 2024 18:27

xymaxim changed the title ~~Keep original line endings while writing and reading expectation~~ Keep original line endings when reading expectation files Mar 5, 2024

xymaxim commented Mar 5, 2024

View reviewed changes

xymaxim added 2 commits March 5, 2024 23:22

fix: keep original line endings when reading expectation files

8bd7650

misc: Update changelog with keeping original line endings fix

2991c3e

xymaxim force-pushed the keep-original-eols branch from 46ffe41 to 2991c3e Compare March 5, 2024 20:30

zaufi force-pushed the master branch 2 times, most recently from 0196775 to 68dc622 Compare March 11, 2024 03:36

zaufi force-pushed the master branch 6 times, most recently from ec9dfe3 to 8abc274 Compare July 17, 2025 22:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Keep original line endings when reading expectation files#25

Keep original line endings when reading expectation files#25
xymaxim wants to merge 2 commits intozaufi:masterfrom
xymaxim:keep-original-eols

xymaxim commented Mar 5, 2024

Uh oh!

zaufi commented Mar 5, 2024

Uh oh!

xymaxim commented Mar 5, 2024

Uh oh!

xymaxim Mar 5, 2024

Uh oh!

zaufi commented Mar 5, 2024 •

edited

Loading

Uh oh!

xymaxim commented Mar 6, 2024

Uh oh!

xymaxim commented Mar 6, 2024

Uh oh!

xymaxim commented Mar 6, 2024

Uh oh!

zaufi commented Mar 9, 2024

Uh oh!

xymaxim commented Mar 12, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

xymaxim commented Mar 5, 2024

Changes in this PR

Uh oh!

zaufi commented Mar 5, 2024

Uh oh!

xymaxim commented Mar 5, 2024

Uh oh!

xymaxim Mar 5, 2024

Choose a reason for hiding this comment

Uh oh!

zaufi commented Mar 5, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

xymaxim commented Mar 6, 2024

Uh oh!

xymaxim commented Mar 6, 2024

Uh oh!

xymaxim commented Mar 6, 2024

Uh oh!

zaufi commented Mar 9, 2024

Uh oh!

xymaxim commented Mar 12, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

zaufi commented Mar 5, 2024 •

edited

Loading