- 
          
- 
                Notifications
    You must be signed in to change notification settings 
- Fork 33.2k
          gh-104306: Fix incorrect comment handling in the netrc module, minor refactor
          #104511
        
          New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
| Most changes to Python require a NEWS entry. Please add it using the blurb_it web app or the blurb command-line tool. | 
| Most changes to Python require a NEWS entry. Please add it using the blurb_it web app or the blurb command-line tool. | 
        
          
                Misc/NEWS.d/next/Library/2023-05-15-17-22-53.gh-issue-104306.YMiegg.rst
              
                Outdated
          
            Show resolved
            Hide resolved
        
      | Any chance we can revive this? I already spend few hours trying to identify why I was getting 404 errors from github.com, just to discover that it was a commented line in my  | 
| @@ -0,0 +1 @@ | |||
| Fix incorrect comment parsing in the :mod:`netrc` module. | |||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could you expand on this and include context that would give an arbitrary changelog reader an idea of how the change might impact them?
        
          
                Lib/test/test_netrc.py
              
                Outdated
          
        
      | ('anonymous', '', 'pass')) | ||
|  | ||
|  | ||
| def test_main(): | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why is this needed?
| @sleiderr the CI is failing with this PR. Here I've extracted the relevant part of the log from one of the jobs: 1 test failed:
    test_netrc
435 tests OK.
0:12:34 load avg: 5.15 Re-running 1 failed tests in verbose mode in subprocesses
0:12:34 load avg: 5.15 Run 1 test in parallel using 1 worker process (timeout: 10 min, worker timeout: 15 min)
0:12:34 load avg: 5.15 [1/1/1] test_netrc failed (uncaught exception)
Re-running test_netrc in verbose mode
test test_netrc crashed -- Traceback (most recent call last):
  File "D:\a\cpython\cpython\Lib\test\libregrtest\single.py", line 184, in _runtest_env_changed_exc
    _load_run_test(result, runtests)
    ~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^
  File "D:\a\cpython\cpython\Lib\test\libregrtest\single.py", line 129, in _load_run_test
    test_mod = importlib.import_module(module_name)
  File "D:\a\cpython\cpython\Lib\importlib\__init__.py", line 88, in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
           ~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "<frozen importlib._bootstrap>", line 1386, in _gcd_import
  File "<frozen importlib._bootstrap>", line 1359, in _find_and_load
  File "<frozen importlib._bootstrap>", line 1330, in _find_and_load_unlocked
  File "<frozen importlib._bootstrap>", line 935, in _load_unlocked
  File "<frozen importlib._bootstrap_external>", line 756, in exec_module
  File "<frozen importlib._bootstrap>", line 488, in _call_with_frames_removed
  File "D:\a\cpython\cpython\Lib\test\test_netrc.py", line 6, in <module>
    from test.support import os_helper, run_unittest
ImportError: cannot import name 'run_unittest' from 'test.support' (D:\a\cpython\cpython\Lib\test\support\__init__.py)
1 test failed again:
    test_netrc
== Tests result: FAILURE then FAILURE ==(https://github.com/python/cpython/actions/runs/12611674426/job/35147581981?pr=104511#step:6:631) | 
        
          
                Lib/test/test_netrc.py
              
                Outdated
          
        
      | import sys | ||
| import textwrap | ||
| import unittest | ||
| from test.support import os_helper, run_unittest | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This causes a ImportError: cannot import name 'run_unittest' from 'test.support' (#104511 (comment)).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think that dropping the unrelated changes may fix #104511 (comment).
Co-authored-by: 🇺🇦 Sviatoslav Sydorenko (Святослав Сидоренко) <[email protected]>
Co-authored-by: 🇺🇦 Sviatoslav Sydorenko (Святослав Сидоренко) <[email protected]>
Co-authored-by: 🇺🇦 Sviatoslav Sydorenko (Святослав Сидоренко) <[email protected]>
| 
 | 
| for line in self.macros[macro]: | ||
| rep += line | ||
| rep += "\n" | ||
| return rep | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Revert deleting the CLI entry point logic:
| return rep | |
| return rep | |
| if __name__ == '__main__': | |
| print(netrc()) | 
        
          
                Lib/netrc.py
              
                Outdated
          
        
      | raise NetrcParseError( | ||
| "missing %r name" % tt, file, lexer.lineno) | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
looks like an irrelevant formatting change
| raise NetrcParseError( | |
| "missing %r name" % tt, file, lexer.lineno) | |
| raise NetrcParseError("missing %r name" % tt, file, lexer.lineno) | 
        
          
                Lib/netrc.py
              
                Outdated
          
        
      | if lexer.lineno == saved_lineno and len(tt) == 1: | ||
| # For top level tokens, we skip line if the # is followed | ||
| # by a space / newline. Otherwise, we only skip the token. | ||
| if tt == '#' and not lexer.dontskip: | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not sure if I understand why the entirety of t is being compared to a '#'. Is this semantically “the entire token consists of just #”?
This could be
| if tt == '#' and not lexer.dontskip: | |
| if len(tt) == 1 and not lexer.dontskip: | 
but then, why does it matter whether it's # thing vs #thing? Typically, comment parsers just disregard whatever's after the hash and don't interpret that in any way…
        
          
                Lib/netrc.py
              
                Outdated
          
        
      | if ch in self.whitespace and not enquoted: | ||
| if token == "": | ||
| continue | ||
| if ch == '\n': | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What about \r? \r\n?
        
          
                Lib/netrc.py
              
                Outdated
          
        
      | for ch in fiter: | ||
| if ch in self.whitespace: | ||
| enquoted = False | ||
| while ch := self._read_char(): | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Was this refactoring necessary to fix the bug? It's rather difficult to read the diff with so many lines reshuffled. If I were to guess, this might be the reason people are postponing doing reviews on this PR.
        
          
                Lib/netrc.py
              
                Outdated
          
        
      | import os | ||
| import stat | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Let's undo formatting to make sure only relevant changes are visible in the diff.
| import os | |
| import stat | |
| import os, stat | 
        
          
                Lib/netrc.py
              
                Outdated
          
        
      | return self.hosts['default'] | ||
| else: | ||
| return None | ||
| return self.hosts.get(host, self.hosts.get('default')) | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I have a feeling that this might not be relevant to the fix and would be better in a separate refactoring PR.
| Hi @webknjaz - thanks for reviewing this pull request. I agree that most of the changes were not relevant to the actual bug fix and were more confusing than anything else (even for myself, one year later). I've reverted to the upstream version of the module, and fixed the bug in (hopefully) a clearer manner. Removing extra new lines before storing the line count fixes that issue, I've added a few test cases based on the original bug report. | 
| @sleiderr would you improve the change note with now this affects the end-users in practice? No low-level details, just relatable high-level effects. | 
netrcemits syntax errors for comments after blank lines #104306