Encoding config support #20

iamgd67 · 2020-01-22T09:49:03Z

Add encoding config support for Python scripts, to handle errors like
`UnicodeDecodeError: 'gbk' codec can't decode byte 0xac in position 797: illegal multibyte sequence`

adamtheturtle · 2020-06-18T19:48:41Z

Thank you @iamgd67 . I know this is an old PR, but I would be happy to get a fix in for your issue.

Could you provide either or both (ideally both) of the following:

An issue which describes how to reproduce this problem.
A test case in this PR which fails without the fix.

iamgd67 · 2020-06-22T09:37:05Z

In my situation, it fails when I process UTF-8 encoded files that contain Chinese characters on Windows. It looks like `open` on Windows uses gbk as the default encoding, so decoding the file fails.
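For illustration, the failure described above can be reproduced on any platform by forcing the encoding instead of relying on the platform default. This is a minimal sketch, not code from this PR: the file name and contents are made up, and `gbk` stands in for what `open()` would pick by default on a Chinese-locale Windows machine.

```python
import locale
import tempfile
from pathlib import Path

# What open() falls back to when no encoding is given (platform dependent).
default_encoding = locale.getpreferredencoding(False)

with tempfile.TemporaryDirectory() as tmp:
    # Hypothetical UTF-8 script containing a Chinese character.
    script = Path(tmp) / "example.py"
    script.write_bytes("x = 1  # \u4e2d\n".encode("utf-8"))

    # Simulate the Windows default by asking for gbk explicitly.
    try:
        with open(script, encoding="gbk") as file_obj:
            file_obj.read()
        gbk_failed = False
    except UnicodeDecodeError:
        gbk_failed = True

    # Passing the encoding explicitly removes the platform dependence.
    with open(script, encoding="utf-8") as file_obj:
        text = file_obj.read()
```

Here `gbk_failed` ends up `True`, while the explicit UTF-8 read returns the original text regardless of the value of `default_encoding`.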

zhu · 2021-01-20T01:14:57Z

It seems that `ast.parse()` can handle the encoding itself, so we can use `open(filename, mode='rb')` to solve this problem.
A script file encoded in gbk should add a `# coding: gbk` line; see PEP 263.
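A rough sketch of that suggestion, assuming the goal is only to collect top-level imported module names. The helper `imported_modules` and the sample script are hypothetical and are not the pip-check-reqs implementation.

```python
import ast
import tempfile
from pathlib import Path


def imported_modules(path):
    """Return the top-level module names imported by the file at `path`."""
    # Read raw bytes; ast.parse() decodes them itself, honouring a PEP 263
    # coding line and defaulting to UTF-8 when there is none.
    with open(path, mode="rb") as file_obj:
        tree = ast.parse(file_obj.read(), filename=str(path))

    modules = set()
    for node in ast.walk(tree):
        if isinstance(node, ast.Import):
            modules.update(alias.name.split(".")[0] for alias in node.names)
        elif isinstance(node, ast.ImportFrom) and node.module and node.level == 0:
            modules.add(node.module.split(".")[0])
    return modules


# A gbk-encoded script that declares its encoding on the first line.
gbk_source = (
    "# coding: gbk\n"
    "import os\n"
    "from json import dumps\n"
    "name = '\u4e2d\u6587'\n"
).encode("gbk")

with tempfile.TemporaryDirectory() as tmp:
    script = Path(tmp) / "gbk_script.py"
    script.write_bytes(gbk_source)
    found = imported_modules(script)
```

Because the file is never decoded in text mode, the platform's default encoding does not come into play. A gbk file without the coding line would still fail to parse, since the bytes would then be treated as UTF-8.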

adamtheturtle · 2021-01-20T11:09:02Z

Thanks for the work @zhu. Please ping me when tests are passing or if I can help!

fabswt · 2023-02-11T12:27:03Z

Getting the same kind of error:

(venv) root@web:/var/www/html/python-tests/pronunciation-demo# pip-extra-reqs .
Traceback (most recent call last):
  File "/var/www/html/python-tests/pronunciation-demo/venv/bin/pip-extra-reqs", line 8, in <module>
    sys.exit(main())
  File "/var/www/html/python-tests/pronunciation-demo/venv/lib/python3.9/site-packages/pip_check_reqs/find_extra_reqs.py", line 211, in main
    extras = find_extra_reqs(
  File "/var/www/html/python-tests/pronunciation-demo/venv/lib/python3.9/site-packages/pip_check_reqs/find_extra_reqs.py", line 35, in find_extra_reqs
    used_modules = common.find_imported_modules(
  File "/var/www/html/python-tests/pronunciation-demo/venv/lib/python3.9/site-packages/pip_check_reqs/common.py", line 151, in find_imported_modules
    content = file_obj.read()
  File "/usr/lib/python3.9/codecs.py", line 322, in decode
    (result, consumed) = self._buffer_decode(data, self.errors, final)
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xa4 in position 64: invalid start byte

Maybe more annoying is that it doesn't tell me which file it was scanning.
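One possible way to surface the offending file, sketched here with a hypothetical `read_source` helper (this is not an existing pip-check-reqs function), is to catch the decode error and re-raise it with the path attached:

```python
import tempfile
from pathlib import Path


def read_source(path, encoding="utf-8"):
    """Read a source file, naming the file if it cannot be decoded."""
    try:
        with open(path, encoding=encoding) as file_obj:
            return file_obj.read()
    except UnicodeDecodeError as exc:
        # Re-raise with the path so the user knows which file to fix.
        raise ValueError(f"could not decode {path}: {exc}") from exc


with tempfile.TemporaryDirectory() as tmp:
    # Made-up file containing 0xa4, which is not a valid UTF-8 start byte,
    # the same kind of byte reported in the traceback above.
    bad_file = Path(tmp) / "latin1_script.py"
    bad_file.write_bytes(b"name = '\xa4'\n")
    try:
        read_source(bad_file)
        message = ""
    except ValueError as exc:
        message = str(exc)
```

The resulting message includes both the file path and the original codec error, so the user can find and re-encode the file.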

iamgd67 added 2 commits January 22, 2020 16:54

add python script encoding config support, to handle some error like …

0150d3b

…`UnicodeDecodeError: 'gbk' codec can't decode byte 0xac in position 797: illegal multibyte sequence`

use codecs to support python2

9e268fd

test that will fail on windows

48febc1

iamgd67 added 12 commits June 22, 2020 17:40

Merge branch 'master' of https://github.com/r1chardj0n3s/pip-check-reqs

b1fc672

update and fix

564ffbf

rebase

f895eff

use codecs to support python2

ebf5524

Merge remote-tracking branch 'origin/encodingConfigSupport' into enco…

d3dadb8

…dingConfigSupport # Conflicts: # pip_check_reqs/find_extra_reqs.py # pip_check_reqs/find_missing_reqs.py

test fix

b5ef10a

test fix

77d580b

Merge remote-tracking branch 'origin/encodingConfigSupport' into enco…

e545db2

…dingConfigSupport

fix lint check

46b63a9

fix lint check

4b73439

path fix

3c19d6e

improve test coverage

b83e755

adamtheturtle mentioned this pull request Mar 20, 2021

Always read python source using UTF-8 #59

Merged

adamtheturtle force-pushed the master branch from f53f3f5 to a7b797e Compare September 10, 2023 08:55
