Open
Description
Hi! I am running git-filter-repo on a large mono-repo. In the past, many large files (images, presentations, etc.) were checked in. Many of these files were deleted years ago, but they still bloat the repository size.
Process:
I ran the following two commands on a fresh clone:
git-filter-repo --analyze
git-filter-repo --strip-blobs-bigger-than 10M --paths-from-file .git\filter-repo\analysis\path-deleted-sizes.txt --invert-paths
Output & logs:
I received the following output:
NOTICE: Removing 'origin' remote; see 'Why is my origin removed?'
in the manual if you want to push back there.
(was https://gitlab.XXX.com/XXX/XXX.git)
Processed 640925 blob sizes
Parsed 189732 commitsfatal: invalid path 'ai-test/done/features.csv:Zone.Identifier'
fast-import: dumping crash report to .git/fast_import_crash_69140
Traceback (most recent call last):
File "<frozen runpy>", line 198, in _run_module_as_main
File "<frozen runpy>", line 88, in _run_code
File "C:\Users\XXX\.local\bin\git-filter-repo.exe\__main__.py", line 10, in <module>
File "C:\Users\XXX\AppData\Roaming\uv\tools\git-filter-repo\Lib\site-packages\git_filter_repo.py", line 4973, in main
filter.run()
File "C:\Users\XXX\AppData\Roaming\uv\tools\git-filter-repo\Lib\site-packages\git_filter_repo.py", line 4892, in run
self._parser.run(self._input, self._output)
File "C:\Users\XXX\AppData\Roaming\uv\tools\git-filter-repo\Lib\site-packages\git_filter_repo.py", line 1527, in run
self._parse_commit()
File "C:\Users\XXX\AppData\Roaming\uv\tools\git-filter-repo\Lib\site-packages\git_filter_repo.py", line 1378, in _parse_commit
self._commit_callback(commit, aux_info)
File "C:\Users\XXX\AppData\Roaming\uv\tools\git-filter-repo\Lib\site-packages\git_filter_repo.py", line 4125, in _tweak_commit
self._insert_into_stream(commit)
File "C:\Users\XXX\AppData\Roaming\uv\tools\git-filter-repo\Lib\site-packages\git_filter_repo.py", line 4865, in _insert_into_stream
self._parser.insert(obj)
File "C:\Users\XXX\AppData\Roaming\uv\tools\git-filter-repo\Lib\site-packages\git_filter_repo.py", line 1505, in insert
obj.dump(self._output)
File "C:\Users\XXX\AppData\Roaming\uv\tools\git-filter-repo\Lib\site-packages\git_filter_repo.py", line 820, in dump
change.dump(file_)
File "C:\Users\XXX\AppData\Roaming\uv\tools\git-filter-repo\Lib\site-packages\git_filter_repo.py", line 723, in dump
file_.write(b'M %s %s %s\n' % (self.mode, self.blob_id, quoted_filename))
BrokenPipeError: [Errno 32] Broken pipe
The fast_import_crash_69140 file contains the following:
fast-import crash report:
fast-import process: 69140
parent process : 1
at 2025-09-19 13:25:52 +0000
fatal: invalid path 'ai-test/done/features.csv:Zone.Identifier'
Most Recent Commands Before Crash
---------------------------------
[TRUNCATED FOR READABILITY]
M 100644 31efdb4edde5f5ade2597cf07e5848f18a38e861 ai-test/done/091.txt
M 100644 08382c0d556fd2e051895e99e8ba7bd2578bf893 ai-test/done/094.txt
M 100644 d261bc7accb4c2cd00f6881b3144a2a9f00557b7 ai-test/done/095.txt
M 100644 6410432760f388ffd0889827d4f0cca90d03a498 ai-test/done/104.txt
M 100644 338722d837b8bacb9aa40f6d3f3a361761557361 ai-test/done/109.txt
M 100644 42da71ca83de2a863408fab8a6a22e8362682a64 ai-test/done/features.csv
* M 100644 e69de29bb2d1d6434b8b29ae775ad8c2e48c5391 ai-test/done/features.csv:Zone.Identifier
[TRUNCATED FOR READABILITY]
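For longer crash reports it can help to pull out the offending command automatically. The following is a minimal sketch, assuming the report marks the command that was being processed with a leading `* ` (as in the excerpt above) and that the path is not quoted. The function names `find_crashing_command` and `path_of_modify_command` are made up for illustration and are not part of git or git-filter-repo.

```python
def find_crashing_command(report_text):
    """Return the command marked with '* ' in a crash report, or None."""
    for line in report_text.splitlines():
        if line.startswith("* "):
            # Drop the two-character marker and keep the command itself.
            return line[2:]
    return None


def path_of_modify_command(command):
    """Extract the path from an 'M <mode> <dataref> <path>' command.

    Assumption: the path is unquoted. Paths with special characters may be
    written in quoted form, which this sketch does not decode.
    """
    parts = command.split(" ", 3)
    if len(parts) == 4 and parts[0] == "M":
        return parts[3]
    return None


if __name__ == "__main__":
    sample = (
        "  M 100644 42da71ca83de2a863408fab8a6a22e8362682a64 ai-test/done/features.csv\n"
        "* M 100644 e69de29bb2d1d6434b8b29ae775ad8c2e48c5391 "
        "ai-test/done/features.csv:Zone.Identifier\n"
    )
    command = find_crashing_command(sample)
    print(path_of_modify_command(command))
```

Run against the excerpt above, this should point at the `features.csv:Zone.Identifier` entry.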
Assumed error cause:
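It seems that git fast-import on Windows does not accept a path with a ':' in its name. The ':Zone.Identifier' suffix looks like the name of an NTFS alternate data stream (the marker Windows attaches to downloaded files) that ended up in the history as a regular file.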
Workaround:
Removing the entry referencing features.csv:Zone.Identifier from path-deleted-sizes.txt and re-running the command.
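If several such entries exist, the manual edit could be scripted. The following is only a sketch of the workaround above, not a fix for the underlying problem: it assumes the offending lines end with `:Zone.Identifier`, and the helper names `drop_zone_identifier_entries` and `filter_paths_file` are hypothetical.

```python
from pathlib import Path

# Assumption: every problematic entry ends with this suffix.
ADS_SUFFIX = ":Zone.Identifier"


def drop_zone_identifier_entries(lines):
    """Return only the lines that do not end with ':Zone.Identifier'."""
    return [line for line in lines if not line.rstrip().endswith(ADS_SUFFIX)]


def filter_paths_file(src, dst):
    """Copy src to dst without ':Zone.Identifier' entries.

    Returns the number of lines that were dropped.
    """
    src_lines = Path(src).read_text(encoding="utf-8").splitlines(keepends=True)
    kept = drop_zone_identifier_entries(src_lines)
    Path(dst).write_text("".join(kept), encoding="utf-8")
    return len(src_lines) - len(kept)
```

Writing to a separate output file keeps the original analysis report untouched, so the filtered copy can be passed to `--paths-from-file` instead.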
Additional information:
- OS: Windows 11 Business
- OS version: 24H2, OS build 26100.6584
- git version: 2.50.0.windows.1
- git-filter-repo version: 2.47.0
corbin-poteet