Version 2.0.0b5 by cdgriffith · Pull Request #114 · cdgriffith/puremagic

cdgriffith · 2025-07-05T21:03:48Z

Add sndhdr tests #85
Add dynamic text type tests

Copilot

Pull request overview

This PR adds support for sndhdr audio file tests and enhances text type detection with dynamic checks for version 2.0.0b5. The changes include significant refactoring of internal functions from private (underscore-prefixed) to public names, new scanner implementations, and improved test coverage.

Key Changes:

Added new sndhdr_scanner for audio file detection
Enhanced text scanner with CSV detection and format-specific checks
Improved Python scanner with better code pattern recognition
Added JSON array support to json_scanner
Refactored internal functions to use public naming
Added new test cases for audio formats and dynamic text types

Reviewed changes

Copilot reviewed 13 out of 28 changed files in this pull request and generated 7 comments.


File	Description
test/test_scanners.py	Added sndhdr scanner tests and updated text scanner assertions to use lowercase text type names
test/test_main.py	Removed deprecated imghdr compatibility tests
test/test_common_extensions.py	Updated function calls to match renamed public API
test/resources/system/test_list.json	Added test JSON file with array structure
test/resources/audio/*.mp3	Added MP3 test files for audio detection
test/resources/audio/test.sndr	Added SNDR format test file
test/resources/audio/test.opus	Added Opus format test file
pyproject.toml	Disabled black and flake8 linting in favor of Ruff
puremagic/scanners/text_scanner.py	Major refactoring with CSV detection and dynamic format checks
puremagic/scanners/sndhdr_scanner.py	New scanner for audio formats
puremagic/scanners/python_scanner.py	Enhanced Python code detection with keyword/pattern matching
puremagic/scanners/json_scanner.py	Added support for JSON arrays
puremagic/main.py	Version bump, function renames, new scanner integration, error type changes
CHANGELOG.md	Updated with issue #85 reference
.gitignore	Added VS Code exclusions


Copilot · 2025-12-18T12:02:59Z

puremagic/scanners/text_scanner.py


+    #
+    # # Check for quotes that might indicate CSV
+    # has_quoted_fields = '"' in text and (f'"{best_delimiter}' in text or f'{best_delimiter}\"')
+    #
+


There are commented-out code blocks left in the production code. These should either be removed if they're no longer needed, or completed and uncommented if they serve a purpose. Leaving commented code in production reduces code quality and maintainability.

Suggested change

-    #
-    # # Check for quotes that might indicate CSV
-    # has_quoted_fields = '"' in text and (f'"{best_delimiter}' in text or f'{best_delimiter}\"')
-    #

Copilot · 2025-12-18T12:02:59Z

puremagic/main.py

+            result = catch_all_deep_scan(filename, head, foot)
        except Exception:
-            pass
+            raise


The exception handler now re-raises every exception instead of silently passing ('pass' was replaced with 'raise'). This may surface errors to users that were previously swallowed. That helps debugging, but it is a behavior change that could break existing callers, so it should be intentional and documented.

Suggested change

-            raise
+            # Intentionally suppress exceptions from catch-all deep scan to
+            # maintain backward-compatible behavior and continue fallback logic.
+            pass
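
To make the behavioral difference concrete, here is a minimal sketch. `flaky_deep_scan`, `scan_suppressing`, and `scan_propagating` are hypothetical stand-ins, not puremagic functions; only the contrast between `pass` and `raise` inside `except Exception:` comes from the diff above.

```python
def flaky_deep_scan(data: bytes) -> str:
    """Hypothetical stand-in for a deep scan that can fail on odd input."""
    if not data:
        raise ValueError("empty input")
    return "matched"


def scan_suppressing(data: bytes):
    """Old behavior: swallow the error so the caller can fall back."""
    result = None
    try:
        result = flaky_deep_scan(data)
    except Exception:
        # Deliberately ignored so that fallback logic can continue.
        pass
    return result


def scan_propagating(data: bytes):
    """New behavior: the error reaches the caller."""
    try:
        return flaky_deep_scan(data)
    except Exception:
        # A handler containing only a bare `raise` behaves the same as
        # having no handler at all.
        raise
```

With empty input, `scan_suppressing` returns `None` while `scan_propagating` raises `ValueError`, which is the change existing callers would need to handle.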

Copilot · 2025-12-18T12:03:00Z

puremagic/main.py

    filename: os.PathLike | str,
    head=None,
    foot=None,
+    confidence=0,


The function signature gains a new parameter, 'confidence=0', which could affect compatibility if this function is called elsewhere. Ensure that all call sites have been updated to handle it.

Copilot · 2025-12-18T12:03:00Z

puremagic/main.py

            for i, result in enumerate(matches):
                if i == 0:
-                    print("\n\tBest Match")
+                    print("\n\tDeepscan Match" if int(result.confidence == 1) else "\n\tBest Match")


The 'int(...)' cast around 'result.confidence == 1' is redundant. The comparison already yields a boolean, and 'int(True)' is 1, which is truthy, so "Deepscan Match" is printed exactly when confidence equals 1; the logic is not inverted. Dropping the cast and writing 'result.confidence == 1.0' directly makes the intent clearer.

Suggested change

-                    print("\n\tDeepscan Match" if int(result.confidence == 1) else "\n\tBest Match")
+                    print("\n\tDeepscan Match" if result.confidence == 1.0 else "\n\tBest Match")
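
A quick check of how the two forms behave. `Match`, `heading`, and `heading_simplified` are hypothetical names used only for this illustration; the two conditional expressions are the ones quoted above.

```python
from dataclasses import dataclass


@dataclass
class Match:
    """Hypothetical stand-in for a result object; only `confidence` matters here."""
    confidence: float


def heading(result: Match) -> str:
    # Form from the PR, with the redundant int() cast.
    return "Deepscan Match" if int(result.confidence == 1) else "Best Match"


def heading_simplified(result: Match) -> str:
    # Form from the suggested change.
    return "Deepscan Match" if result.confidence == 1.0 else "Best Match"


# Both forms choose the same branch for every confidence value tried.
for value in (0, 0.1, 0.9, 1, 1.0):
    assert heading(Match(value)) == heading_simplified(Match(value))
```

Because `int(True)` is 1 and `int(False)` is 0, both forms select the same branch, so the suggested change is a readability improvement, not a behavior fix.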

Copilot · 2025-12-18T12:03:00Z

puremagic/scanners/mpeg_audio_scanner.py

+                    if 1700 <= year_int <= 3000:
+                        self.tags.append("ID3v1")
+                        return True
+            except ValueError:


'except' clause does nothing but pass and there is no explanatory comment.

Suggested change

-            except ValueError:
+            except ValueError:
+                # If the year string cannot be converted to an int, treat it as an invalid/missing year.
+                # We intentionally ignore this and fall through to returning None below.
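
As a rough illustration of the pattern under review, the sketch below validates the year field of an ID3v1 tag. The function name `id3v1_year` is hypothetical and it returns the year instead of recording a tag as the scanner in the PR does; the 1700 to 3000 range is taken from the diff, and the field offsets follow the standard 128-byte ID3v1 layout.

```python
def id3v1_year(tail: bytes):
    """Return the year from an ID3v1 tag at the end of `tail`, or None.

    ID3v1 layout: "TAG" (3 bytes), title (30), artist (30), album (30),
    year (4), comment (30), genre (1), for 128 bytes in total.
    """
    tag = tail[-128:]
    if len(tag) < 128 or not tag.startswith(b"TAG"):
        return None
    year_text = tag[93:97].decode("ascii", errors="replace").strip("\x00 ")
    try:
        year = int(year_text)
    except ValueError:
        # Empty or non-numeric year field: treat the year as missing.
        return None
    # Same plausibility window as the check in the diff above.
    return year if 1700 <= year <= 3000 else None
```

Returning `None` explicitly inside the handler, with a comment, documents that the failure is expected and not an oversight.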

Copilot · 2025-12-18T12:03:01Z

puremagic/scanners/sndhdr_scanner.py

+                confidence=0.1,  # Lower confidence due to simple format
+            )
+    except (IndexError, struct.error):
+        pass


'except' clause does nothing but pass and there is no explanatory comment.

Suggested change

-        pass
+        # If the header is too short or malformed for an SNDR rate, treat it as
+        # not matching SNDR and fall through to other format detection.
+        return test_hcom(head)
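
A minimal sketch of the same idea: a header check that documents why a short or malformed header is ignored. The function name `sndr_rate` is hypothetical, and the two leading zero bytes and the 4000 to 25000 rate bounds are assumptions modelled on the legacy standard-library `sndhdr` check, not taken from this PR.

```python
import struct


def sndr_rate(head: bytes):
    """Return a plausible SNDR sample rate read from `head`, or None."""
    if head[:2] != b"\x00\x00":
        return None
    try:
        # Little-endian unsigned 16-bit sample rate at bytes 2..3.
        (rate,) = struct.unpack("<H", head[2:4])
    except struct.error:
        # Header too short to hold the rate: treat it as not SNDR so the
        # caller can move on to other format checks.
        return None
    return rate if 4000 <= rate <= 25000 else None
```

Slicing never raises `IndexError`, so in this sketch only `struct.error` needs handling; the PR catches both because its parsing code may index bytes directly.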

Copilot · 2025-12-18T12:03:01Z

puremagic/scanners/text_scanner.py

+    try:
+        dialect = csv.Sniffer().sniff(text, delimiters=''.join(potential_delimiters))
+        csv_sniffer_result = dialect.delimiter
+    except Exception:


'except' clause does nothing but pass and there is no explanatory comment.

Suggested change

-    except Exception:
+    except Exception:
+        # csv.Sniffer can fail on non-standard or truncated input; in that case
+        # we ignore the error and rely solely on the heuristic delimiter detection below.
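
The fallback described in that comment could look roughly like the sketch below. `detect_delimiter` and its counting heuristic are hypothetical, not the scanner's actual logic, and the sketch catches `csv.Error` (which `csv.Sniffer.sniff` raises when it cannot determine a delimiter) where the PR catches the broader `Exception`.

```python
import csv


def detect_delimiter(text: str, candidates=(",", ";", "\t", "|")):
    """Return the most likely delimiter in `text`, or None if none is found."""
    try:
        dialect = csv.Sniffer().sniff(text, delimiters="".join(candidates))
        return dialect.delimiter
    except csv.Error:
        # The sniffer could not decide (for example on truncated input);
        # fall back to the simple counting heuristic below.
        pass

    # Hypothetical heuristic: choose the candidate that occurs most often.
    counts = {delim: text.count(delim) for delim in candidates}
    best = max(counts, key=counts.get)
    return best if counts[best] > 0 else None
```

Catching the narrower exception keeps unrelated bugs visible while still tolerating input the sniffer cannot classify.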

penguinpee and others added 8 commits July 5, 2025 09:22

Fix two minor issues with fastflix (#112)

cfe7458

More cleanup and prep

be2a78e

Add version printout

908c912

Adding file types from sndr

8c62627

Merge remote-tracking branch 'origin/develop' into develop

43759bc

version bump

b2897bf

Remove imghdr

2a98164

Add dynamic checks

971c780

NebularNerd mentioned this pull request Oct 26, 2025

I'm back! Need help with scanner basics. #116

Open

MPEG Audio Scanner (aka MP3 Scanner) (#122)

f886e81

Copilot AI review requested due to automatic review settings December 18, 2025 11:57

Copilot started reviewing on behalf of cdgriffith December 18, 2025 11:57

Copilot AI reviewed Dec 18, 2025

cdgriffith wants to merge 9 commits into master from develop

3 participants
