normalize code before hashing #307

misrasaurabh1 · 2025-06-09T07:04:08Z

PR Type

Enhancement, Tests

Description

Normalize code before hashing using ast.unparse
Standardize environment checks and quoting
Update tests to use single quotes
Remove extraneous comments and blank lines

Changes walkthrough 📝

Relevant files

Enhancement

code_context_extractor.py `Normalize code before hashing` codeflash/context/code_context_extractor.py Imported ast for code processing Added ast.unparse normalization under HASHING context	+5/-1

Tests

test_code_context_extractor.py `Standardize test quoting and formatting` tests/test_code_context_extractor.py Replaced double quotes with single quotes Removed unnecessary blank lines and comments Consolidated multi-line signatures into single lines	+21/-57

Need help?
Type /help how to ... in the comments thread for any questions about PR-Agent usage.
Check out the documentation for more information.

github-actions · 2025-06-09T07:05:21Z

PR Reviewer Guide 🔍

Here are some key observations to aid the review process:

⏱️ Estimated effort to review: 3 🔵🔵🔵⚪⚪
🧪 No relevant tests
🔒 No security concerns identified
⚡ Recommended focus areas for review Missing import The new use of logging.warning requires an import of the logging module; without it calls to logging.warning will cause a NameError. if isinstance(node, cst.FunctionDef): Bare except Using a bare except around hash key generation catches all exceptions, including system exits or keyboard interrupts; consider catching specific exceptions instead. return None, False Unhandled parse errors Parsing and unparsing code with ast.parse/unparse may raise SyntaxError for invalid code; wrap in try/except to provide a fallback to the original code string. code = ast.unparse(ast.parse(code)) # Makes it standard return code

github-actions · 2025-06-09T07:06:05Z

PR Code Suggestions ✨

Explore these optional code suggestions:

Category	Suggestion	Impact
Possible issue	Catch AST parse errors Wrap the parsing and unparsing in a try/except block to avoid crashing on invalid syntax. Fallback to the original code if a SyntaxError occurs. codeflash/context/code_context_extractor.py [516] -code = ast.unparse(ast.parse(code)) # Makes it standard +try: + code = ast.unparse(ast.parse(code)) +except SyntaxError: + pass Suggestion importance[1-10]: 6 __ Why: Catching a `SyntaxError` when unparsing ensures invalid snippets don’t crash the extractor and preserves the original code as a fallback.	Low
General	Trim normalized code whitespace Strip leading and trailing whitespace after normalization to ensure consistent code hashes. This removes incidental formatting differences. codeflash/context/code_context_extractor.py [516] -code = ast.unparse(ast.parse(code)) # Makes it standard +code = ast.unparse(ast.parse(code)).strip() Suggestion importance[1-10]: 4 __ Why: Adding `.strip()` removes incidental formatting differences, improving hashing consistency with minimal impact.	Low

KRRT7 · 2025-06-09T07:23:08Z

codeflash/context/code_context_extractor.py

+        code = str(filtered_node.code)
+        if code_context_type == CodeContextType.HASHING:
+            code = ast.unparse(ast.parse(code))  # Makes it standard
+        return code


codeflash/code_utils/code_replacer.py has normalize_code

yes but it did more things as well which i did not want to happen. I have already removed those sections earlier in the libcst processing code

normalize code before hashing

51c936f

github-actions bot added the Review effort 3/5 label Jun 9, 2025

edge case for python 39

7167d2b

misrasaurabh1 merged commit 8fe970c into main Jun 9, 2025
16 checks passed

KRRT7 reviewed Jun 9, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

normalize code before hashing #307

normalize code before hashing #307

Uh oh!

misrasaurabh1 commented Jun 9, 2025 •

edited by github-actions bot

Loading

Uh oh!

github-actions bot commented Jun 9, 2025

Uh oh!

github-actions bot commented Jun 9, 2025

Uh oh!

Uh oh!

KRRT7 Jun 9, 2025

Uh oh!

misrasaurabh1 Jun 9, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

normalize code before hashing #307

normalize code before hashing #307

Uh oh!

Conversation

misrasaurabh1 commented Jun 9, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Type

Description

Changes walkthrough 📝

Uh oh!

github-actions bot commented Jun 9, 2025

PR Reviewer Guide 🔍

Uh oh!

github-actions bot commented Jun 9, 2025

PR Code Suggestions ✨

Uh oh!

Uh oh!

KRRT7 Jun 9, 2025

Choose a reason for hiding this comment

Uh oh!

misrasaurabh1 Jun 9, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

misrasaurabh1 commented Jun 9, 2025 •

edited by github-actions bot

Loading