Replace stdlib json with orjson across all tritonparse modules (#362) by FindHao · Pull Request #362 · meta-pytorch/tritonparse

FindHao · 2026-03-20T23:41:32Z

Summary:

Replace all stdlib json usage with orjson across tritonparse for faster JSON serialization/deserialization.

Resolves: #266

Changes:

Add orjson>=3.9 as a dependency in pyproject.toml
Add fbsource//third-party/pypi/orjson:orjson to BUCK deps
Replace import json with import orjson in 20 Python files
Handle orjson.dumps() returning bytes (added .decode() where string output is needed)
Map API differences: indent=2 → OPT_INDENT_2, sort_keys=True → OPT_SORT_KEYS
Replace json.load(f) → orjson.loads(f.read()) and json.dump(obj, f) → f.write(orjson.dumps(obj))
Preserve "import json" in reproducer-generated user code (function_extractor.py)

Reviewed By: xuzhao9

Differential Revision: D97564498

meta-codesync · 2026-03-20T23:41:55Z

@FindHao has exported this pull request. If you are a Meta employee, you can view the originating Diff in D97564498.

Copilot

Pull request overview

This PR replaces Python stdlib json usage with orjson across tritonparse to improve JSON parse/serialization performance, updating call sites to account for orjson’s API differences and bytes return type from dumps().

Changes:

Introduce orjson>=3.9 as a project dependency and migrate imports/usages across modules.
Update JSON read/write patterns (e.g., load/dump equivalents, indentation/sort options, .decode() where string output is required).
Keep NDJSON output behavior broadly consistent while moving formatting and tracing code paths to orjson.

Reviewed changes

Copilot reviewed 20 out of 20 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
tritonparse/validation/schema_loader.py	Load packaged JSON schemas using `orjson`.
tritonparse/validation/json_validator.py	Validate NDJSON traces using `orjson` parsing and error types.
tritonparse/tools/prettify_ndjson.py	Parse NDJSON and write prettified JSON via `orjson` options.
tritonparse/tools/extract_irs.py	Replace line-level JSON parsing with `orjson` and update exception handling.
tritonparse/structured_logging.py	Switch structured trace formatting/parsing to `orjson`.
tritonparse/reproducer/utils.py	Read JSON args files using `orjson`.
tritonparse/reproducer/placeholder_replacer.py	Embed JSON blocks using `orjson` indentation output.
tritonparse/reproducer/multi_file_analyzer.py	Write analysis JSON output using `orjson` indentation.
tritonparse/parse/trace_processor.py	Use `orjson` for parsing/writing NDJSON throughout trace processing.
tritonparse/parse/torch_trace_parser.py	Parse embedded JSON metadata using `orjson`.
tritonparse/parse/sourcemap_utils.py	Use `orjson` for stable serialization in launch hash computation.
tritonparse/parse/event_diff.py	Use `orjson` for value-keying and NDJSON emission in diff generation.
tritonparse/parse/common.py	Write parse output mapping JSON using `orjson`.
tritonparse/info/cli.py	Parse CLI list-like inputs using `orjson`.
tritonparse/diff/output/event_writer.py	Emit diff NDJSON using `orjson`.
tritonparse/diff/core/event_matcher.py	Convert event↔string roundtrip using `orjson` around sourcemap generation.
tritonparse/bisect/state.py	Persist/load bisect state via `orjson`.
tritonparse/ai/parsers.py	Parse extracted JSON from text using `orjson`.
tritonparse/ai/client.py	Parse JSON responses/stream events using `orjson`.
pyproject.toml	Add `orjson>=3.9` to project dependencies.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-20T23:48:19Z

tritonparse/parse/sourcemap_utils.py

    # Sort keys for stable serialization
-    stable_json = json.dumps(stable_event, sort_keys=True, separators=(",", ":"))
+    stable_json = orjson.dumps(stable_event, option=orjson.OPT_SORT_KEYS).decode()
    return hashlib.sha256(stable_json.encode()).hexdigest()[:16]


compute_launch_event_hash() is documented as producing a stable hash, but switching from json.dumps(..., sort_keys=True, separators=(",", ":")) to orjson.dumps(..., OPT_SORT_KEYS) changes the canonical bytes for the same logical event (notably Unicode escaping and float rendering), so launch_group_hash values will differ from those produced by prior versions. If these hashes are persisted/consumed across runs or versions, consider preserving the previous canonicalization (e.g., matching stdlib JSON escaping behavior) or explicitly documenting this as a breaking change.

Copilot · 2026-03-20T23:48:19Z

tritonparse/validation/schema_loader.py

    """Load a JSON schema file from the schemas package."""
    ref = pkg_files(_SCHEMAS_PACKAGE).joinpath(filename)
-    return json.loads(ref.read_text(encoding="utf-8"))
+    return orjson.loads(ref.read_text(encoding="utf-8"))


For the stated performance goal, consider avoiding read_text() here. orjson.loads() can consume bytes directly, so using ref.read_bytes() (or opening the resource in binary) avoids creating an intermediate Python str and an extra UTF-8 encode step inside orjson.

Suggested change

return orjson.loads(ref.read_text(encoding="utf-8"))

return orjson.loads(ref.read_bytes())

Copilot · 2026-03-20T23:48:19Z

tritonparse/diff/output/event_writer.py

    with open(file_path, "a") as f:
-        f.write(json.dumps(diff_event) + "\n")
+        f.write(orjson.dumps(diff_event).decode() + "\n")



orjson.dumps() will raise on non-finite floats (NaN/Inf). Diff events include tensor_value_diff metrics (e.g., cosine_similarity/rmse) that can legitimately become NaN/Inf, so these writes can fail at runtime. Consider sanitizing the diff_event (recursively converting non-finite floats to strings/null) before dumping, or ensure the analyzer never produces NaN/Inf values in the serialized result.

Summary: Replace all stdlib `json` usage with `orjson` across tritonparse for faster JSON serialization/deserialization. Resolves: #266 Changes: - Add `orjson>=3.9` as a dependency in `pyproject.toml` - Add `fbsource//third-party/pypi/orjson:orjson` to BUCK deps - Replace `import json` with `import orjson` in 20 Python files - Handle `orjson.dumps()` returning `bytes` (added `.decode()` where string output is needed) - Map API differences: `indent=2` → `OPT_INDENT_2`, `sort_keys=True` → `OPT_SORT_KEYS` - Replace `json.load(f)` → `orjson.loads(f.read())` and `json.dump(obj, f)` → `f.write(orjson.dumps(obj))` - Preserve `"import json"` in reproducer-generated user code (`function_extractor.py`) Differential Revision: D97564498

Summary: Pull Request resolved: #362 Replace all stdlib `json` usage with `orjson` across tritonparse for faster JSON serialization/deserialization. Resolves: #266 Changes: - Add `orjson>=3.9` as a dependency in `pyproject.toml` - Add `fbsource//third-party/pypi/orjson:orjson` to BUCK deps - Replace `import json` with `import orjson` in 20 Python files - Handle `orjson.dumps()` returning `bytes` (added `.decode()` where string output is needed) - Map API differences: `indent=2` → `OPT_INDENT_2`, `sort_keys=True` → `OPT_SORT_KEYS` - Replace `json.load(f)` → `orjson.loads(f.read())` and `json.dump(obj, f)` → `f.write(orjson.dumps(obj))` - Preserve `"import json"` in reproducer-generated user code (`function_extractor.py`) Differential Revision: D97564498

Summary: Replace all stdlib `json` usage with `orjson` across tritonparse for faster JSON serialization/deserialization. Resolves: #266 Changes: - Add `orjson>=3.9` as a dependency in `pyproject.toml` - Add `fbsource//third-party/pypi/orjson:orjson` to BUCK deps - Replace `import json` with `import orjson` in 20 Python files - Handle `orjson.dumps()` returning `bytes` (added `.decode()` where string output is needed) - Map API differences: `indent=2` → `OPT_INDENT_2`, `sort_keys=True` → `OPT_SORT_KEYS` - Replace `json.load(f)` → `orjson.loads(f.read())` and `json.dump(obj, f)` → `f.write(orjson.dumps(obj))` - Preserve `"import json"` in reproducer-generated user code (`function_extractor.py`) Differential Revision: D97564498

Summary: Pull Request resolved: #362 Replace all stdlib `json` usage with `orjson` across tritonparse for faster JSON serialization/deserialization. Resolves: #266 Changes: - Add `orjson>=3.9` as a dependency in `pyproject.toml` - Add `fbsource//third-party/pypi/orjson:orjson` to BUCK deps - Replace `import json` with `import orjson` in 20 Python files - Handle `orjson.dumps()` returning `bytes` (added `.decode()` where string output is needed) - Map API differences: `indent=2` → `OPT_INDENT_2`, `sort_keys=True` → `OPT_SORT_KEYS` - Replace `json.load(f)` → `orjson.loads(f.read())` and `json.dump(obj, f)` → `f.write(orjson.dumps(obj))` - Preserve `"import json"` in reproducer-generated user code (`function_extractor.py`) Differential Revision: D97564498

Summary: Replace all stdlib `json` usage with `orjson` across tritonparse for faster JSON serialization/deserialization. Resolves: #266 Changes: - Add `orjson>=3.9` as a dependency in `pyproject.toml` - Add `fbsource//third-party/pypi/orjson:orjson` to BUCK deps - Replace `import json` with `import orjson` in 20 Python files - Handle `orjson.dumps()` returning `bytes` (added `.decode()` where string output is needed) - Map API differences: `indent=2` → `OPT_INDENT_2`, `sort_keys=True` → `OPT_SORT_KEYS` - Replace `json.load(f)` → `orjson.loads(f.read())` and `json.dump(obj, f)` → `f.write(orjson.dumps(obj))` - Preserve `"import json"` in reproducer-generated user code (`function_extractor.py`) Differential Revision: D97564498

FindHao · 2026-03-24T16:24:51Z

@copilot review again

Copilot · 2026-03-24T16:24:59Z

@FindHao I've opened a new pull request, #364, to work on those changes. Once the pull request is ready, I'll request review from you.

Summary: Replace all stdlib `json` usage with `orjson` across tritonparse for faster JSON serialization/deserialization. Resolves: #266 Changes: - Add `orjson>=3.9` as a dependency in `pyproject.toml` - Add `fbsource//third-party/pypi/orjson:orjson` to BUCK deps - Replace `import json` with `import orjson` in 20 Python files - Handle `orjson.dumps()` returning `bytes` (added `.decode()` where string output is needed) - Map API differences: `indent=2` → `OPT_INDENT_2`, `sort_keys=True` → `OPT_SORT_KEYS` - Replace `json.load(f)` → `orjson.loads(f.read())` and `json.dump(obj, f)` → `f.write(orjson.dumps(obj))` - Preserve `"import json"` in reproducer-generated user code (`function_extractor.py`) Reviewed By: xuzhao9 Differential Revision: D97564498

Summary: Pull Request resolved: #362 Replace all stdlib `json` usage with `orjson` across tritonparse for faster JSON serialization/deserialization. Resolves: #266 Changes: - Add `orjson>=3.9` as a dependency in `pyproject.toml` - Add `fbsource//third-party/pypi/orjson:orjson` to BUCK deps - Replace `import json` with `import orjson` in 20 Python files - Handle `orjson.dumps()` returning `bytes` (added `.decode()` where string output is needed) - Map API differences: `indent=2` → `OPT_INDENT_2`, `sort_keys=True` → `OPT_SORT_KEYS` - Replace `json.load(f)` → `orjson.loads(f.read())` and `json.dump(obj, f)` → `f.write(orjson.dumps(obj))` - Preserve `"import json"` in reproducer-generated user code (`function_extractor.py`) Reviewed By: xuzhao9 Differential Revision: D97564498

meta-codesync · 2026-03-24T22:48:54Z

This pull request has been merged in 1c5c71e.

Copilot AI review requested due to automatic review settings March 20, 2026 23:41

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Mar 20, 2026

meta-codesync bot added fb-exported meta-exported labels Mar 20, 2026

Copilot started reviewing on behalf of FindHao March 20, 2026 23:41 View session

Copilot AI reviewed Mar 20, 2026

View reviewed changes

meta-codesync bot changed the title ~~Replace stdlib json with orjson across all tritonparse modules~~ Replace stdlib json with orjson across all tritonparse modules (#362) Mar 21, 2026

meta-codesync bot force-pushed the export-D97564498 branch from 399c3ad to 271ace5 Compare March 21, 2026 00:02

meta-codesync bot force-pushed the export-D97564498 branch from 271ace5 to 395a11f Compare March 21, 2026 00:03

FindHao force-pushed the export-D97564498 branch from 395a11f to 50da3f2 Compare March 21, 2026 00:06

meta-codesync bot force-pushed the export-D97564498 branch from 50da3f2 to 0a3d171 Compare March 22, 2026 23:51

FindHao force-pushed the export-D97564498 branch from 0a3d171 to f9d1131 Compare March 22, 2026 23:54

meta-codesync bot force-pushed the export-D97564498 branch from f9d1131 to 393d67c Compare March 24, 2026 12:02

Copilot AI mentioned this pull request Mar 24, 2026

[WIP] Replace stdlib json with orjson across all tritonparse modules #364

Closed

meta-codesync bot force-pushed the export-D97564498 branch from 393d67c to 863704d Compare March 24, 2026 16:30

meta-codesync bot force-pushed the export-D97564498 branch from 863704d to e8c4ea9 Compare March 24, 2026 16:35

FindHao force-pushed the export-D97564498 branch from e8c4ea9 to fb41266 Compare March 24, 2026 16:38

meta-codesync bot closed this in 1c5c71e Mar 24, 2026

facebook-github-tools bot added the Merged label Mar 24, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Replace stdlib json with orjson across all tritonparse modules (#362)#362

Replace stdlib json with orjson across all tritonparse modules (#362)#362
FindHao wants to merge 1 commit intomainfrom
export-D97564498

FindHao commented Mar 20, 2026 •

edited by meta-codesync bot

Loading

Uh oh!

meta-codesync bot commented Mar 20, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Mar 20, 2026

Uh oh!

Copilot AI Mar 20, 2026

Uh oh!

Copilot AI Mar 20, 2026

Uh oh!

FindHao commented Mar 24, 2026

Uh oh!

Copilot AI commented Mar 24, 2026

Uh oh!

meta-codesync bot commented Mar 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	return orjson.loads(ref.read_text(encoding="utf-8"))
	return orjson.loads(ref.read_bytes())

Conversation

FindHao commented Mar 20, 2026 • edited by meta-codesync bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

meta-codesync bot commented Mar 20, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Mar 20, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 20, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 20, 2026

Choose a reason for hiding this comment

Uh oh!

FindHao commented Mar 24, 2026

Uh oh!

Copilot AI commented Mar 24, 2026

Uh oh!

meta-codesync bot commented Mar 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

FindHao commented Mar 20, 2026 •

edited by meta-codesync bot

Loading