fix(julia): port parameterized-type / qualified-def / qualified-import fixes to WASM by carlos-alm · Pull Request #1128 · optave/ops-codegraph-tool

carlos-alm · 2026-05-15T04:04:11Z

Summary

Port the three Julia extractor fixes from feat(native): port Julia extractor to Rust #1098 (native) to the WASM extractor at src/extractors/julia.ts per the dual-engine policy.
Adds a findBaseName helper that recurses through binary_expression / parametrized_type_expression / parameterized_identifier / type_parameter_list / type_argument_list wrappers, mirroring the native find_base_name.
Guards module prefixing with !base.includes('.') in handleFunctionDef and handleAssignment so qualified names like Base.show inside module Foo no longer become Foo.Base.show.
Updates handleImport to walk into selected_import children, treating scoped_identifier modules as the source and the trailing segment as the imported display name.
Adds a signatureCall helper to unwrap function_definition → signature → call_expression. Without this, findChild(node, 'call_expression') was matching the function body's first call (e.g. println(...)) and recording it as the function name — a latent bug not surfaced by the existing fixtures, but required for the qualified-def test to pass.
Also includes the three follow-up WASM fixes from fix(julia): port abstract-def / macro-def / signature-call WASM bugs #1130 (stacked PR merged into this branch): handleAbstractDef falls back to findBaseName(typeHead), handleMacroDef unwraps via signatureCall, and handleCall skips signature-parented call_expression nodes to avoid recording every long-form function name as a self-call.
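
For readers skimming the summary, the double-prefix guard can be sketched roughly as follows. This is a minimal illustration, not the code in src/extractors/julia.ts: the PR names a `qualifyName` helper and the `!base.includes('.')` check, but the signature, the `currentModule` parameter, and the body below are assumptions.

```typescript
// Hypothetical body for the helper the PR calls `qualifyName`.
// `currentModule` is an assumed parameter: the enclosing module name, or null at top level.
function qualifyName(base: string, currentModule: string | null): string {
  // A dotted name such as `Base.show` is already qualified, so it is left as written.
  if (!currentModule || base.includes('.')) return base;
  return `${currentModule}.${base}`;
}

const insideFoo = [
  qualifyName('greet', 'Foo'),     // unqualified: receives the module prefix
  qualifyName('Base.show', 'Foo'), // qualified: no second prefix
  qualifyName('greet', null),      // top level: nothing to prefix
];
```

Keeping the check on the name text (rather than on the node kind) matches the guard quoted above and stays independent of how the grammar labels qualified names.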

Test plan

npx vitest run tests/parsers/julia.test.ts — 10/10 pass, including the four new parity tests (extracts parameterized struct base name, qualified short-form method does not double-prefix, qualified function def does not double-prefix, selected_import handles qualified module).
cargo test --lib julia — native side still green (16/16).
Biome clean on touched files.

Closes #1111
Closes #1126

…t fixes to WASM

The native Julia extractor was fixed in #1098 for three issues that were already latent in the WASM extractor but not surfaced by the existing fixtures. Per the dual-engine policy, port the fixes so both engines produce identical results.

1. Parameterized struct names (`struct Vec{T} <: AbstractArray{T,1}`) no longer silently emit the raw type-head text as the definition name: `findBaseName` recurses through `binary_expression`, `parametrized_type_expression`, and related wrappers to locate the base identifier.
2. Qualified function defs / short-form methods inside a module no longer get double-prefixed: `function Base.show ... end` inside `module Foo` now records `Base.show` (not `Foo.Base.show`); same for short-form `Foo.bar(x, y) = x + y` inside `module Outer`.
3. `selected_import` with a qualified module (`import LinearAlgebra.BLAS: gemm`) now correctly records `LinearAlgebra.BLAS` as the import source and `gemm` as the imported name.

Also fixes a related latent bug: `findChild(node, 'call_expression')` on a `function_definition` was matching the body's first call (e.g. `println(...)`) instead of the signature, because the signature is wrapped in a `signature` node. Added a `signatureCall` helper mirroring the native code.

Closes #1111
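
The signature-vs-body confusion described in this commit message can be reproduced on a toy tree. The sketch below is an assumption-laden illustration: `SketchNode`, `node`, and this `findChild` are stand-ins for the real tree-sitter types, the tree shape is inferred from the description above rather than from parser output, and the `signatureCall` body is a guess at the helper's logic.

```typescript
// Minimal stand-in for a tree-sitter node (the real extractor uses TreeSitterNode).
interface SketchNode {
  type: string;
  text: string;
  children: SketchNode[];
}

const node = (type: string, text: string, children: SketchNode[] = []): SketchNode => ({
  type,
  text,
  children,
});

// First direct child of the given type, or null.
function findChild(parent: SketchNode, type: string): SketchNode | null {
  return parent.children.find((c) => c.type === type) ?? null;
}

// Hypothetical body for the helper the PR calls `signatureCall`.
function signatureCall(def: SketchNode): SketchNode | null {
  const signature = findChild(def, 'signature');
  if (signature) return findChild(signature, 'call_expression');
  // Defensive fallback for grammar drift only; not expected on real definitions.
  return findChild(def, 'call_expression');
}

// Assumed shape of: function greet(name) println(name) end
const greetDef = node('function_definition', '', [
  node('signature', 'greet(name)', [
    node('call_expression', 'greet(name)', [node('identifier', 'greet')]),
  ]),
  node('call_expression', 'println(name)', [node('identifier', 'println')]),
]);

// The naive lookup lands on the body call; the signature-aware one does not.
const naive = findChild(greetDef, 'call_expression');
const viaSignature = signatureCall(greetDef);
const naiveName = naive ? findChild(naive, 'identifier')?.text : undefined;
const signatureName = viaSignature ? findChild(viaSignature, 'identifier')?.text : undefined;
```

On this toy tree `naiveName` is the body call's name and `signatureName` is the function's own name, which is the difference the commit describes.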

claude · 2026-05-15T04:04:25Z

Claude finished @carlos-alm's task

PR Review: Julia WASM Extractor Bug Fixes

Todo List:

Phase 1: Root cause analysis - understand the problem being solved
Phase 2: Check for configuration changes that might mask issues
Phase 3: Verify test integrity and quality
Phase 4: Code quality review of Julia extractor implementation
Phase 5: Backlog compliance assessment
Provide final review recommendation

Root Cause Analysis

Problem Being Solved: This PR addresses real dual-engine parity bugs in the Julia WASM extractor. Three specific issues from #1098 (native engine fixes) needed to be ported to WASM:

Parameterized type base name extraction - struct Vec{T} was not extracting the base name "Vec" correctly
Qualified definition double-prefixing - function Base.show() inside module Foo was becoming Foo.Base.show instead of Base.show
Import resolution for qualified modules - import Foo.Bar: baz was not properly handling scoped identifiers as module sources

Why This Approach: The fix correctly implements dual-engine parity by mirroring the native Rust implementation. The approach adds targeted helper functions (findBaseName, signatureCall) and guards (!base.includes('.')) rather than broad refactoring, which is appropriate for a bug fix.

Risk Assessment: Low risk. This is correcting extraction accuracy to match the native engine - no behavioral changes to the query layer or API contracts.

Backlog Compliance

Zero-dep: ✓ — No new runtime dependencies, builds on existing tree-sitter infrastructure
Foundation-aligned: ✓ — Maintains dual-engine parity (P1: always-current) with no API changes (P5: embeddable-first)
Problem-fit: 5 — Directly prevents hallucinated dependencies by fixing extraction bugs that cause missing/incorrect symbols in the graph
Breaking: No — Pure internal bug fixes with no API or output changes
Tier: 1

Critical Concerns

None identified. This is a well-executed bug fix that addresses real dual-engine parity issues.

Strengths:

Targeted fixes: Each helper function addresses a specific extraction bug without over-engineering
Comprehensive testing: New tests directly validate the bug fixes with both positive and negative assertions
Clear documentation: Comments explain Julia grammar specifics and why each fix is needed
Maintains existing patterns: Follows established code style and structure

Code quality highlights:

findBaseName() properly handles Julia's complex parameterized type structures (lines 189-202)
qualifyName() prevents double-prefixing with a simple !base.includes('.') guard (lines 86-93)
signatureCall() fixes latent bug where body calls were matched instead of signature calls (lines 105-109)
Enhanced import handling correctly distinguishes module sources from imported names (lines 316-335)

Final Recommendation

Rating: ⭐⭐⭐⭐☆ (4/5)
Action: APPROVE
Reasoning: High-quality bug fix that addresses real dual-engine parity issues. It is well-tested, follows established patterns, and maintains code quality. The approach is appropriately targeted rather than over-engineered. The only minor consideration is that this is routine maintenance rather than exceptional work.

github-actions · 2026-05-15T04:10:44Z

Codegraph Impact Analysis

11 functions changed → 11 callers affected across 2 files

find_base_name in crates/codegraph-core/src/extractors/julia.rs:319 (4 transitive callers)
qualifyName in src/extractors/julia.ts:86 (4 transitive callers)
signatureCall in src/extractors/julia.ts:113 (4 transitive callers)
handleFunctionDef in src/extractors/julia.ts:119 (2 transitive callers)
handleAssignment in src/extractors/julia.ts:153 (2 transitive callers)
findBaseName in src/extractors/julia.ts:202 (4 transitive callers)
handleStructDef in src/extractors/julia.ts:217 (2 transitive callers)
handleAbstractDef in src/extractors/julia.ts:284 (2 transitive callers)
handleMacroDef in src/extractors/julia.ts:303 (2 transitive callers)
handleImport in src/extractors/julia.ts:327 (2 transitive callers)
handleCall in src/extractors/julia.ts:374 (2 transitive callers)

greptile-apps · 2026-05-15T04:11:10Z

Greptile Summary

This PR ports three Julia extractor correctness fixes from the native Rust engine (julia.rs) to the WASM TypeScript engine (julia.ts), maintaining the project's dual-engine parity policy. The changes also fix a latent signature-vs-body-call confusion in handleCall that the grammar's signature wrapper node had exposed.

Parameterized types: adds findBaseName / TYPE_HEAD_WRAPPERS to recurse through binary_expression, parametrized_type_expression, and parameterized_identifier nodes, replacing the fragile findChild(typeHead, 'identifier') that silently dropped Vec{T} <: AbstractArray{T,1} definitions.
Double-prefix guard: qualifyName / !base.includes('.') prevents function Base.show inside module Foo from being recorded as Foo.Base.show; signatureCall unwraps the signature wrapper to avoid matching a body call-expression as the function name.
Selected-import fix: handleImport now walks selected_import children and distinguishes the module source from the imported names, so import Foo.Bar: baz records source='Foo.Bar' and names=['baz'] rather than the previous mangled form.
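
As a rough illustration of the selected-import split, here is a sketch on a toy tree. Everything in it is an assumption: `SketchNode`, `node`, and `splitSelectedImport` are hypothetical names, and the child layout of `selected_import` (module first, imported names after) is inferred from the description above rather than taken from the grammar.

```typescript
// Minimal stand-in for a tree-sitter node (hypothetical; not the real TreeSitterNode).
interface SketchNode {
  type: string;
  text: string;
  children: SketchNode[];
}

const node = (type: string, text: string, children: SketchNode[] = []): SketchNode => ({
  type,
  text,
  children,
});

// Hypothetical helper: first name-like child is the module source,
// the remaining name-like children are the imported names.
function splitSelectedImport(
  selected: SketchNode,
): { source: string; names: string[] } | null {
  const nameLike = selected.children.filter(
    (c) => c.type === 'identifier' || c.type === 'scoped_identifier',
  );
  const [moduleNode, ...rest] = nameLike;
  if (!moduleNode) return null;
  // Use the trailing dotted segment as the display name of each import.
  const names = rest.map((c) => c.text.split('.').pop() ?? c.text);
  return { source: moduleNode.text, names };
}

// Assumed shape of: import LinearAlgebra.BLAS: gemm
const qualified = node('selected_import', 'LinearAlgebra.BLAS: gemm', [
  node('scoped_identifier', 'LinearAlgebra.BLAS'),
  node(':', ':'),
  node('identifier', 'gemm'),
]);

// Assumed shape of: import Base: show
const simple = node('selected_import', 'Base: show', [
  node('identifier', 'Base'),
  node(':', ':'),
  node('identifier', 'show'),
]);

const qualifiedImport = splitSelectedImport(qualified);
const simpleImport = splitSelectedImport(simple);
```

The key point is that the module never ends up in `names`, which is the shape the strengthened `import Base: show` test asserts.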

Confidence Score: 5/5

Safe to merge — all changes are correctness fixes with direct test coverage and a clean port from the already-verified native engine.

Every changed code path is covered by a new or updated test (10 tests, all passing). The logic mirrors the native Rust engine exactly, and the previous discussion threads have been addressed. No new behavioral regressions were introduced.

No files require special attention.

Important Files Changed

Filename	Overview
src/extractors/julia.ts	Core WASM extractor — adds qualifyName, signatureCall, findBaseName helpers and rewrites handleStructDef/handleAbstractDef/handleMacroDef/handleImport/handleCall; logic is correct and directly mirrors the native engine.
tests/parsers/julia.test.ts	Adds 10 targeted tests covering all new code paths: parameterized/non-parameterized struct inheritance, qualified name double-prefix, macro name extraction, signature-vs-call disambiguation, and selected_import with qualified module source.
crates/codegraph-core/src/extractors/julia.rs	Native Rust engine — removes type_parameter_list/type_argument_list from find_base_name wrappers and adds clarifying doc comments; no behavioral change.

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A[struct_definition node] --> B{findChild type_head?}
    B -- no --> SKIP[return early]
    B -- yes --> C{findChild binary_expression?}
    C -- yes --> D[collect non-operator sides]
    D --> E[findBaseName sides-0 -> nameNode]
    D --> F[findBaseName sides-1 -> supertypeNode]
    C -- no --> G[findBaseName typeHead -> nameNode]
    G --> H[supertypeNode = null]
    E --> I{nameNode null?}
    F --> I
    H --> I
    I -- yes --> SKIP
    I -- no --> J[iterate struct_definition children]
    J --> K{child type?}
    K -- typed_expression --> L[field property]
    K -- identifier --> M[untyped field property]
    K -- other --> N[skip]
    L --> O{supertypeNode?}
    M --> O
    N --> O
    O -- yes --> P[ctx.classes.push extends]
    O -- no --> Q[skip extends]
    P --> R[ctx.definitions.push struct]
    Q --> R

Reviews (6): Last reviewed commit: "fix(julia): port abstract-def / macro-de..."

greptile-apps · 2026-05-15T04:11:14Z

+  const binary = findChild(typeHead, 'binary_expression');
+  if (binary) {
+    // Walk into each side of the binary expression to find the base-name
+    // identifier — handles parameterized forms like `Vec{T} <: AbstractArray{T,1}`.
+    const sides: TreeSitterNode[] = [];
+    for (let i = 0; i < binary.childCount; i++) {
+      const c = binary.child(i);
+      if (c && c.type !== 'operator') sides.push(c);
+    }
+    nameNode = sides[0] ? findBaseName(sides[0]) : null;
+    supertypeNode = sides[1] ? findBaseName(sides[1]) : null;
+  } else {
+    nameNode = findBaseName(typeHead);
+  }


Missing test for non-parameterized struct inheritance

The old code explicitly looked for a subtype_expression node (findChild(typeHead, 'subtype_expression')) to detect Point <: AbstractPoint. That path has been entirely removed and replaced with a binary_expression lookup. If the tree-sitter-julia grammar represents simple non-parameterized inheritance with a subtype_expression node (rather than binary_expression), findChild(typeHead, 'binary_expression') returns null, findBaseName(typeHead) recurses without entering subtype_expression (not in TYPE_HEAD_WRAPPERS), and returns null — so the entire struct is silently dropped from ctx.definitions. The new parameterized test (Vec{T} <: AbstractArray{T,1}) confirms the grammar uses binary_expression for that form, but there is no test for the simple case struct Point <: AbstractPoint to verify the same grammar node is used and the extends relationship is still recorded.

Fixed in 8c2e148 — added a test for non-parameterized struct inheritance (struct Point <: AbstractPoint) in tests/parsers/julia.test.ts. Confirmed via AST inspection that the Julia grammar wraps both the simple and parameterized cases in a binary_expression node, so the new code path handles both correctly. The native engine already had this test (crates/codegraph-core/src/extractors/julia.rs:592), so this brings WASM to parity.
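
To make the claim in this reply concrete, the sketch below runs the name/supertype split over toy trees for both the simple and the parameterized head. It is only an illustration: `SketchNode`, `node`, and `splitTypeHead` are hypothetical, the tree shapes follow the AST description in this thread rather than real parser output, and `findBaseName` is a simplified copy of the quoted helper using the three-entry wrapper set.

```typescript
// Minimal stand-in for a tree-sitter node (hypothetical; not the real TreeSitterNode).
interface SketchNode {
  type: string;
  text: string;
  children: SketchNode[];
}

const node = (type: string, text: string, children: SketchNode[] = []): SketchNode => ({
  type,
  text,
  children,
});

const WRAPPERS: ReadonlySet<string> = new Set([
  'binary_expression',
  'parametrized_type_expression',
  'parameterized_identifier',
]);

// Simplified copy of the helper quoted in this review thread.
function findBaseName(n: SketchNode): SketchNode | null {
  if (n.type === 'identifier') return n;
  const direct = n.children.find((c) => c.type === 'identifier');
  if (direct) return direct;
  for (const child of n.children) {
    if (WRAPPERS.has(child.type)) {
      const found = findBaseName(child);
      if (found) return found;
    }
  }
  return null;
}

// Hypothetical wrapper around the name/supertype split shown in the diff above.
function splitTypeHead(typeHead: SketchNode): { name: string | null; supertype: string | null } {
  const binary = typeHead.children.find((c) => c.type === 'binary_expression');
  if (!binary) return { name: findBaseName(typeHead)?.text ?? null, supertype: null };
  const sides = binary.children.filter((c) => c.type !== 'operator');
  return {
    name: sides[0] ? findBaseName(sides[0])?.text ?? null : null,
    supertype: sides[1] ? findBaseName(sides[1])?.text ?? null : null,
  };
}

// Assumed shape of: struct Point <: AbstractPoint
const simpleHead = node('type_head', 'Point <: AbstractPoint', [
  node('binary_expression', 'Point <: AbstractPoint', [
    node('identifier', 'Point'),
    node('operator', '<:'),
    node('identifier', 'AbstractPoint'),
  ]),
]);

// Assumed shape of: struct Vec{T} <: AbstractArray{T,1}
const paramHead = node('type_head', 'Vec{T} <: AbstractArray{T,1}', [
  node('binary_expression', 'Vec{T} <: AbstractArray{T,1}', [
    node('parametrized_type_expression', 'Vec{T}', [
      node('identifier', 'Vec'),
      node('curly_expression', '{T}', [node('identifier', 'T')]),
    ]),
    node('operator', '<:'),
    node('parametrized_type_expression', 'AbstractArray{T,1}', [
      node('identifier', 'AbstractArray'),
      node('curly_expression', '{T,1}', [node('identifier', 'T')]),
    ]),
  ]),
]);

const simpleSplit = splitTypeHead(simpleHead);
const paramSplit = splitTypeHead(paramHead);
```

Both heads go through the same `binary_expression` branch, so the simple case needs no separate `subtype_expression` path under these assumptions.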

greptile-apps · 2026-05-15T04:11:15Z

+const TYPE_HEAD_WRAPPERS: ReadonlySet<string> = new Set([
+  'binary_expression',
+  'parametrized_type_expression',
+  'parameterized_identifier',
+  'type_parameter_list',
+  'type_argument_list',
+]);
+
+function findBaseName(node: TreeSitterNode): TreeSitterNode | null {
+  if (node.type === 'identifier') return node;
+  const direct = findChild(node, 'identifier');
+  if (direct) return direct;
+  for (let i = 0; i < node.childCount; i++) {
+    const child = node.child(i);
+    if (!child) continue;
+    if (TYPE_HEAD_WRAPPERS.has(child.type)) {
+      const found = findBaseName(child);
+      if (found) return found;
+    }
+  }
+  return null;
+}


type_parameter_list / type_argument_list in TYPE_HEAD_WRAPPERS can yield the wrong identifier

findBaseName checks findChild(node, 'identifier') before recursing, so in practice the struct name is found before the loop reaches a type_parameter_list or type_argument_list. However, if findBaseName is ever called with a node that lacks a direct identifier child and does have one of those wrapper types as a child — for example, a future call site or an unusual parameterized form — the function will recurse into type_parameter_list and return the first type-parameter identifier (e.g. T) instead of the struct name. Removing those two entries from TYPE_HEAD_WRAPPERS would eliminate the risk without affecting correctness.

Fixed in 8c2e148 — removed type_parameter_list and type_argument_list from TYPE_HEAD_WRAPPERS in both the WASM and native engines (preserving dual-engine parity per CLAUDE.md). AST inspection confirmed Julia's grammar uses curly_expression for {T} constructs, not those node kinds, so the entries were dead code. Removing them eliminates the risk of recursing into a type-parameter list and returning a type variable as the struct name, as you noted.
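
The risk Greptile describes is easy to see on a contrived node. The sketch below passes the wrapper set in as a parameter so the old and new sets can be compared side by side; that parameter, `SketchNode`, `node`, and the `oddHead` tree are all hypothetical and exist only for this illustration (the thread states that real Julia trees use `curly_expression`, so this shape should not occur in practice).

```typescript
// Minimal stand-in for a tree-sitter node (hypothetical; not the real TreeSitterNode).
interface SketchNode {
  type: string;
  text: string;
  children: SketchNode[];
}

const node = (type: string, text: string, children: SketchNode[] = []): SketchNode => ({
  type,
  text,
  children,
});

// Same recursion as the quoted helper, with the wrapper set made a parameter
// (an assumption for this sketch) so both sets can be exercised.
function findBaseName(n: SketchNode, wrappers: ReadonlySet<string>): SketchNode | null {
  if (n.type === 'identifier') return n;
  const direct = n.children.find((c) => c.type === 'identifier');
  if (direct) return direct;
  for (const child of n.children) {
    if (wrappers.has(child.type)) {
      const found = findBaseName(child, wrappers);
      if (found) return found;
    }
  }
  return null;
}

const oldWrappers: ReadonlySet<string> = new Set([
  'binary_expression',
  'parametrized_type_expression',
  'parameterized_identifier',
  'type_parameter_list',
  'type_argument_list',
]);

const newWrappers: ReadonlySet<string> = new Set([
  'binary_expression',
  'parametrized_type_expression',
  'parameterized_identifier',
]);

// Contrived node: no direct identifier child, only a type-parameter list.
const oddHead = node('type_head', '{T}', [
  node('type_parameter_list', '{T}', [node('identifier', 'T')]),
]);

const withOldSet = findBaseName(oddHead, oldWrappers)?.text ?? null;
const withNewSet = findBaseName(oddHead, newWrappers)?.text ?? null;
```

With the old set the type variable is returned as if it were the type name; with the trimmed set the lookup returns null and the caller can skip the definition instead of recording a wrong name.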

…ype test (#1128)

- Remove type_parameter_list / type_argument_list from TYPE_HEAD_WRAPPERS in both WASM and native engines. Julia grammar uses curly_expression for {T} constructs, so these were dead code. Removing them prevents findBaseName from ever recursing into a type-parameter list and returning a type variable (e.g. T) instead of the struct name.
- Add WASM test for non-parameterized struct inheritance (struct Point <: AbstractPoint). The native engine already covers this case; the WASM side now has parity.

carlos-alm · 2026-05-15T05:18:46Z

@greptileai

…lback (#1128)

- Strengthen the `import Base: show` test to assert the corrected source/names shape (was only checking that imports were emitted at all, so a regression to the broken pre-fix shape would have slipped through).
- Document the grammar assumption behind `signatureCall` / `signature_call` in both engines: the call_expression fallback exists only for defensive grammar-drift protection, not as a routine path. If it ever fires on a real definition, the function name will silently match the first body call_expression instead.

carlos-alm · 2026-05-15T05:26:41Z

Addressed Greptile's round-2 feedback in 92ce81b:

Issue 1 (signatureCall fallback may match body call): Documented the grammar assumption in both engines (WASM + native). The fallback to findChild(node, 'call_expression') exists only as defensive protection against grammar drift — if it ever fires on a real definition, callers must treat it as a parser/grammar mismatch worth investigating.
Issue 2 (existing selected_import test doesn't assert corrected source/names): Strengthened the import Base: show test to assert source: 'Base' and names: ['show'], with negative assertions that the names array does not contain 'Base'. The pre-fix broken shape would now fail this test.

Both Rust and TypeScript still in parity; all julia tests pass (11 WASM, 16 native).

carlos-alm · 2026-05-15T05:27:00Z

@greptileai

…1130)

* fix(julia): port abstract-def / macro-def / signature-call WASM bugs

The WASM Julia extractor diverged from the native Rust extractor in three ways that no existing WASM fixture exercised:

- handleAbstractDef: `findChild(node, 'identifier')` only looks at direct children of `abstract_definition`, but tree-sitter-julia nests the identifier inside `type_head`. Result: no abstract type was ever recorded. Fall back to `findBaseName(typeHead)` like the native code.
- handleMacroDef: `findChild(node, 'identifier')` resolves to the body's first identifier rather than the macro name (e.g. `macro mymac(x) x end` recorded `@x` instead of `@mymac`). Unwrap via `signatureCall` to reach the call_expression name.
- handleCall: the guard `parent.type === 'function_definition'` never matched, because the signature's call_expression is parented by `signature`, whose own parent is the function/macro definition. Result: every long-form `function greet(...) ... end` recorded `greet` as both a definition and a call. Match the native walk: skip when parent is `signature` and grandparent is `function_definition` or `macro_definition`.

Adds WASM tests mirroring the native cases: extracts_abstract_type, extracts_parameterized_abstract_type_base_name, extracts_macro_def, and does_not_record_function_signature_as_call.

Closes #1126

* fix(julia): align abstract-def name resolution with struct-def (#1130)
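
The parent/grandparent check described for handleCall can be sketched on toy trees with parent links. This is a hedged illustration only: `SketchNode`, `node`, `isSignatureCall`, and `collectCalls` are hypothetical names, and the tree shapes are inferred from the commit message rather than from real tree-sitter-julia output.

```typescript
// Minimal stand-in for a tree-sitter node with a parent link (hypothetical).
interface SketchNode {
  type: string;
  text: string;
  parent: SketchNode | null;
  children: SketchNode[];
}

function node(type: string, text: string, children: SketchNode[] = []): SketchNode {
  const n: SketchNode = { type, text, parent: null, children };
  for (const c of children) c.parent = n;
  return n;
}

// True when the call_expression is the name part of a function or macro signature.
function isSignatureCall(call: SketchNode): boolean {
  const parent = call.parent;
  if (!parent || parent.type !== 'signature') return false;
  const grand = parent.parent;
  return !!grand && (grand.type === 'function_definition' || grand.type === 'macro_definition');
}

// Collect callee names, skipping signature-parented call_expression nodes.
function collectCalls(root: SketchNode): string[] {
  const calls: string[] = [];
  const visit = (n: SketchNode): void => {
    if (n.type === 'call_expression' && !isSignatureCall(n)) {
      const name = n.children.find((c) => c.type === 'identifier');
      if (name) calls.push(name.text);
    }
    n.children.forEach(visit);
  };
  visit(root);
  return calls;
}

// Assumed shape of: function greet(name) println(name) end
const greetDef = node('function_definition', '', [
  node('signature', 'greet(name)', [
    node('call_expression', 'greet(name)', [node('identifier', 'greet')]),
  ]),
  node('call_expression', 'println(name)', [node('identifier', 'println')]),
]);

// Assumed shape of: macro mymac(x) x end
const macroDef = node('macro_definition', '', [
  node('signature', 'mymac(x)', [
    node('call_expression', 'mymac(x)', [node('identifier', 'mymac')]),
  ]),
  node('identifier', 'x'),
]);

const greetCalls = collectCalls(greetDef);
const macroCalls = collectCalls(macroDef);
```

Under these assumptions only the body call survives for `greet`, and the macro definition contributes no calls at all, which is the behavior the new WASM tests are described as checking.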

carlos-alm · 2026-05-15T15:27:32Z

CI status: 26/27 checks green. The single failure (Build x86_64-apple-darwin on macos-14) is an infrastructure flake where cargo resolves to rustup-init because the macos-14 runner image is now ARM-only — unrelated to this PR's diff and reproduces across re-runs with no code changes. Tracked as #1136 for a workflow fix.

greptile-apps Bot reviewed May 15, 2026

carlos-alm mentioned this pull request May 15, 2026

ci: macOS Native host build fails with 'cargo metadata: unexpected argument' (rust-cache) #1129

Open

carlos-alm mentioned this pull request May 15, 2026

fix(julia): port abstract-def / macro-def / signature-call WASM bugs #1130

Merged

5 tasks

carlos-alm mentioned this pull request May 15, 2026

ci: Build x86_64-apple-darwin sporadically fails — cargo resolves to rustup-init on macos-14 #1136

Open

carlos-alm wants to merge 4 commits into main from fix/1111-julia-wasm-extractor-bugs
