Hi,

I am migrating a library ([jaxDecomp](https://github.com/DifferentiableUniverseInitiative/jaxDecomp)) from the legacy `infer_sharding_from_operands` callback to the new `sharding_rule` API compatible with Shardy.
My library implements distributed FFTs. The critical logic relies on inspecting the input sharding (specifically the `PartitionSpec` of the input) to decide:

- Which algorithm to use (e.g., slab decomposition vs. pencil decomposition).
- What the output sharding will look like (the algorithm effectively rotates the sharding axes).
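To make the "rotation" concrete, here is a toy model of the mapping described above. Plain tuples stand in for `PartitionSpec`, and the specific mapping is an assumption inferred from the examples later in this post (Z-sharded input → Y-sharded output, X-sharded input → Z-sharded output); it is not jaxDecomp's actual code.

```python
# Toy illustration of the sharding-axis "rotation": each mesh axis moves
# one array dimension to the left (cyclically). Tuples stand in for
# PartitionSpec so this runs without JAX.

def get_output_spec(in_spec):
    """Return the output spec with every mesh axis shifted one dim left."""
    n = len(in_spec)
    out = [None] * n
    for dim, axis in enumerate(in_spec):
        if axis is not None:
            out[(dim - 1) % n] = axis
    return tuple(out)

# Input sharded on Z (dim 2) -> output sharded on Y (dim 1):
print(get_output_spec((None, None, "z")))  # (None, 'z', None)
# Input sharded on X (dim 0) -> output sharded on Z (dim 2):
print(get_output_spec(("x", None, None)))  # (None, None, 'x')
```

The point is that the input/output dimension correspondence is not fixed: which output dim a mesh axis lands on depends on which input dim it started on.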
### The "Old" Way (Working)
Previously, `infer_sharding_from_operands` provided `arg_infos` populated with `NamedSharding`. I could inspect the input spec at compile time to dynamically determine the output spec.
```python
@spmd_fft_primitive.def_infer_sharding
def infer_sharding_from_operands(mesh, arg_infos, result_infos):
    # 1. Access the input sharding
    input_sharding = arg_infos[0].sharding
    spec = input_sharding.spec
    # 2. Logic: depending on which axis is sharded, the output spec changes.
    # e.g., if input is sharded on Z, output must be sharded on Y (Slab XY algo)
    # e.g., if input is sharded on X, output must be sharded on Z (Slab YZ algo)
    pencil_type = get_pencil_type(spec)
    transposed_specs = get_output_specs(pencil_type, spec)
    return NamedSharding(mesh, P(*transposed_specs))
```
### The "New" Way (The Problem)
In the new API, `sharding_rule` is required for Shardy propagation. However, the `arg_infos` passed to this callback only contain ranked shapes (`ShapeDtypeStruct`), not the sharding.
```python
@spmd_fft_primitive.def_sharding_rule
def fft_sharding_rule_producer(mesh, arg_infos, result_infos):
    # arg_infos[0] is just a ShapeDtypeStruct.
    # I cannot access .sharding to check which axis is distributed!
    # I need to return an einsum-like rule or SdyShardingRule here, but I don't
    # know which rule to return because I don't know the input layout.
    # If I return a generic "i j k -> i j k", it fails because my custom op
    # inherently performs a global transpose (reshuffle) that changes the sharding.
    return ???
```
### The Question
My operation is polymorphic: the relationship between input and output dimensions depends entirely on how the input is currently distributed.
If `sharding_rule` is supposed to be purely declarative (agnostic of the input sharding), how should we handle ops where the propagation rule itself depends on the input layout?
Is the recommended pattern to:

1. Resolve the sharding eagerly in Python (outside the primitive), determine the "mode" (slab/pencil), and pass that mode as a static argument to the primitive?
2. Or is there a way to define an `SdyShardingRule` that can express "if input dim 2 is sharded, output dim 1 becomes sharded"?

Thanks!
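For reference, option 1 might look roughly like the sketch below. Every name in it (`get_pencil_type`, the bind stub) is hypothetical and stands in for jaxDecomp internals; tuples stand in for `PartitionSpec` so the sketch runs without JAX.

```python
# Hedged sketch of option 1: read the layout eagerly in the Python wrapper
# and pass the resulting mode as a *static* argument to the primitive.
# All helper names are placeholders, not jaxDecomp's or JAX's real API.

def get_pencil_type(spec):
    """Classify the decomposition from which dims carry a mesh axis."""
    sharded_dims = [i for i, axis in enumerate(spec) if axis is not None]
    if len(sharded_dims) >= 2:
        return "PENCIL"
    if sharded_dims == [2]:
        return "SLAB_XY"   # sharded on Z -> XY-slab algorithm
    if sharded_dims == [0]:
        return "SLAB_YZ"   # sharded on X -> YZ-slab algorithm
    return "REPLICATED"

def spmd_fft_primitive_bind(x, *, mode):
    # Stub standing in for primitive.bind(x, mode=mode); echoes the mode.
    return mode

def fft(x, spec):
    # `spec` stands in for x.sharding.spec, inspected outside the primitive.
    # With `mode` fixed statically, a purely declarative sharding_rule
    # (a fixed einsum-like string) can be registered per mode.
    mode = get_pencil_type(spec)
    return spmd_fft_primitive_bind(x, mode=mode)
```

The design idea is that once the mode is a static parameter, each mode corresponds to a fixed input/output dimension mapping, which is the shape-only information Shardy's `sharding_rule` can express.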