Conversation

@TedThemistokleous
Collaborator

Description

Motivation and Context

Updated 32 log statements in the compute function from LOGS_DEFAULT(VERBOSE)
to LOGS_DEFAULT(INFO) for better visibility of compute-related operations
during inference.
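For reference, the change is just a severity bump of this shape (the message text here is illustrative, not one of the actual statements):

```cpp
// Before: only shows up when verbose logging is enabled
LOGS_DEFAULT(VERBOSE) << "MIGraphX EP: binding input parameters for compute";

// After: visible at the default INFO severity during inference
LOGS_DEFAULT(INFO) << "MIGraphX EP: binding input parameters for compute";
```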
Use this call from the previously generated bits to encapsulate model compilation and the parameters needed to ensure an MIGraphX program is properly compiled and set up.
Cleans up the compute thread and keeps it clear which inputs and pieces will be run through the MIGraphX API.
Remove this from compute so we can encapsulate things in a reasonable way.
Make this a separate call that takes the input context, program, and parameter shape/name information, so we can populate the items needed, based on the MIGraphX program, to perform a run_async later.
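Roughly the shape of that helper, sketched against the MIGraphX C++ API the EP already uses; the name `PopulateProgramParameters` and the `input_buffers` map are illustrative, not the actual signature:

```cpp
#include <string>
#include <unordered_map>

#include <migraphx/migraphx.hpp>

// Hypothetical helper: bind caller-owned input buffers to the program's
// parameter names so the compute thread only has to call run_async later.
static migraphx::program_parameters PopulateProgramParameters(
    migraphx::program& prog,
    const std::unordered_map<std::string, void*>& input_buffers) {
  migraphx::program_parameters params;
  auto param_shapes = prog.get_parameter_shapes();
  for (auto&& name : param_shapes.names()) {
    auto it = input_buffers.find(name);
    if (it != input_buffers.end()) {
      // Wrap the existing buffer in an argument matching the expected shape.
      params.add(name, migraphx::argument(param_shapes[name], it->second));
    }
  }
  return params;
}
```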

Doing this as part of cleaning up the compute function to further optimize later
Capture this in a separate call so we get an idea of how input shapes and the modes are handled during compute.
Reuse this and remove a bunch of redundant, repeated code.
Store a compiled, or preloaded-from-disk, MIGraphX program into a map indexed by batch size. Use this as the program in the compute method if an incoming batch size matches one we intended to run.

If this lookup fails, fall back to the program preloaded from disk, and if that fails, compile the model in the compute thread.
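A sketch of that lookup-with-fallback flow; the cache member and the two helper declarations are placeholders for the EP's existing load/compile paths, not real names:

```cpp
#include <cstdint>
#include <optional>
#include <unordered_map>

#include <migraphx/migraphx.hpp>

// Hypothetical cache: one compiled/preloaded program per batch size.
std::unordered_map<int64_t, migraphx::program> program_cache;

// Placeholders for the EP's existing disk-preload and in-thread compile paths.
std::optional<migraphx::program> TryLoadProgramFromDisk(int64_t batch);
migraphx::program CompileProgramInComputeThread(int64_t batch);

migraphx::program& GetProgramForBatch(int64_t batch) {
  // 1. Fast path: a program for this batch size is already cached.
  if (auto it = program_cache.find(batch); it != program_cache.end()) {
    return it->second;
  }
  // 2. Fall back to a program preloaded from disk, if one exists.
  if (auto loaded = TryLoadProgramFromDisk(batch)) {
    return program_cache.emplace(batch, std::move(*loaded)).first->second;
  }
  // 3. Last resort: compile the model inside the compute thread.
  return program_cache.emplace(batch, CompileProgramInComputeThread(batch)).first->second;
}
```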
@TedThemistokleous self-assigned this Jan 3, 2026
- Tighten the lock around run_async
- Remove the O(n) lookup with find and use an unordered_set instead
- Use optional to help tighten up the lock
Should improve runtime from O(N²) to O(N) for running through the outputs and checking them, as sketched below
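Illustrative shape of the change (the struct and function names are made up for the sketch): the lock only covers the shared-state snapshot, output membership checks go through an unordered_set, and an optional carries the result out of the critical section.

```cpp
#include <mutex>
#include <optional>
#include <string>
#include <unordered_set>
#include <vector>

struct RunState {
  std::mutex mtx;
  std::unordered_set<std::string> graph_outputs;  // O(1) membership vs. a linear find
};

void RunOnce(RunState& state, const std::vector<std::string>& requested_outputs) {
  std::optional<std::vector<std::string>> outputs_to_bind;
  {
    // Hold the lock only long enough to snapshot what the async run needs.
    std::lock_guard<std::mutex> lock(state.mtx);
    std::vector<std::string> names;
    for (const auto& name : requested_outputs) {
      if (state.graph_outputs.count(name) != 0) {
        names.push_back(name);
      }
    }
    outputs_to_bind = std::move(names);
  }
  // run_async and output binding happen here, outside the critical section,
  // using *outputs_to_bind.
}
```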
Do this so that we can manage updates between inferences for things like dynamic batch or sequence length more effectively. Right now we coarsely recompile based on any mismatch and always check input shapes. In these cases, instead of checking all N inputs, we should check the symbolic dimensions for updates.
Move this out into a separate call to ensure we're tracking whether we get a dynamic batch size, as well as other symbolic dimensions in the model that we detect at compile time.
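A sketch of the per-inference check under those assumptions (types and names are hypothetical): record the symbolic dimensions once at compile, then only diff those between runs instead of comparing all N input shapes.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>
#include <unordered_map>
#include <vector>

// Recorded once at compile time: which input carries a symbolic dim
// (batch, sequence length, ...) and which dimension index it occupies.
struct SymbolicDimInfo {
  std::string input_name;
  std::size_t dim_index;
};

// Returns true when any tracked symbolic dimension changed since the last run.
bool SymbolicDimsChanged(
    const std::vector<SymbolicDimInfo>& tracked,
    const std::unordered_map<std::string, std::vector<int64_t>>& current_shapes,
    std::unordered_map<std::string, std::vector<int64_t>>& last_shapes) {
  bool changed = false;
  for (const auto& dim : tracked) {
    const auto& cur = current_shapes.at(dim.input_name);
    auto& last = last_shapes[dim.input_name];
    if (last.size() != cur.size() || last[dim.dim_index] != cur[dim.dim_index]) {
      changed = true;
      last = cur;  // remember the new shape for the next inference
    }
  }
  return changed;
}
```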
This is used everywhere in the EP and should be encapsulated to ensure we're consistent across all our compiles and load attempts
…tions

Clean up the compile() call so it's clear we're just setting up an initial set of options for a model before we perform the first inference in compute. Use this to link program input names to initial shapes, regardless of whether they're dynamic or static.
Used to get initial info about the model before we pass things to the compute thread.
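Roughly what that initial linking could look like, sketched against the MIGraphX C++ API (the helper name is hypothetical):

```cpp
#include <cstddef>
#include <string>
#include <unordered_map>
#include <vector>

#include <migraphx/migraphx.hpp>

// Hypothetical helper: capture each program input's name and its shape at
// compile time so compute can later diff against it, whether the original
// ONNX dims were static or dynamic.
std::unordered_map<std::string, std::vector<std::size_t>>
CollectInitialInputShapes(migraphx::program& prog) {
  std::unordered_map<std::string, std::vector<std::size_t>> initial_shapes;
  auto param_shapes = prog.get_parameter_shapes();
  for (auto&& name : param_shapes.names()) {
    initial_shapes[name] = param_shapes[name].lengths();
  }
  return initial_shapes;
}
```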
Make things const and don't look up the same map entry three times.
Further reduces API overhead and gets us closer to performance parity with the migraphx driver.
Don't need the overhead of a mutex, as in most cases there will be multiple instances of the run_async call.
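Shape of the lookup cleanup (names invented for the sketch): a single find() whose result is reused through a const reference, rather than hashing the same key several times.

```cpp
#include <string>
#include <unordered_map>

struct SessionState {  // stand-in for the EP's per-model state
  int placeholder;
};

void UseEntryOnce(const std::unordered_map<std::string, SessionState>& sessions,
                  const std::string& key) {
  const auto it = sessions.find(key);  // single lookup instead of count()/at()/at()
  if (it == sessions.end()) {
    return;
  }
  const SessionState& state = it->second;
  (void)state;  // the rest of the function works off this const reference
}
```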
…compute func

Ensure we can maintain each run more effectively and cleanly. Allows us to add to or change these if needed.
Refactor what's needed here so that we can pass in the updated input parameter shapes and update the compile options.