Releases: sdpython/onnx-diagnostic
0.7.14
- #257: patch to disable one exception in pytorch
- #256: extract subfolder from modelid//subfolder
- #252: adds new sets of inputs for task text-generation
- #250: add variables to track sequence nodes
- #249: patches _maybe_broadcast to support a corner case
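As an illustration of the `modelid//subfolder` convention mentioned in #256, here is a minimal sketch of how such an identifier could be split. The function name `split_model_id` and the placeholder identifiers are hypothetical and are not taken from the onnx-diagnostic code base; the actual implementation may differ.

```python
from typing import Optional, Tuple


def split_model_id(model_id: str) -> Tuple[str, Optional[str]]:
    """Split 'repo//subfolder' into (repo, subfolder).

    Hypothetical helper: returns (model_id, None) when no '//' is present
    or when nothing follows the separator.
    """
    repo, sep, subfolder = model_id.partition("//")
    return repo, (subfolder if sep and subfolder else None)


# Placeholder identifiers, not real model names.
print(split_model_id("org/model//subfolder"))
print(split_model_id("org/model"))
```

Splitting on the first `//` keeps the single `/` between organization and model name untouched.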
0.7.13
- #247: supports more gemma models with ModelBuilder
- #246: add a set of inputs checking that models work with an empty cache on task text-generation
- #237: dummy inputs for google/gemma-3-4b-it (task image-text-to-text)
- #244: add a patch to bypass the exception raised when the dynamic dimension is in {0,1}
0.7.12
- #232: fixes --patch argument so that --patch=0 works
- #231: better statistics about fusions
- #227: better support for model_id//pretrained, adds speed up when running command validate
- #226: fix input order for models created with modelbuilder
0.7.11
- #224: support model_id with // to specify a subfolder
- #223: adds task image-to-video
- #220: adds option --ort-logs to display onnxruntime logs when creating the session
- #220: adds a patch for PR huggingface/transformers#40791 in transformers
0.7.10
- #218: patches sdpa_mask_recent_torch used from _vmap_for_bhqkv
0.7.9
- #214: fix modelbuilder export
- #213: use DYNAMIC on batch size
0.7.8
- #210: add utilities to investigate models
- #208: add a patch for Qwen3 (rewrite a loop)
0.7.7
- #205: add in_channels in image_text_to_text
- #204: switch default num_hidden_layers to 4
- #203: Add option to disable patches for torch in command line validate
- #202: add models DeepseekV3ForCausalLM, Gemma3ForCausalLM, Glm4vMoeForConditionalGeneration
- #201: switch CI to 4.55.4
- #200: fixes patches for 4.55.1+: DynamicCache is no longer registered by default; this code moved to executorch.py in transformers
- #199: delete hidden_size and num_attention_heads modification in a config
- #198: support gpt-oss
- #197: updates CI for torch 2.8
- #196: implements a patch to rewrite a loop in modeling_qwen2_vl.VisionAttention
0.7.6
- #193: validates with 4.53.3
- #189: support for task mask-generation
- #192: add support for Gemma-3, add serialization for HybridCache, changes to support transformers>=4.54
0.7.5
- #186: add parameter --output_names to command line validate to change the output names of the onnx exported model
- #185: remove the use of _seen_tokens in DynamicCache (removed in transformers>4.53), updates dummy inputs for feature-extraction
- #184: implements side-by-side
0.7.4
- #178: add a patch for eager_mask to handle assert len(flat_dynamic_shapes) == num_placeholders - num_lifted_inputs
- #177: changes for the next version of onnx, fixes all_dynamic_shape_from_inputs
0.7.3
- #173: fixes function to_any for BaseModelOutput
0.7.2
- #170: patches LlamaRotaryEmbedding
- #168, #169: introduces patch_diffusers
- #166: improves handling of StaticCache
- #165: support for task text-to-image
- #162: improves graph rendering for historical data