Skip to content

Releases: sdpython/onnx-diagnostic

0.7.14

10 Oct 16:58
fde0173
Compare
Choose a tag to compare
  • #257: patch to disable one exception in pytorch
  • #256: extract subfolder from modelid//subfolder
  • #252: adds new sets of inputs for task texgt-generation
  • #250: add variables to track sequence nodes
  • #249: patches _maybe_broadcast to support a corner case

0.7.13

03 Oct 15:58
d8e0dd8
Compare
Choose a tag to compare
  • #247: supports more gemma models with ModelBuilder
  • #246: add a set of inputs checking models works for an empty cache on task text-generation
  • #237: dummy inputs for google/gemma-3-4b-it (task image-text-to-text)
  • #244: add a patch to bypass the exception raised when the dynamic dimension is in {0,1}

0.7.12

26 Sep 16:48
4866853
Compare
Choose a tag to compare
  • #232: fixes --patch argument so that --patch=0 works
  • #231: better statistics about fusions
  • #227: better support for model_id//pretrained, adds speed up when running command validate
  • #226: fix input order for models created with modelbuilder

0.7.11

19 Sep 17:25
c7afba2
Compare
Choose a tag to compare

0.7.11

  • #224: support model_id with // to specify a subfolder
  • #223: adds task image-to-video
  • #220: adds option --ort-logs to display onnxruntime logs when creating the session
  • #220: adds a patch for PR #40791 huggingface/transformers#40791_ in transformers

0.7.10

  • #218: patches used sdpa_mask_recent_torch used from _vmap_for_bhqkv

0.7.9

  • #214: fix modelbuilder export
  • #213: use DYNAMIC on batch size

0.7.8

  • #210: add utilities to investigate models
  • #208: add a patch for Qwen3 (rewrite a loop)

0.7.7

27 Aug 15:17
a51448e
Compare
Choose a tag to compare
  • #205: add in_channels in image_text_to_text
  • #204: switch default num_hidden_layers to 4
  • #203: Add option to disable patches for torch in command line validate
  • #202: add models DeepseekV3ForCausalLM, Gemma3ForCausalLM, Glm4vMoeForConditionalGeneration
  • #201: switch CI to 4.55.4
  • #200: fixes patches for 4.55.1+, DynamicCache is no longer registered by default, this code moved to executorch.py in transformers
  • #199: delete hidden_size and num_attention_heads modification in a config
  • #198: support gpt-oss
  • #197: updates CI for torch 2.8
  • #196: implements a patch to rewrite a loop in modeling_qwen2_vl.VisionAttention

0.7.6

25 Jul 17:01
1158eca
Compare
Choose a tag to compare
  • #193: validates with 4.53.3
  • #189: support for task mask-generation
  • #192: add support for Gemma-3, add serialization for HybridCache, changes to support transformers>=4.54

0.7.5

11 Jul 18:25
d705153
Compare
Choose a tag to compare
  • #186: add parameter --output_names to command line validate to change the output names of the onnx exported model
  • #185: remove the use of _seen_tokens in DynamicCache (removed in transformers>4.53),
    updates dummpy inputs for feature-extraction
  • #184: implements side-by-side

0.7.4

04 Jul 16:38
375a622
Compare
Choose a tag to compare
  • #178: add a patch for eager_mask to handle assert len(flat_dynamic_shapes) == num_placeholders - num_lifted_inputs
  • #177: changes for the next version of onnx, fixes all_dynamic_shape_from_inputs

0.7.3

02 Jul 14:01
f7dd78e
Compare
Choose a tag to compare
  • #173: fixes function to_any for BaseModelOutput

0.7.2

30 Jun 09:18
8592cd6
Compare
Choose a tag to compare
  • #170: patches LlamaRotaryEmbedding
  • #168, #169: introduces patch_diffusers
  • #166: improves handling of StaticCache
  • #165: support for task text-to-image
  • #162: improves graphs rendering for historical data