Releases · sdpython/onnx-diagnostic

#290: adds one prompt for text2text-generation
#289: adds command line options --exppo to give the exporter additional options
#287: adds input 'inputs_prompt' to test a LLM, meant to be used during validation
#288: add .contiguous in torch.cond branch (attention patch for sdpa implementation)
#286: adds variable to track random nodes in models

#283: fix historical aggregation when multiple input sets are used
#282: add tools to understand better which functions were patched
#280: fixes patches for sdpa_attention_forward for different version of transformers
#278: implements onnx_generate_with_genai
#277: changes the serialization for all caches to reorder the model outputs (key_1, value_1, key_2, ...)
#276: implements onnx_generate which implements method generate for an onnx model,
#275: fixes function patched_vmap

#273: enables export with FakeTensor
#272: makes patches work with FakeTensor
#270: add export sample code to export a specific model id with the appropriate inputs
#269: adds one unit test to track a patch fixing broadcast output shape
#267: patches sdpa_attention_forward because of a control flow (transformers>=5.0)
#266: makes patch_torch an integer in torch_export_patches to enable more patches

#247: supports more gemma models with ModelBuilder
#246: add a set of inputs checking models works for an empty cache on task text-generation
#237: dummy inputs for google/gemma-3-4b-it (task image-text-to-text)
#244: add a patch to bypass the exception raised when the dynamic dimension is in {0,1}

#232: fixes --patch argument so that --patch=0 works
#231: better statistics about fusions
#227: better support for model_id//pretrained, adds speed up when running command validate
#226: fix input order for models created with modelbuilder

0.7.11

#224: support model_id with // to specify a subfolder
#223: adds task image-to-video
#220: adds option --ort-logs to display onnxruntime logs when creating the session
#220: adds a patch for PR #40791 huggingface/transformers#40791_ in transformers

#205: add in_channels in image_text_to_text
#204: switch default num_hidden_layers to 4
#203: Add option to disable patches for torch in command line validate
#202: add models DeepseekV3ForCausalLM, Gemma3ForCausalLM, Glm4vMoeForConditionalGeneration
#201: switch CI to 4.55.4
#200: fixes patches for 4.55.1+, DynamicCache is no longer registered by default, this code moved to executorch.py in transformers
#199: delete hidden_size and num_attention_heads modification in a config
#198: support gpt-oss
#197: updates CI for torch 2.8
#196: implements a patch to rewrite a loop in modeling_qwen2_vl.VisionAttention

#193: validates with 4.53.3
#189: support for task mask-generation
#192: add support for Gemma-3, add serialization for HybridCache, changes to support transformers>=4.54