# Configuration classes for ONNX exports

Exporting a model to ONNX involves specifying:
1. The input names.
2. The output names.
3. The dynamic axes. These refer to the input dimensions can be changed dynamically at runtime (e.g. a batch size or sequence length).
All other axes will be treated as static, and hence fixed at runtime.
4. Dummy inputs to trace the model. This is needed in PyTorch to record the computational graph and convert it to ONNX.

Since this data depends on the choice of model and task, we represent it in terms of _configuration classes_. Each configuration class is associated with
a specific model architecture, and follows the naming convention `ArchitectureNameOnnxConfig`. For instance, the configuration which specifies the ONNX
export of BERT models is `BertOnnxConfig`.

Since many architectures share similar properties for their ONNX configuration, 🤗 Optimum adopts a 3-level class hierarchy:
1. Abstract and generic base classes. These handle all the fundamental features, while being agnostic to the modality (text, image, audio, etc).
2. Middle-end classes. These are aware of the modality, but multiple can exist for the same modality depending on the inputs they support.
They specify which input generators should be used for the dummy inputs, but remain model-agnostic.
3. Model-specific classes like the `BertOnnxConfig` mentioned above. These are the ones actually used to export models.

## Base classes[[optimum.exporters.onnx.OnnxConfig]]

#### optimum.exporters.onnx.OnnxConfig[[optimum.exporters.onnx.OnnxConfig]]

[Source](https://github.com/huggingface/optimum-onnx/blob/main/optimum/exporters/onnx/base.py#L92)

inputsoptimum.exporters.onnx.OnnxConfig.inputshttps://github.com/huggingface/optimum-onnx/blob/main/optimum/exporters/base.py#L164[]`Dict[str, Dict[int, str]]`A mapping of each input name to a mapping of axis position to the axes symbolic name.

Dict containing the axis definition of the input tensors to provide to the model.

**Returns:**

``Dict[str, Dict[int, str]]``

A mapping of each input name to a mapping of axis position to the axes symbolic name.
#### outputs[[optimum.exporters.onnx.OnnxConfig.outputs]]

[Source](https://github.com/huggingface/optimum-onnx/blob/main/optimum/exporters/base.py#L175)

Dict containing the axis definition of the output tensors to provide to the model.

**Returns:**

``Dict[str, Dict[int, str]]``

A mapping of each output name to a mapping of axis position to the axes symbolic name.
#### generate_dummy_inputs[[optimum.exporters.onnx.OnnxConfig.generate_dummy_inputs]]

[Source](https://github.com/huggingface/optimum-onnx/blob/main/optimum/exporters/base.py#L223)

Generates the dummy inputs necessary for tracing the model. If not explicitly specified, default input shapes are used.

**Parameters:**

framework (`str`, defaults to `"pt"`) : The framework for which to create the dummy inputs.

batch_size (`int`, defaults to 2) : The batch size to use in the dummy inputs.

sequence_length (`int`, defaults to 16) : The sequence length to use in the dummy inputs.

num_choices (`int`, defaults to 4) : The number of candidate answers provided for multiple choice task.

image_width (`int`, defaults to 64) : The width to use in the dummy inputs for vision tasks.

image_height (`int`, defaults to 64) : The height to use in the dummy inputs for vision tasks.

num_channels (`int`, defaults to 3) : The number of channels to use in the dummpy inputs for vision tasks.

feature_size (`int`, defaults to 80) : The number of features to use in the dummpy inputs for audio tasks in case it is not raw audio. This is for example the number of STFT bins or MEL bins.

nb_max_frames (`int`, defaults to 3000) : The number of frames to use in the dummpy inputs for audio tasks in case the input is not raw audio.

audio_sequence_length (`int`, defaults to 16000) : The number of frames to use in the dummpy inputs for audio tasks in case the input is raw audio.

**Returns:**

``Dict[str, [tf.Tensor, torch.Tensor]]``

A dictionary mapping the input names to dummy tensors in the proper framework format.

#### optimum.exporters.onnx.OnnxConfigWithPast[[optimum.exporters.onnx.OnnxConfigWithPast]]

[Source](https://github.com/huggingface/optimum-onnx/blob/main/optimum/exporters/onnx/base.py#L426)

Inherits from [OnnxConfig](/docs/optimum/main/en/onnx/package_reference/configuration#optimum.exporters.onnx.OnnxConfig). A base class to handle the ONNX configuration of decoder-only models.

add_past_key_valuesoptimum.exporters.onnx.OnnxConfigWithPast.add_past_key_valueshttps://github.com/huggingface/optimum-onnx/blob/main/optimum/exporters/onnx/base.py#L550[{"name": "inputs_or_outputs", "val": ": dict[str, dict[int, str]]"}, {"name": "direction", "val": ": str"}]- **inputs_or_outputs** (`Dict[str, Dict[int, str]]`) --
  The mapping to fill.
- **direction** (`str`) --
  either "inputs" or "outputs", it specifies whether `input_or_outputs` is the input mapping or the
  output mapping, this is important for axes naming.0
Fills `input_or_outputs` mapping with past_key_values dynamic axes considering the direction.

**Parameters:**

inputs_or_outputs (`Dict[str, Dict[int, str]]`) : The mapping to fill.

direction (`str`) : either "inputs" or "outputs", it specifies whether `input_or_outputs` is the input mapping or the output mapping, this is important for axes naming.

#### optimum.exporters.onnx.OnnxSeq2SeqConfigWithPast[[optimum.exporters.onnx.OnnxSeq2SeqConfigWithPast]]

[Source](https://github.com/huggingface/optimum-onnx/blob/main/optimum/exporters/onnx/base.py#L622)

Inherits from [OnnxConfigWithPast](/docs/optimum/main/en/onnx/package_reference/configuration#optimum.exporters.onnx.OnnxConfigWithPast). A base class to handle the ONNX configuration of encoder-decoder models.

with_behavioroptimum.exporters.onnx.OnnxSeq2SeqConfigWithPast.with_behaviorhttps://github.com/huggingface/optimum-onnx/blob/main/optimum/exporters/onnx/base.py#L653[{"name": "behavior", "val": ": str | ConfigBehavior"}, {"name": "use_past", "val": ": bool = False"}, {"name": "use_past_in_inputs", "val": ": bool = False"}]- **behavior** (`ConfigBehavior`) --
  The behavior to use for the new instance.
- **use_past** (`bool`, defaults to `False`) --
  Whether or not the ONNX config to instantiate is for a model using KV cache.
- **use_past_in_inputs** (`bool`, defaults to `False`) --
  Whether the KV cache is to be passed as an input to the ONNX.0`OnnxSeq2SeqConfigWithPast`
Creates a copy of the current OnnxConfig but with a different `ConfigBehavior` and `use_past` value.

**Parameters:**

behavior (`ConfigBehavior`) : The behavior to use for the new instance.

use_past (`bool`, defaults to `False`) : Whether or not the ONNX config to instantiate is for a model using KV cache.

use_past_in_inputs (`bool`, defaults to `False`) : Whether the KV cache is to be passed as an input to the ONNX.

**Returns:**

``OnnxSeq2SeqConfigWithPast``

## Middle-end classes

### Text[[optimum.exporters.onnx.TextEncoderOnnxConfig]]

#### optimum.exporters.onnx.TextEncoderOnnxConfig[[optimum.exporters.onnx.TextEncoderOnnxConfig]]

[Source](https://github.com/huggingface/optimum-onnx/blob/main/optimum/exporters/onnx/config.py#L47)

Handles encoder-based text architectures.

#### optimum.exporters.onnx.TextDecoderOnnxConfig[[optimum.exporters.onnx.TextDecoderOnnxConfig]]

[Source](https://github.com/huggingface/optimum-onnx/blob/main/optimum/exporters/onnx/config.py#L53)

Handles decoder-based text architectures.

#### optimum.exporters.onnx.TextSeq2SeqOnnxConfig[[optimum.exporters.onnx.TextSeq2SeqOnnxConfig]]

[Source](https://github.com/huggingface/optimum-onnx/blob/main/optimum/exporters/onnx/config.py#L157)

Handles encoder-decoder-based text architectures.

### Vision[[optimum.exporters.onnx.config.VisionOnnxConfig]]

#### optimum.exporters.onnx.config.VisionOnnxConfig[[optimum.exporters.onnx.config.VisionOnnxConfig]]

[Source](https://github.com/huggingface/optimum-onnx/blob/main/optimum/exporters/onnx/config.py#L206)

Handles vision architectures.

### Multi-modal[[optimum.exporters.onnx.config.TextAndVisionOnnxConfig]]

#### optimum.exporters.onnx.config.TextAndVisionOnnxConfig[[optimum.exporters.onnx.config.TextAndVisionOnnxConfig]]

[Source](https://github.com/huggingface/optimum-onnx/blob/main/optimum/exporters/onnx/config.py#L212)

Handles multi-modal text and vision architectures.