UpdatedSeptember 5, 2025

Cast

Description

The operator casts the elements of a given input tensor to a data type specified by the ‘to’ argument and returns an output tensor of the same size in the converted type. The ‘to’ argument must be one of the data types specified in the ‘DataType’ enum field in the TensorProto message.

Casting from string tensor in plain (e.g., “3.14” and “1000”) and scientific numeric representations (e.g., “1e-5” and “1E8”) to float types is supported. For example, converting string “100.5” to an integer may yield result 100. There are some string literals reserved for special floating-point values; “+INF” (and “INF”), “-INF”, and “NaN” are positive infinity, negative infinity, and not-a-number, respectively. Any string which can exactly match “+INF” in a case-insensitive way would be mapped to positive infinite. Similarly, this case-insensitive rule is applied to “INF” and “NaN”. When casting from numeric tensors to string tensors, plain floating-point representation (such as “314.15926”) would be used. Converting non-numerical-literal string such as “Hello World!” is an undefined behavior. Cases of converting string representing floating-point arithmetic value, such as “2.718”, to INT is an undefined behavior.

Conversion from a numerical type to any numerical type is always allowed. User must be aware of precision loss and value change caused by range difference between two types. For example, a 64-bit float 3.1415926459 may be round to a 32-bit float 3.141592. Similarly, converting an integer 36 to Boolean may produce 1 because we truncate bits which can’t be stored in the targeted type.

In more detail, the conversion among numerical types should follow these rules if the destination type is not a float 8 type.

Casting from floating point to:
- floating point: +/- infinity if OOR (out of range).
- fixed point: undefined if OOR.
- bool: +/- 0.0 to False; all else to True.
Casting from fixed point to:
- floating point: +/- infinity if OOR. (+ infinity in the case of uint)
- fixed point: when OOR, discard higher bits and reinterpret (with respect to two’s complement representation for signed types). For example, 200 (int16) -> -56 (int8).
- bool: zero to False; nonzero to True.
Casting from bool to:
- floating point: {1.0, 0.0}.
- fixed point: {1, 0}.
- bool: no change.

Float 8 type were introduced to speed up the training of deep models. By default the conversion of a float x obeys to the following rules. [x] means the value rounded to the target mantissa width.

x	E4M3FN	E4M3FNUZ	E5M2	E5M2FNUZ
0	0	0	0	0
-0	-0	0	-0	0
NaN	NaN	NaN	NaN	NaN
Inf	FLT_MAX	NaN	FLT_MAX	NaN
-Inf	-FLT_MAX	NaN	-FLT_MAX	NaN
[x] > FLT_MAX	FLT_MAX	FLT_MAX	FLT_MAX	FLT_MAX
[x] < -FLT_MAX	-FLT_MAX	-FLT_MAX	-FLT_MAX	-FLT_MAX
else	RNE	RNE	RNE	RNE

The behavior changes if the parameter ‘saturate’ is set to False. The rules then become :

x	E4M3FN	E4M3FNUZ	E5M2	E5M2FNUZ
0	0	0	0	0
-0	-0	0	-0	0
NaN	NaN	NaN	NaN	NaN
-NaN	-NaN	NaN	-NaN	NaN
Inf	NaN	NaN	Inf	NaN
-Inf	-NaN	NaN	-Inf	NaN
[x] > FLT_MAX	NaN	NaN	Inf	NaN
[x] < -FLT_MAX	NaN	NaN	-Inf	NaN
else	RNE	RNE	RNE	RNE

Input parameters

specified_outputs_name : array, this parameter lets you manually assign custom names to the output tensors of a node.
input (heterogeneous) – T1 : object, input tensor to be cast.

Parameters : cluster,

saturate : boolean, the parameter defines how the conversion behaves if an input value is out of range of the destination type. It only applies for float 8 conversion (float8e4m3fn, float8e4m3fnuz, float8e5m2, float8e5m2fnuz). All cases are fully described in two tables inserted in the operator description.
Default value “True”.
to : enum, the data type to which the elements of the input tensor are cast. Strictly must be one of the types from DataType enum in TensorProto.
Default value “UNDEFINED”.
training? : boolean, whether the layer is in training mode (can store data for backward).
Default value “True”.
lda coeff : float, defines the coefficient by which the loss derivative will be multiplied before being sent to the previous layer (since during the backward run we go backwards).
Default value “1”.

name (optional) : string, name of the node.

Output parameters

output (heterogeneous) – T2 : object, output tensor with the same shape as input with type specified by the ‘to’ argument.

Type Constraints

T1 in (tensor(bfloat16), tensor(bool), tensor(double), tensor(float), tensor(float16), tensor(float8e4m3fn),
tensor(float8e4m3fnuz), tensor(float8e5m2), tensor(float8e5m2fnuz), tensor(int16), tensor(int32), tensor(int64), tensor(int8), tensor(string), tensor(uint16), tensor(uint32), tensor(uint64), tensor(uint8)) : Constrain input types. Casting from complex is not supported.

T2 in (tensor(bfloat16), tensor(bool), tensor(double), tensor(float), tensor(float16), tensor(float8e4m3fn),
tensor(float8e4m3fnuz), tensor(float8e5m2), tensor(float8e5m2fnuz), tensor(int16), tensor(int32), tensor(int64), tensor(int8), tensor(string), tensor(uint16), tensor(uint32), tensor(uint64), tensor(uint8)) : Constrain output types. Casting to complex is not supported.

Example

All these exemples are snippets PNG, you can drop these Snippet onto the block diagram and get the depicted code added to your VI (Do not forget to install Deep Learning library to run it).

Quick start

Installation guide

Execution providers

General

Iconography

API

Architecture

Layers

Nodes

Nodes

Activation

Mono Input

Parameters

Graph Function

Graph

File

Get & Set

Runtime

Create

Inference

Training

Academic Training

Exec

Inference

Input

Reinforcement Learning

Advanced

Add Weight

Index

Name

Format Weight

Get Weight

Index

Name

Set Weight

More

Layers parameters

Nodes Parameters