intelligence.acuirt.dataclasses
AcuiRTBaseConversionConfig Objects
@dataclass
class AcuiRTBaseConversionConfig()
Base configuration for AcuiRT model conversion.
Attributes:
rt_modeOptional[str] - The runtime mode for conversion (e.g., 'onnx', 'torch2trt').autobool - Whether to enable automatic conversion.childrenOptional[Dict[str, "AcuiRTBaseConversionConfig"]] - Configuration for child modules.input_shapesOptional[tuple] - Input shapes for the model.input_argsOptional[tuple] - Input arguments for the model.
AcuiRTBaseConversionReport Objects
@dataclass
class AcuiRTBaseConversionReport()
Base report for AcuiRT model conversion.
Attributes:
rt_modeOptional[str] - The runtime mode used for conversion (e.g., 'onnx', 'torch2trt').childrenOptional[Dict[str, "AcuiRTBaseConversionReport"]] - Reports for child modules.input_shapesOptional[tuple] - Input shapes used during conversion.input_argsOptional[tuple] - Input arguments used during conversion.class_nameOptional[Type[nn.Module]] - Class name of the converted module.statusstr - Status of the conversion ('success', 'failed', etc.).errorOptional[str] - Error message if the conversion failed.tracebackOptional[str] - Traceback information if an error occurred.
AcuiRTONNXConversionInfo Objects
@dataclass
class AcuiRTONNXConversionInfo()
ONNX conversion specific information.
Attributes:
int8bool - Whether INT8 quantization was used.fp16bool - Whether FP16 precision was used.input_namesOptional[List[str]] - Names of the model inputs. dynamic_axes (Optional[Dict[str, Dict[int, Union[str, Tuple[str, int, int]]]]): Dynamic axes information for ONNX export.
AcuiRTONNXConversionConfig Objects
@dataclass
class AcuiRTONNXConversionConfig(AcuiRTONNXConversionInfo,
AcuiRTBaseConversionConfig)
Configuration for ONNX model conversion. Inherits from AcuiRTONNXConversionInfo and AcuiRTBaseConversionConfig.
AcuiRTONNXConversionReport Objects
@dataclass
class AcuiRTONNXConversionReport(AcuiRTONNXConversionInfo,
AcuiRTBaseConversionReport)
Report for ONNX model conversion. Inherits from AcuiRTONNXConversionInfo and AcuiRTBaseConversionReport.
AcuiRTTorch2TRTConversionInfo Objects
@dataclass
class AcuiRTTorch2TRTConversionInfo()
Torch2TRT conversion specific information.
Attributes:
int8bool - Whether INT8 quantization was used.fp16bool - Whether FP16 precision was used.use_dlabool - Whether DLA was used for acceleration.
AcuiRTTorch2TRTConversionConfig Objects
@dataclass
class AcuiRTTorch2TRTConversionConfig(AcuiRTTorch2TRTConversionInfo,
AcuiRTBaseConversionConfig)
Configuration for Torch2TRT model conversion. Inherits from AcuiRTTorch2TRTConversionInfo and AcuiRTBaseConversionConfig.
AcuiRTTorch2TRTConversionReport Objects
@dataclass
class AcuiRTTorch2TRTConversionReport(AcuiRTTorch2TRTConversionInfo,
AcuiRTBaseConversionReport)
Report for Torch2TRT model conversion. Inherits from AcuiRTTorch2TRTConversionInfo and AcuiRTBaseConversionReport.
AcuiRTPerformanceReport Objects
@dataclass
class AcuiRTPerformanceReport()
Report containing performance metrics.
Attributes:
accuracyDict[str, Real] - Dictionary of accuracy metrics.latencynp.timedelta64 - Measured latency.
from_dict
def from_dict(input: dict)
Converts a dictionary to an AcuiRTBaseConversionReport or AcuiRTBaseConversionConfig object.
Arguments:
inputdict - Input dictionary to convert.
Returns:
Union[AcuiRTBaseConversionReport, AcuiRTBaseConversionConfig]: Converted object based on the input dictionary.
AcuiRTWorkFlowReport Objects
@dataclass
class AcuiRTWorkFlowReport()
Report for the AcuiRT conversion workflow.
Attributes:
performanceAcuiRTPerformanceReport - Performance report of the converted model.non_converted_performanceOptional[AcuiRTPerformanceReport] - Performance report of the original model.conversion_resultAcuiRTBaseConversionReport - Conversion report.profiletorch.profiler.profile - Profiling information.conversion_rateTuple[int, int] - Conversion rate of the model. Both converted and total include non-leaf nn.Module (converted, total)num_modulesint - Number of modules in the model.