Skip to main content

Glossary

Terms and acronyms that appear across the framework's docs and code.

Framework concepts

TermDefinition
Neat (the framework)The library this documentation describes — a C++/Python framework for building, validating, and running GStreamer-based AI inference pipelines on Modalix.
NodeThe smallest building block of a pipeline. A typed wrapper over a GStreamer element. See include/builder/Node.h.
NodeGroupA bundle of Nodes that travels as a unit (e.g., a model's preprocess).
SessionThe assembly stage that turns Nodes into a runnable Run.
RunA live, running pipeline. Push samples in, pull samples out.
ModelThe simplified entry point for loading and running an MPK.
MPKModel Pack — a .tar.gz / .mpk archive bundling a compiled model and everything it needs at runtime.
SampleThe framework's unit of pipeline data — wraps a tensor or encoded media plus metadata.
TensorThe framework's typed view of a buffer of pixels / audio / inference results.
TensorBufferThe underlying memory backing a Tensor — carries the (buffer_id, paddr, vaddr) triple.
SegmentA named memory region with allocator + access metadata. Buffers come from segments.

Hardware

TermDefinition
ModalixSiMa's edge AI SoC platform. The framework targets it.
MLAMachine Learning Accelerator — the SoC's main inference engine.
MLASHMMLA Shared Memory — a low-latency memory region the MLA reads from.
EV74 / CVUCompute Vision Unit — a SIMD-friendly DSP for preprocess / postprocess kernels.
A65The application ARM cores running Linux.
APUAudio Processing Unit.
M4A Cortex-M4 used for low-level coordination (RPMsg, hardware sequencing).
TVMApache TVM — the framework's CPU-side fallback compiler for ops the MLA can't handle.
VCCMA SoC-internal coherent memory region used by some accelerators.
OCMOn-chip memory used by the MLA.
DMSDirect Memory Server — the SoC's memory controller / allocator service.
RPMsgRemote-processor messaging — the IPC channel between A65 and M4.
IOMMUI/O memory-management unit — maps physical to virtual for hardware.

Data formats

TermDefinition
BF16Brain Float 16 — IEEE 754 binary16 with full FP32 exponent range. The MLA's preferred float dtype.
NV12YUV 4:2:0 with Y plane + interleaved UV plane. The framework's default decoded-video format.
I420YUV 4:2:0 with separate Y, U, V planes.
HWC / CHWTensor layouts (Height-Width-Channels vs. Channels-Height-Width).
TessellationThe tile-shuffle that arranges a tensor into the geometry the MLA's input scratchpad expects. Pure layout — same bytes, different order.
QuantizationMapping FP32 values to INT8 with a scale and zero-point.

Operational

TermDefinition
Generic PreprocA preprocess upgrade that fuses arbitrary user-supplied transforms into the standard preprocess graph.
BoxDecodeA postprocess fusion that runs NMS / decode for detection models on the EV74.
DetectionMetaThe metadata struct attached to detection-model output samples by BoxDecode.
GstSimaMetaThe framework's GStreamer metadata struct, attached to every framework-managed buffer.
Route planThe framework's compile-time decision about which processor runs each stage and which segments hold each buffer.
Repro launch stringThe deterministic gst-launch text reproducer emitted by Session::describe().

See also