ProfilerKernelAggregate Struct
Aggregated timings for one (backend, kernel, stage, slot) tuple. More...
Declaration
Included Headers
Public Member Functions Index
| double | avg_ms () const |
|
Mean latency per invocation, in milliseconds. More... | |
Public Member Attributes Index
| std::string | backend |
|
Backend label ("MLA", "A65", ...). More... | |
| std::string | kernel_name |
|
Kernel name within the backend. More... | |
| std::string | stage_name |
|
Pipeline stage name. More... | |
| std::int32_t | physical_input_index = -1 |
|
Physical input index, -1 if N/A. More... | |
| std::int32_t | output_slot = -1 |
| std::uint64_t | count = 0 |
|
Number of invocations in the bucket. More... | |
| double | total_ms = 0.0 |
|
Total time across invocations (ms). More... | |
| double | min_ms = 0.0 |
|
Minimum single-invocation time (ms). More... | |
| double | max_ms = 0.0 |
|
Maximum single-invocation time (ms). More... | |
Description
Aggregated timings for one (backend, kernel, stage, slot) tuple.
Bucketed view over ProfilerKernelInvocation records: call count plus total/min/max latency in milliseconds. Use avg_ms() for the mean.
Definition at line 130 of file LatencyProfiler.h.
Public Member Attributes
backend
|
Backend label ("MLA", "A65", ...).
Definition at line 131 of file LatencyProfiler.h.
count
|
Number of invocations in the bucket.
Definition at line 136 of file LatencyProfiler.h.
kernel_name
|
Kernel name within the backend.
Definition at line 132 of file LatencyProfiler.h.
max_ms
|
Maximum single-invocation time (ms).
Definition at line 139 of file LatencyProfiler.h.
min_ms
|
Minimum single-invocation time (ms).
Definition at line 138 of file LatencyProfiler.h.
output_slot
|
Output slot, -1 if N/A.
Definition at line 135 of file LatencyProfiler.h.
physical_input_index
|
Physical input index, -1 if N/A.
Definition at line 134 of file LatencyProfiler.h.
stage_name
|
total_ms
|
Total time across invocations (ms).
Definition at line 137 of file LatencyProfiler.h.
The documentation for this struct was generated from the following file:
Generated via doxygen2docusaurus 2.0.0 by Doxygen 1.9.8.