CastTess.h File
CastTess Node: fused CVU kernel that casts FP32→BF16 then tessellates.
Included Headers
#include "builder/Node.h"
#include "builder/NodeContractConfigurable.h"
#include "builder/NodeContractProvider.h"
#include <nlohmann/json.hpp>
#include <memory>
#include <optional>
#include <string>
#include <vector>
Namespaces Index
| namespace | simaai |
| namespace | neat |
| namespace | nodes |
Classes Index
| struct | CastTessOptions |
| class | CastTess | Fused CVU kernel Node: Cast (FP32→BF16) followed by Tess. |
Description
CastTess Node — fused CVU kernel that casts FP32→BF16 then tessellates.
Combines a BF16 cast and a tessellate layout shuffle into a single CVU pass. The route planner inserts this node before the MLA when the model expects BF16 input and tessellation is not part of the compiled MLA kernel. Fusing the two stages saves a DDR round-trip compared with running Cast and Tess back to back.
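The idea of fusing the two stages can be illustrated on the host with a minimal sketch. It is not the CVU kernel and not part of the NEAT API: the function names `fp32_to_bf16` and `cast_tess_fused` are hypothetical, the cast uses the common round-to-nearest-even conversion, and the tile layout (row-major tiles, row-major elements inside each tile, dimensions divisible by the tile size) is an assumption made only for illustration. The point it shows is that each FP32 element is read once, converted, and written directly to its tiled position, so no intermediate BF16 buffer is needed.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <stdexcept>
#include <vector>

// Hypothetical helper: convert one FP32 value to BF16 (the upper 16 bits of
// the FP32 encoding) using round-to-nearest-even. NaN inputs are quieted so
// that rounding cannot carry them into an infinity encoding.
inline std::uint16_t fp32_to_bf16(float value) {
    std::uint32_t bits;
    std::memcpy(&bits, &value, sizeof(bits));
    if ((bits & 0x7FFFFFFFu) > 0x7F800000u) {  // NaN
        return static_cast<std::uint16_t>((bits >> 16) | 0x0040u);
    }
    const std::uint32_t rounding_bias = 0x7FFFu + ((bits >> 16) & 1u);
    return static_cast<std::uint16_t>((bits + rounding_bias) >> 16);
}

// Hypothetical fused pass: cast and tile in one loop nest.
// Assumed layout (illustrative only): tiles are emitted in row-major tile
// order, and elements inside each tile are emitted in row-major order.
std::vector<std::uint16_t> cast_tess_fused(const std::vector<float>& src,
                                           std::size_t rows, std::size_t cols,
                                           std::size_t tile_rows,
                                           std::size_t tile_cols) {
    if (tile_rows == 0 || tile_cols == 0 || rows % tile_rows != 0 ||
        cols % tile_cols != 0 || src.size() != rows * cols) {
        throw std::invalid_argument("shape must be a whole number of tiles");
    }
    std::vector<std::uint16_t> dst(rows * cols);
    std::size_t out = 0;
    for (std::size_t tr = 0; tr < rows; tr += tile_rows) {
        for (std::size_t tc = 0; tc < cols; tc += tile_cols) {
            for (std::size_t r = 0; r < tile_rows; ++r) {
                for (std::size_t c = 0; c < tile_cols; ++c) {
                    // Single read of the FP32 source, single write of the
                    // BF16 result at its tiled position.
                    dst[out++] = fp32_to_bf16(src[(tr + r) * cols + (tc + c)]);
                }
            }
        }
    }
    return dst;
}
```

An unfused version would first write a full BF16 copy of the tensor and then read it back to reorder it; on real hardware that intermediate copy is the extra DDR round-trip the description refers to.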
- See Also
"The dtype contract" page in /concepts/dtype_contract
File Listing
The file content with the documentation metadata removed is:
namespace simaai::neat {
struct CastTessOptions {
  CastTessOptions() = default;
  explicit CastTessOptions(const simaai::neat::Model& model);
  std::string config_path;
  std::string element_name;
  int num_buffers = 0;
  int num_buffers_model = 0;
  bool num_buffers_locked = false;
class CastTess final : public Node, public NodeContractProvider, public NodeContractConfigurable {
  explicit CastTess(CastTessOptions opt = {});
  NodeCapsBehavior caps_behavior() const override {
  NodeContractDefinition contract_definition() const override;
  bool compile_node_contract(const ContractCompileInput& input, CompiledNodeContract* out,
  void apply_compiled_contract(const CompiledNodeContract& contract, std::string* err) override;
  std::string backend_fragment(int node_index) const override;
  std::vector<std::string> element_names(int node_index) const override;
  const nlohmann::json* config_json() const;
  const CastTessOptions& options() const {
  const std::string& config_path() const {
  CastTessOptions opt_;
namespace simaai::neat::nodes {
Generated via doxygen2docusaurus 2.0.0 by Doxygen 1.9.8.