2023-02-10 02:18:07 +01:00
|
|
|
---
|
|
|
|
layout: default
|
|
|
|
title: Building Graphs in C++
|
|
|
|
parent: Graphs
|
|
|
|
nav_order: 1
|
|
|
|
---
|
|
|
|
|
|
|
|
# Building Graphs in C++
|
|
|
|
{: .no_toc }
|
|
|
|
|
|
|
|
1. TOC
|
|
|
|
{:toc}
|
|
|
|
---
|
|
|
|
|
|
|
|
## C++ Graph Builder
|
|
|
|
|
|
|
|
C++ graph builder is a powerful tool for:
|
|
|
|
|
|
|
|
* Building complex graphs
|
2023-02-11 00:52:37 +01:00
|
|
|
* Parametrizing graphs (e.g. setting a delegate on `InferenceCalculator`,
|
|
|
|
enabling/disabling parts of the graph)
|
2023-02-10 02:18:07 +01:00
|
|
|
* Deduplicating graphs (e.g. instead of CPU and GPU dedicated graphs in pbtxt
|
|
|
|
you can have a single code that constructs required graphs, sharing as much
|
|
|
|
as possible)
|
|
|
|
* Supporting optional graph inputs/outputs
|
|
|
|
* Customizing graphs per platform
|
|
|
|
|
|
|
|
### Basic Usage
|
|
|
|
|
|
|
|
Let's see how C++ graph builder can be used for a simple graph:
|
|
|
|
|
|
|
|
```proto
|
|
|
|
// Graph inputs.
|
|
|
|
input_stream: "input_tensors"
|
|
|
|
input_side_packet: "model"
|
|
|
|
|
|
|
|
// Graph outputs.
|
|
|
|
output_stream: "output_tensors"
|
|
|
|
|
|
|
|
node {
|
|
|
|
calculator: "InferenceCalculator"
|
|
|
|
input_stream: "TENSORS:input_tensors"
|
|
|
|
input_side_packet: "MODEL:model"
|
|
|
|
output_stream: "TENSORS:output_tensors"
|
|
|
|
node_options: {
|
|
|
|
[type.googleapis.com/mediapipe.InferenceCalculatorOptions] {
|
|
|
|
# Requesting GPU delegate.
|
|
|
|
delegate { gpu {} }
|
|
|
|
}
|
|
|
|
}
|
|
|
|
}
|
|
|
|
```
|
|
|
|
|
|
|
|
Function to build the above `CalculatorGraphConfig` may look like:
|
|
|
|
|
|
|
|
```c++
|
|
|
|
CalculatorGraphConfig BuildGraph() {
|
|
|
|
Graph graph;
|
|
|
|
|
|
|
|
// Graph inputs.
|
|
|
|
Stream<std::vector<Tensor>> input_tensors =
|
|
|
|
graph.In(0).SetName("input_tensors").Cast<std::vector<Tensor>>();
|
|
|
|
SidePacket<TfLiteModelPtr> model =
|
|
|
|
graph.SideIn(0).SetName("model").Cast<TfLiteModelPtr>();
|
|
|
|
|
|
|
|
auto& inference_node = graph.AddNode("InferenceCalculator");
|
|
|
|
auto& inference_opts =
|
|
|
|
inference_node.GetOptions<InferenceCalculatorOptions>();
|
|
|
|
// Requesting GPU delegate.
|
|
|
|
inference_opts.mutable_delegate()->mutable_gpu();
|
|
|
|
input_tensors.ConnectTo(inference_node.In("TENSORS"));
|
|
|
|
model.ConnectTo(inference_node.SideIn("MODEL"));
|
|
|
|
Stream<std::vector<Tensor>> output_tensors =
|
|
|
|
inference_node.Out("TENSORS").Cast<std::vector<Tensor>>();
|
|
|
|
|
|
|
|
// Graph outputs.
|
|
|
|
output_tensors.SetName("output_tensors").ConnectTo(graph.Out(0));
|
|
|
|
|
|
|
|
// Get `CalculatorGraphConfig` to pass it into `CalculatorGraph`
|
|
|
|
return graph.GetConfig();
|
|
|
|
}
|
|
|
|
```
|
|
|
|
|
|
|
|
Short summary:
|
|
|
|
|
|
|
|
* Use `Graph::In/SideIn` to get graph inputs as `Stream/SidePacket`
|
|
|
|
* Use `Node::Out/SideOut` to get node outputs as `Stream/SidePacket`
|
2023-02-11 00:52:37 +01:00
|
|
|
* Use `Stream/SidePacket::ConnectTo` to connect streams and side packets to
|
|
|
|
node inputs (`Node::In/SideIn`) and graph outputs (`Graph::Out/SideOut`)
|
2023-02-10 02:18:07 +01:00
|
|
|
* There's a "shortcut" operator `>>` that you can use instead of
|
|
|
|
`ConnectTo` function (E.g. `x >> node.In("IN")`).
|
2023-02-11 00:52:37 +01:00
|
|
|
* `Stream/SidePacket::Cast` is used to cast stream or side packet of `AnyType`
|
|
|
|
(E.g. `Stream<AnyType> in = graph.In(0);`) to a particular type
|
|
|
|
* Using actual types instead of `AnyType` sets you on a better path for
|
|
|
|
unleashing graph builder capabilities and improving your graphs
|
|
|
|
readability.
|
2023-02-10 02:18:07 +01:00
|
|
|
|
2023-02-11 00:52:37 +01:00
|
|
|
### Advanced Usage
|
2023-02-10 02:18:07 +01:00
|
|
|
|
2023-02-11 00:52:37 +01:00
|
|
|
#### Utility Functions
|
|
|
|
|
|
|
|
Let's extract inference construction code into a dedicated utility function to
|
|
|
|
help for readability and code reuse:
|
|
|
|
|
|
|
|
```c++
|
|
|
|
// Updates graph to run inference.
|
|
|
|
Stream<std::vector<Tensor>> RunInference(
|
|
|
|
Stream<std::vector<Tensor>> tensors, SidePacket<TfLiteModelPtr> model,
|
|
|
|
const InferenceCalculatorOptions::Delegate& delegate, Graph& graph) {
|
|
|
|
auto& inference_node = graph.AddNode("InferenceCalculator");
|
|
|
|
auto& inference_opts =
|
|
|
|
inference_node.GetOptions<InferenceCalculatorOptions>();
|
|
|
|
*inference_opts.mutable_delegate() = delegate;
|
|
|
|
tensors.ConnectTo(inference_node.In("TENSORS"));
|
|
|
|
model.ConnectTo(inference_node.SideIn("MODEL"));
|
|
|
|
return inference_node.Out("TENSORS").Cast<std::vector<Tensor>>();
|
|
|
|
}
|
|
|
|
|
|
|
|
CalculatorGraphConfig BuildGraph() {
|
|
|
|
Graph graph;
|
|
|
|
|
|
|
|
// Graph inputs.
|
|
|
|
Stream<std::vector<Tensor>> input_tensors =
|
|
|
|
graph.In(0).SetName("input_tensors").Cast<std::vector<Tensor>>();
|
|
|
|
SidePacket<TfLiteModelPtr> model =
|
|
|
|
graph.SideIn(0).SetName("model").Cast<TfLiteModelPtr>();
|
|
|
|
|
|
|
|
InferenceCalculatorOptions::Delegate delegate;
|
|
|
|
delegate.mutable_gpu();
|
|
|
|
Stream<std::vector<Tensor>> output_tensors =
|
|
|
|
RunInference(input_tensors, model, delegate, graph);
|
|
|
|
|
|
|
|
// Graph outputs.
|
|
|
|
output_tensors.SetName("output_tensors").ConnectTo(graph.Out(0));
|
|
|
|
|
|
|
|
return graph.GetConfig();
|
|
|
|
}
|
|
|
|
```
|
|
|
|
|
|
|
|
As a result, `RunInference` provides a clear interface stating what are the
|
|
|
|
inputs/outputs and their types.
|
|
|
|
|
|
|
|
It can be easily reused, e.g. it's only a few lines if you want to run an extra
|
|
|
|
model inference:
|
|
|
|
|
|
|
|
```c++
|
|
|
|
// Run first inference.
|
|
|
|
Stream<std::vector<Tensor>> output_tensors =
|
|
|
|
RunInference(input_tensors, model, delegate, graph);
|
|
|
|
// Run second inference on the output of the first one.
|
|
|
|
Stream<std::vector<Tensor>> extra_output_tensors =
|
|
|
|
RunInference(output_tensors, extra_model, delegate, graph);
|
|
|
|
```
|
|
|
|
|
|
|
|
And you don't need to duplicate names and tags (`InferenceCalculator`,
|
|
|
|
`TENSORS`, `MODEL`) or introduce dedicated constants here and there - those
|
|
|
|
details are localized to `RunInference` function.
|
|
|
|
|
|
|
|
Tip: extracting `RunInference` and similar functions to dedicated units (e.g.
|
|
|
|
inference.h/cc which depends on the inference calculator) enables reuse in
|
|
|
|
graphs construction code and helps automatically pull in calculator dependencies
|
|
|
|
(e.g. no need to manually add `:inference_calculator` dep, just let your IDE
|
|
|
|
include `inference.h` and build cleaner pull in corresponding dependency).
|