5. Known Issues and Workarounds
- When a graph has multiple subgraphs, the dla_compiler performance estimator might estimate the performance of the combined graph incorrectly.
When multiple subgraphs exist, you can validate the performance calculation as follows:
- Gather the performance estimates for each subgraph (fps1, fps2, and so on), but ignore the estimates for any CPU subgraphs.
- Calculate the latency of the combined graph from the performance estimates for each subgraph:
- Calculate the performance of the combined graph:
-
The runtime for the PCIe-based design example does not compile on Red Hat* Enterprise Linux* 8 due to the presence of an old version of the gflags library. To resolve this issue:
- Run the following command:
sudo sed -i -e "s/DEFINE_uint32/DEFINE_int32/" \ $COREDLA_ROOT/runtime/dla_benchmark/dla_benchmark.hpp
- If you have $COREDLA_WORK defined, run the following command:
sed -i -e "s/DEFINE_uint32/DEFINE_int32/" \ $COREDLA_WORK/runtime/dla_benchmark/dla_benchmark.hpp
Note that Red Hat 8 does not support execution of either PCIe-based design example. This change enables only compilation of the runtime (and only if the necessary BSP is installed).
- Run the following command: