Hi, yes, you can and should include the NMS post-processing in your HEF. It's the recommended approach. When you parse your ONNX model, cut the graph just before the NMS/decode layers by passing end_node_names to the parser. Then tell the DFC to re-attach an optimized NMS implementation with the nms_postprocess command in your .alls model script. The NMS stage is compiled into the HEF, so HailoRT runs it automatically at runtime and returns the final detections.
For example, see the official YOLOv8s .alls script in the Hailo Model Zoo. Also see this relevant community thread: "Not Sure If I Properly Converted Yolo 8 Model".
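In case it helps, here is a rough sketch of that flow using the DFC Python API (ClientRunner). Treat it as a starting point under assumptions, not a verified recipe: the end node names are the ones typically seen in a stock Ultralytics YOLOv8 ONNX export and should be checked against your own model (e.g. in Netron), the NMS config path, the "yolov8s" model name and the output file name are placeholders, and the exact method arguments can differ between DFC versions.

```python
from pathlib import Path

# ASSUMPTION: typical bbox/class conv outputs of a stock Ultralytics YOLOv8 export.
# Verify these names against your own ONNX graph before using them.
END_NODES = [
    "/model.22/cv2.0/cv2.0.2/Conv",
    "/model.22/cv3.0/cv3.0.2/Conv",
    "/model.22/cv2.1/cv2.1.2/Conv",
    "/model.22/cv3.1/cv3.1.2/Conv",
    "/model.22/cv2.2/cv2.2.2/Conv",
    "/model.22/cv3.2/cv3.2.2/Conv",
]


def build_model_script(nms_config_path, meta_arch="yolov8", engine="cpu"):
    """Return the .alls line that asks the DFC to re-attach NMS to the model."""
    return f'nms_postprocess("{nms_config_path}", meta_arch={meta_arch}, engine={engine})\n'


def compile_with_nms(onnx_path, calib_data, nms_config_path, hw_arch="hailo8"):
    """Parse, optimize and compile an ONNX model into a HEF that includes NMS."""
    # Imported lazily so the helper above can be used without the DFC installed.
    from hailo_sdk_client import ClientRunner

    runner = ClientRunner(hw_arch=hw_arch)
    # Cut the graph before the original decode/NMS layers.
    runner.translate_onnx_model(onnx_path, "yolov8s", end_node_names=END_NODES)
    # Re-attach the optimized NMS through the model script.
    runner.load_model_script(build_model_script(nms_config_path))
    runner.optimize(calib_data)  # calibration images as a NumPy array
    hef = runner.compile()
    Path("yolov8s.hef").write_bytes(hef)


if __name__ == "__main__":
    # Only prints the generated script line; no DFC installation needed for this.
    print(build_model_script("yolov8s_nms_config.json"), end="")
```

Calling compile_with_nms(...) needs the Dataflow Compiler installed plus a calibration set; the real Model Zoo script also contains other commands (such as input normalization), so compare against it before compiling.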
Thanks,