Hi, I am trying to fine-tune a YOLOX model with a single class and want to use a (binary) cross-entropy loss for the objectness-score and classification-logit outputs. I've tried setting the ce loss type in the post_quantization_optimization(finetune, ...) instruction, but when I do, I get a warning that I should use a binary_cross_entropy loss instead for outputs with shapes such as 1x80x80x1. However, there is no such option available (I am using DFC v3.28.0), and when I use ce the distillation loss of these layers is just 0. Am I doing something wrong?
Hey @stwerner,
The Hailo DFC currently doesn't support binary_cross_entropy as a loss-function option in post_quantization_optimization(finetune, ...). The only supported loss types are:
{ce, l2, l2rel, cosine}
where ce means categorical cross-entropy. That works for multi-class classification, but not for binary classification with a single logit per anchor (as in YOLOX).
If you use loss_types=['ce'] for binary outputs, you may get a zero distillation loss due to:
- Incompatibility between the softmax and sigmoid interpretations of the logits (illustrated in the sketch after this list)
- Mismatched label formatting
- A missing activation before the logits
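To see how the first point can yield a loss of exactly zero, here is a minimal NumPy sketch. It illustrates the math only (not DFC internals) and assumes the ce distillation loss applies a softmax across the last (class) axis: for a 1x80x80x1 output that axis has size 1, so the softmax is constantly 1 and the cross-entropy term vanishes.

```python
import numpy as np

# Illustrative sketch only, not DFC code: assumes the ce distillation loss
# takes a softmax over the last (class) axis of the layer output.
def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def ce_distillation(student_logits, teacher_logits, axis=-1):
    p_teacher = softmax(teacher_logits, axis=axis)
    log_p_student = np.log(softmax(student_logits, axis=axis))
    return -(p_teacher * log_p_student).sum(axis=axis).mean()

# YOLOX-style single-class objectness/classification output: 1x80x80x1.
# The class axis has size 1, so the softmax is 1.0 everywhere and the
# cross-entropy is identically 0, matching the zero loss observed above.
student = np.random.randn(1, 80, 80, 1).astype(np.float32)
teacher = np.random.randn(1, 80, 80, 1).astype(np.float32)
print(ce_distillation(student, teacher))  # -> 0.0 regardless of the logits
```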
Recommended Solutions
Since binary_cross_entropy isn't supported:
- Restructure your output to work with the ce loss (convert the binary head into a multi-class output with 2 classes; see the sketch after the example below)
- Use l2 or l2rel loss as an alternative:
post_quantization_optimization(
    finetune,
    loss_types=['l2', 'l2', 'l2'],
    loss_layer_names=['obj_head', 'cls_head', 'bbox_head'],
    policy='enabled',
    dataset_size=2048,
    learning_rate=0.0002,
    epochs=8
)
This approach is a practical workaround when the classification loss can't be applied because of architecture-specific output shapes. Note that the loss_layer_names above are placeholders; use the actual output-layer names from your parsed model.
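For the first option (restructuring to two classes), the underlying identity is that a single sigmoid logit z is equivalent to a softmax over the two logits [0, z]. The NumPy sketch below adds a hypothetical constant-zero "background" channel to the head to show that equivalence; whether inserting such a channel is practical depends on your model and export pipeline, so treat this as an illustration of why a 1x80x80x2 output makes ce behave like a binary cross-entropy.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

# Single objectness/class logit per anchor: 1x80x80x1.
obj_logit = np.random.randn(1, 80, 80, 1).astype(np.float32)

# Hypothetical restructuring: concatenate a constant-zero "background" logit
# so the head becomes 1x80x80x2. The softmax over the two logits reproduces
# the original sigmoid probability, so categorical ce acts like a binary CE.
two_class = np.concatenate([np.zeros_like(obj_logit), obj_logit], axis=-1)

p_sigmoid = sigmoid(obj_logit)[..., 0]
p_softmax = softmax(two_class, axis=-1)[..., 1]
print(np.allclose(p_sigmoid, p_softmax))  # -> True
```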
Hi @omria, thanks for the response. It would be great to also have an option for binary_cross_entropy!
Thanks for your suggestion. I’ll share this with our R&D team so they can determine the priority level.
Appreciate your input!