Journal of Chinese Computer Systems.
2025, 46(7):
1652-1658.
In view of the problem that during the traffic sign detection process,driving too fast or external environmental factors such as lighting cause traffic signs to deform and blur,resulting in reduced detection accuracy and speed.The article proposes an improved RT-DETR detection algorithm for traffic signs.First,DCNv2 deformable convolution was introduced into the backbone network of the algorithm and the MCCA attention mechanism was designed,thereby proposing the DCNv2att deformable attention convolution block,which effectively generated the offset and adaptively adjusted the target object according to the shape change.Select the sampling position and capture the detailed information of the object boundary more accurately; then,to address the periodic defects existing in the position encoding method of the original algorithm Transformer encoder,a learnable position encoding is adopted,and the model can autonomously learn and adapt to the input sequence Characteristic position information representation,thereby improving the performance of the model; finally,the original GIoU loss function is replaced by the MPDIoU loss function to directly minimize the distance between the upper left and lower right points between the predicted bounding box and the ground truth bounding box,easing the problem when predicting When the bounding box is completely covered by the ground truth bounding box,the loss function cannot optimize the position and size of the anchor box.Compared with the original RT-DETR algorithm on the TT100K data set,the improved algorithm has a comprehensive average accuracy increased by 3.0 percentage points,the calculation amount has been reduced by 18%,and the FPS has been increased by 12 frames,while being lightweight.The detection accuracy and speed are improved,which is significantly better than similar comparison algorithms,and can be practically used in traffic sign detection scenarios.