|
Joint-Detection-and-Embedding paradigm achieves fast tracking by simultaneously learning detection and Re-ID features. However, it still faces performance degradation in complex scenes and the misalignment between detection and Re-ID features. In this paper, we propose a decoupling module based on channel-wise attention mechanism to obtain task-aligned features served for different demands of detection and Re-ID. To improve the performance of data association, we fuse motion, location, appearance information and perform a two-round matching for high and low confidence detections respectively by the Motion-GIoU matrix and the Embedding-GIoU matrix. Additionally, we apply the camera motion compensation to get a more accurate motion estimation, resulting in a more robust tracking in the scenes of camera motion and low-frame-rate. Extensive experiments show that our proposed method outperforms a wide range of existing methods on the MOTChallenge and HiEvE datasets. |
|
Keywords:Multi-Object Tracking, Joint-Detection-and-Embedding, channel-wise attention, association matrix |
|