{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,19]],"date-time":"2026-03-19T03:26:12Z","timestamp":1773890772170,"version":"3.50.1"},"reference-count":64,"publisher":"Association for Computing Machinery (ACM)","issue":"2s","license":[{"start":{"date-parts":[[2021,5,18]],"date-time":"2021-05-18T00:00:00Z","timestamp":1621296000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/linproxy.fan.workers.dev:443\/https\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2021,6,21]]},"abstract":"<jats:p>In this article, we detect and track visual objects by using Siamese network or twin neural network. The Siamese network is constructed to classify moving objects based on the associations of object detection network and object tracking network, which are thought of as the two branches of the twin neural network. The proposed tracking method was designed for single-target tracking, which implements multitarget tracking by using deep neural networks and object detection. The contributions of this article are stated as follows. First, we implement the proposed method for visual object tracking based on multiclass classification using deep neural networks. Then, we attain multitarget tracking by combining the object detection network and the single-target tracking network. Next, we uplift the tracking performance by fusing the outcomes of the object detection network and object tracking network. Finally, we speculate on the object occlusion problem based on IoU and similarity score, which effectively diminish the influence of this issue in multitarget tracking.<\/jats:p>","DOI":"10.1145\/3441656","type":"journal-article","created":{"date-parts":[[2021,5,18]],"date-time":"2021-05-18T14:43:16Z","timestamp":1621348996000},"page":"1-16","update-policy":"https:\/\/linproxy.fan.workers.dev:443\/https\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":40,"title":["Multitarget Tracking Using Siamese Neural Networks"],"prefix":"10.1145","volume":"17","author":[{"given":"Na","family":"An","sequence":"first","affiliation":[{"name":"Auckland University of Technology, CBD Auckland, New Zealand"}]},{"given":"Wei","family":"Qi Yan","sequence":"additional","affiliation":[{"name":"Auckland University of Technology, CBD Auckland, New Zealand"}]}],"member":"320","published-online":{"date-parts":[[2021,5,18]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Proceedings of IEEE CVPR. 2544\u20132550","author":"Bolme D. S.","unstructured":"D. S. Bolme , J. R. Beveridge , B. A. Draper , and Y. M. Lui . 2010. Visual object tracking using adaptive correlation filters . In Proceedings of IEEE CVPR. 2544\u20132550 . D. S. Bolme, J. R. Beveridge, B. A. Draper, and Y. M. Lui. 2010. Visual object tracking using adaptive correlation filters. In Proceedings of IEEE CVPR. 2544\u20132550."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2014.2345390"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.490"},{"key":"e_1_2_1_4_1","volume-title":"Proceedings of ECCV. 21\u201337","author":"Liu W.","unstructured":"W. Liu , D. Anguelov , D. Erhan , C. Szegedy , S. Reed , C. Y. Fu , and A. C. Berg . 2016. SSD: Single shot multibox detector . In Proceedings of ECCV. 21\u201337 . W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C. Y. Fu, and A. C. Berg. 2016. SSD: Single shot multibox detector. In Proceedings of ECCV. 21\u201337."},{"key":"e_1_2_1_5_1","volume-title":"Proceedings of IEEE CVPR. 8971\u20138980","author":"Li B.","unstructured":"B. Li , J. Yan , W. Wu , Z. Zhu , and X. Hu . 2018. High performance visual tracking with Siamese region proposal network . In Proceedings of IEEE CVPR. 8971\u20138980 . B. Li, J. Yan, W. Wu, Z. Zhu, and X. Hu. 2018. High performance visual tracking with Siamese region proposal network. In Proceedings of IEEE CVPR. 8971\u20138980."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.81"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.169"},{"key":"e_1_2_1_8_1","volume-title":"Proceedings of ECCV. 472\u2013488","author":"Danelljan M.","unstructured":"M. Danelljan , A. Robinson , F. S. Khan , and M. Felsberg . 2016. Beyond correlation filters: Learning continuous convolution operators for visual tracking . In Proceedings of ECCV. 472\u2013488 . M. Danelljan, A. Robinson, F. S. Khan, and M. Felsberg. 2016. Beyond correlation filters: Learning continuous convolution operators for visual tracking. In Proceedings of ECCV. 472\u2013488."},{"key":"e_1_2_1_9_1","volume-title":"Proceedings of IEEE ICIP. 3645\u20133649","author":"Wojke N.","unstructured":"N. Wojke , A. Bewley , and D. Paulus . 2017. Simple online and realtime tracking with a deep association metric . In Proceedings of IEEE ICIP. 3645\u20133649 . N. Wojke, A. Bewley, and D. Paulus. 2017. Simple online and realtime tracking with a deep association metric. In Proceedings of IEEE ICIP. 3645\u20133649."},{"key":"e_1_2_1_10_1","volume-title":"Proceedings of IEEE CVPR. 6638\u20136646","author":"Danelljan M.","unstructured":"M. Danelljan , G. Bhat , F. Shahbaz Khan , and M. Felsberg . 2017. ECO: Efficient convolution operators for tracking . In Proceedings of IEEE CVPR. 6638\u20136646 . M. Danelljan, G. Bhat, F. Shahbaz Khan, and M. Felsberg. 2017. ECO: Efficient convolution operators for tracking. In Proceedings of IEEE CVPR. 6638\u20136646."},{"key":"e_1_2_1_11_1","volume-title":"Proceedings of ECCV. 850\u2013865","author":"Bertinetto L.","unstructured":"L. Bertinetto , J. Valmadre , J. F. Henriques , A. Vedaldi , and P. H. Torr . 2016. Fully-convolutional Siamese networks for object tracking . In Proceedings of ECCV. 850\u2013865 . L. Bertinetto, J. Valmadre, J. F. Henriques, A. Vedaldi, and P. H. Torr. 2016. Fully-convolutional Siamese networks for object tracking. In Proceedings of ECCV. 850\u2013865."},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1997.9.8.1735"},{"issue":"5","key":"e_1_2_1_13_1","first-page":"970","article-title":"U.S","author":"Lee M. C.","year":"1999","unstructured":"M. C. Lee and W. G. Chen . 1999 . U.S . Patent No. 5 , 970 ,173. Washington, DC: U.S. Patent and Trademark Office. M. C. Lee and W. G. Chen. 1999. U.S. Patent No. 5,970,173. Washington, DC: U.S. Patent and Trademark Office.","journal-title":"Patent"},{"key":"e_1_2_1_14_1","volume-title":"Proceedings of ECCV. 366\u2013382","author":"Zhu J.","unstructured":"J. Zhu , H. Yang , N. Liu , M. Kim , W. Zhang , and M. H. Yang . 2018. Online multi-object tracking with dual matching attention networks . In Proceedings of ECCV. 366\u2013382 . J. Zhu, H. Yang, N. Liu, M. Kim, W. Zhang, and M. H. Yang. 2018. Online multi-object tracking with dual matching attention networks. In Proceedings of ECCV. 366\u2013382."},{"key":"e_1_2_1_15_1","volume-title":"Proceedings of IEEE ICCV. 4836\u20134845","author":"Chu Q.","unstructured":"Q. Chu , W. Ouyang , H. Li , X. Wang , B. Liu , and N. Yu . 2017. Online multi-object tracking using CNN-based single object tracker with spatial-temporal attention mechanism . In Proceedings of IEEE ICCV. 4836\u20134845 . Q. Chu, W. Ouyang, H. Li, X. Wang, B. Liu, and N. Yu. 2017. Online multi-object tracking using CNN-based single object tracker with spatial-temporal attention mechanism. In Proceedings of IEEE ICCV. 4836\u20134845."},{"key":"e_1_2_1_16_1","volume-title":"Proceedings of BICS. 96\u2013105","author":"Huang Z.","unstructured":"Z. Huang , J. Zhan , H. Zhao , K. Lin , P. Zheng , and J. Lv . 2019. Real-time visual tracking base on SiamRPN with generalized intersection over union . In Proceedings of BICS. 96\u2013105 . Z. Huang, J. Zhan, H. Zhao, K. Lin, P. Zheng, and J. Lv. 2019. Real-time visual tracking base on SiamRPN with generalized intersection over union. In Proceedings of BICS. 96\u2013105."},{"key":"e_1_2_1_17_1","volume-title":"Proceedings of ICONIP. 128\u2013138","author":"Cui S.","unstructured":"S. Cui , S. Tian , and X. Yin . 2019. Combined correlation filters with Siamese region proposal network for visual tracking . In Proceedings of ICONIP. 128\u2013138 . S. Cui, S. Tian, and X. Yin. 2019. Combined correlation filters with Siamese region proposal network for visual tracking. In Proceedings of ICONIP. 128\u2013138."},{"key":"e_1_2_1_18_1","unstructured":"W. Feng Z. Hu W. Wu J. Yan and W. Ouyang. 2019. Multi-object tracking with multiple cues and switcher-aware classification. arXiv:1901.06129  W. Feng Z. Hu W. Wu J. Yan and W. Ouyang. 2019. Multi-object tracking with multiple cues and switcher-aware classification. arXiv:1901.06129"},{"key":"e_1_2_1_19_1","unstructured":"A. Milan L. Leal-Taix\u00e9 I. Reid S. Roth and K. Schindler. 2016. MOT16: A benchmark for multi object tracking. arXiv:1603.00831  A. Milan L. Leal-Taix\u00e9 I. Reid S. Roth and K. Schindler. 2016. MOT16: A benchmark for multi object tracking. arXiv:1603.00831"},{"key":"e_1_2_1_20_1","unstructured":"L. Wen D. Du Z. Cai Z. Lei M. C. Chang H. Qi and S. Lyu. 2015. UA-DETRAC: A new benchmark and protocol for multi-object detection and tracking. arXiv:1511.04136  L. Wen D. Du Z. Cai Z. Lei M. C. Chang H. Qi and S. Lyu. 2015. UA-DETRAC: A new benchmark and protocol for multi-object detection and tracking. arXiv:1511.04136"},{"key":"e_1_2_1_22_1","unstructured":"M. Z. Alom T. M. Taha C. Yakopcic S. Westberg P. Sidike M. S. Nasrin and V. K. Asari. 2018. The history began from AlexNet: A comprehensive survey on deep learning approaches. arXiv:1803.01164  M. Z. Alom T. M. Taha C. Yakopcic S. Westberg P. Sidike M. S. Nasrin and V. K. Asari. 2018. The history began from AlexNet: A comprehensive survey on deep learning approaches. arXiv:1803.01164"},{"key":"e_1_2_1_23_1","volume-title":"Proceedings of BICS. 96\u2013105","author":"Huang Z.","unstructured":"Z. Huang , J. Zhan , H. Zhao , K. Lin , P. Zheng , and J. Lv . 2019. Real-time visual tracking base on SiamRPN with generalized intersection over union . In Proceedings of BICS. 96\u2013105 . Z. Huang, J. Zhan, H. Zhao, K. Lin, P. Zheng, and J. Lv. 2019. Real-time visual tracking base on SiamRPN with generalized intersection over union. In Proceedings of BICS. 96\u2013105."},{"key":"e_1_2_1_24_1","volume-title":"Proceedings of IEEE CVPR. 4591\u20134600","author":"Zhang Z.","unstructured":"Z. Zhang and H. Peng . 2019. Deeper and wider Siamese networks for real-time visual tracking . In Proceedings of IEEE CVPR. 4591\u20134600 . Z. Zhang and H. Peng. 2019. Deeper and wider Siamese networks for real-time visual tracking. In Proceedings of IEEE CVPR. 4591\u20134600."},{"key":"e_1_2_1_25_1","volume-title":"Proceedings of IEEE CVPR. 4282\u20134291","author":"Li B.","unstructured":"B. Li , W. Wu , Q. Wang , F. Zhang , J. Xing , and J. Yan . 2019. SiamRPN++: Evolution of Siamese visual tracking with very deep networks . In Proceedings of IEEE CVPR. 4282\u20134291 . B. Li, W. Wu, Q. Wang, F. Zhang, J. Xing, and J. Yan. 2019. SiamRPN++: Evolution of Siamese visual tracking with very deep networks. In Proceedings of IEEE CVPR. 4282\u20134291."},{"key":"e_1_2_1_26_1","volume-title":"Proceedings of IEEE CVPR. 8971\u20138980","author":"Li B.","unstructured":"B. Li , J. Yan , W. Wu , Z. Zhu , and X. Hu . 2018. High performance visual tracking with Siamese region proposal network . In Proceedings of IEEE CVPR. 8971\u20138980 . B. Li, J. Yan, W. Wu, Z. Zhu, and X. Hu. 2018. High performance visual tracking with Siamese region proposal network. In Proceedings of IEEE CVPR. 8971\u20138980."},{"key":"e_1_2_1_27_1","volume-title":"Proceedings of IEEE ICCE-Asia. 16\u201334","author":"Li D.","unstructured":"D. Li , X. Wang , and Y. Yu . 2019. Siamese visual tracking with deep features and robust feature fusion . In Proceedings of IEEE ICCE-Asia. 16\u201334 . D. Li, X. Wang, and Y. Yu. 2019. Siamese visual tracking with deep features and robust feature fusion. In Proceedings of IEEE ICCE-Asia. 16\u201334."},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2020.02.080"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF00384623"},{"key":"e_1_2_1_30_1","volume-title":"Proceedings of Odyssey. 15","author":"Dehak N.","unstructured":"N. Dehak , R. Dehak , J. R. Glass , D. A. Reynolds , and P. Kenny . 2010. Cosine similarity scoring without score normalization techniques . In Proceedings of Odyssey. 15 . N. Dehak, R. Dehak, J. R. Glass, D. A. Reynolds, and P. Kenny. 2010. Cosine similarity scoring without score normalization techniques. In Proceedings of Odyssey. 15."},{"key":"e_1_2_1_31_1","volume-title":"Proceedings of IEEE CVPR. 4282\u20134291","author":"Li B.","unstructured":"B. Li , W. Wu , Q. Wang , F. Zhang , J. Xing , and J. Yan . 2019. SiamRPN++: Evolution of Siamese visual tracking with very deep networks . In Proceedings of IEEE CVPR. 4282\u20134291 . B. Li, W. Wu, Q. Wang, F. Zhang, J. Xing, and J. Yan. 2019. SiamRPN++: Evolution of Siamese visual tracking with very deep networks. In Proceedings of IEEE CVPR. 4282\u20134291."},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1007\/s13042-010-0002-z"},{"key":"e_1_2_1_33_1","unstructured":"R. T. Collins A. J. Lipton T. Kanade H. Fujiyoshi D. Duggins Y. Tsin and L. Wixson. 2000. A System for Video Surveillance and Monitoring. Final Report. VSAM.  R. T. Collins A. J. Lipton T. Kanade H. Fujiyoshi D. Duggins Y. Tsin and L. Wixson. 2000. A System for Video Surveillance and Monitoring. Final Report. VSAM."},{"key":"e_1_2_1_34_1","volume-title":"Proceedings of IEEE PETS. 7\u201314","author":"Bashir F.","unstructured":"F. Bashir and F. Porikli . 2006. Performance evaluation of object detection and tracking systems . In Proceedings of IEEE PETS. 7\u201314 . F. Bashir and F. Porikli. 2006. Performance evaluation of object detection and tracking systems. In Proceedings of IEEE PETS. 7\u201314."},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-40597-6_19"},{"key":"e_1_2_1_36_1","volume-title":"Proceedings of IEEE AVSS. 1\u20136.","author":"Bochinski E.","unstructured":"E. Bochinski , T. Senst , and T. Sikora . 2018. Extending IoU based multi-object tracking by visual information . In Proceedings of IEEE AVSS. 1\u20136. E. Bochinski, T. Senst, and T. Sikora. 2018. Extending IoU based multi-object tracking by visual information. In Proceedings of IEEE AVSS. 1\u20136."},{"key":"e_1_2_1_37_1","volume-title":"Proceedings of ICIRCA. 1305\u20131308","author":"Chandan G.","unstructured":"G. Chandan , A. Jain , and H. Jain . 2018. Real time object detection and tracking using deep learning and OpenCV . In Proceedings of ICIRCA. 1305\u20131308 . G. Chandan, A. Jain, and H. Jain. 2018. Real time object detection and tracking using deep learning and OpenCV. In Proceedings of ICIRCA. 1305\u20131308."},{"key":"e_1_2_1_38_1","unstructured":"W. Lotter G. Kreiman and D. Cox. 2015. Unsupervised learning of visual structure using predictive generative networks. arXiv:1511.06380  W. Lotter G. Kreiman and D. Cox. 2015. Unsupervised learning of visual structure using predictive generative networks. arXiv:1511.06380"},{"key":"e_1_2_1_39_1","doi-asserted-by":"crossref","unstructured":"M. J. Shafiee B. Chywl F. Li and A. Wong. 2017. Fast YOLO: A fast you only look once system for real-time embedded object detection in video. arXiv:1709.05943  M. J. Shafiee B. Chywl F. Li and A. Wong. 2017. Fast YOLO: A fast you only look once system for real-time embedded object detection in video. arXiv:1709.05943","DOI":"10.15353\/vsnl.v3i1.171"},{"key":"e_1_2_1_40_1","volume-title":"Proceedings of ECCV. 135\u2013153","author":"Varior R. R.","unstructured":"R. R. Varior , B. Shuai , J. Lu , D. Xu , and G. Wang . 2016. A Siamese long short-term memory architecture for human re-identification . In Proceedings of ECCV. 135\u2013153 . R. R. Varior, B. Shuai, J. Lu, D. Xu, and G. Wang. 2016. A Siamese long short-term memory architecture for human re-identification. In Proceedings of ECCV. 135\u2013153."},{"key":"e_1_2_1_41_1","volume-title":"Proceedings of IEEE ICCV. 618\u2013626","author":"Selvaraju R. R.","unstructured":"R. R. Selvaraju , M. Cogswell , A. Das , R. Vedantam , D. Parikh , and D. Batra . 2017. Grad-CAM: Visual explanations from deep networks via gradient-based localization . In Proceedings of IEEE ICCV. 618\u2013626 . R. R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D. Parikh, and D. Batra. 2017. Grad-CAM: Visual explanations from deep networks via gradient-based localization. In Proceedings of IEEE ICCV. 618\u2013626."},{"key":"e_1_2_1_42_1","volume-title":"Proceedings of ECCV. 818\u2013833","author":"Zeiler M. D.","unstructured":"M. D. Zeiler and R. Fergus . 2014. Visualizing and understanding convolutional networks . In Proceedings of ECCV. 818\u2013833 . M. D. Zeiler and R. Fergus. 2014. Visualizing and understanding convolutional networks. In Proceedings of ECCV. 818\u2013833."},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2016.2567386"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1016\/0167-6377(86)90073-8"},{"key":"e_1_2_1_45_1","volume-title":"Proceedings of DICTA. 1\u20136.","author":"Wong S. C.","unstructured":"S. C. Wong , A. Gatt , V. Stamatescu , and M. D. McDonnell . 2016. Understanding data augmentation for classification: When to warp? In Proceedings of DICTA. 1\u20136. S. C. Wong, A. Gatt, V. Stamatescu, and M. D. McDonnell. 2016. Understanding data augmentation for classification: When to warp? In Proceedings of DICTA. 1\u20136."},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2014.2388226"},{"key":"e_1_2_1_47_1","volume-title":"Proceedings of IEEE CVPR. 5813\u20135821","author":"Zhang Z.","unstructured":"Z. Zhang , S. Qiao , C. Xie , W. Shen , B. Wang , and A. L. Yuille . 2018. Single-shot object detection with enriched semantics . In Proceedings of IEEE CVPR. 5813\u20135821 . Z. Zhang, S. Qiao, C. Xie, W. Shen, B. Wang, and A. L. Yuille. 2018. Single-shot object detection with enriched semantics. In Proceedings of IEEE CVPR. 5813\u20135821."},{"key":"e_1_2_1_48_1","volume-title":"Proceedings of ECCV. 366\u2013382","author":"Zhu J.","unstructured":"J. Zhu , H. Yang , N. Liu , M. Kim , W. Zhang , and M. H. Yang . 2018. Online multi-object tracking with dual matching attention networks . In Proceedings of ECCV. 366\u2013382 . J. Zhu, H. Yang, N. Liu, M. Kim, W. Zhang, and M. H. Yang. 2018. Online multi-object tracking with dual matching attention networks. In Proceedings of ECCV. 366\u2013382."},{"key":"e_1_2_1_49_1","volume-title":"Proceedings of IEEE CVPR. 3539\u20133548","author":"Tang S.","unstructured":"S. Tang , M. Andriluka , B. Andres , and B. Schiele . 2017. Multiple people tracking by lifted multicut and person re-identification . In Proceedings of IEEE CVPR. 3539\u20133548 . S. Tang, M. Andriluka, B. Andres, and B. Schiele. 2017. Multiple people tracking by lifted multicut and person re-identification. In Proceedings of IEEE CVPR. 3539\u20133548."},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/3123266.3123452"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.5555\/3298023.3298181"},{"key":"e_1_2_1_52_1","volume-title":"Proceedings of IEEE CVPR. 1318\u20131327","author":"He Z.","unstructured":"Z. He , J. Li , D. Liu , H. He , and D. Barber . 2019. Tracking by animation: Unsupervised learning of multi-object attentive trackers . In Proceedings of IEEE CVPR. 1318\u20131327 . Z. He, J. Li, D. Liu, H. He, and D. Barber. 2019. Tracking by animation: Unsupervised learning of multi-object attentive trackers. In Proceedings of IEEE CVPR. 1318\u20131327."},{"key":"e_1_2_1_53_1","doi-asserted-by":"crossref","unstructured":"Y. C. Yoon D. Y. Kim K. Yoon Y. M. Song and M. Jeon. 2019. Online multiple pedestrian tracking using deep temporal appearance matching association. arXiv:1907.00831  Y. C. Yoon D. Y. Kim K. Yoon Y. M. Song and M. Jeon. 2019. Online multiple pedestrian tracking using deep temporal appearance matching association. arXiv:1907.00831","DOI":"10.1109\/ICCE-ASIA.2018.8552105"},{"key":"e_1_2_1_54_1","unstructured":"W. Feng Z. Hu W. Wu J. Yan and W. Ouyang. 2019. Multi-object tracking with multiple cues and switcher-aware classification. arXiv:1901.06129  W. Feng Z. Hu W. Wu J. Yan and W. Ouyang. 2019. Multi-object tracking with multiple cues and switcher-aware classification. arXiv:1901.06129"},{"key":"e_1_2_1_55_1","unstructured":"C. Yan B. Gong Y. Wei and Y. Gao. 2020. Deep multi-view enhancement hashing for image retrieval. arXiv:2002.00169  C. Yan B. Gong Y. Wei and Y. Gao. 2020. Deep multi-view enhancement hashing for image retrieval. arXiv:2002.00169"},{"key":"e_1_2_1_56_1","unstructured":"A. Milan L. Leal-Taix\u00e9 I. Reid S. Roth and K. Schindler. 2016. MOT16: A benchmark for multi-object tracking. arXiv:1603.00831  A. Milan L. Leal-Taix\u00e9 I. Reid S. Roth and K. Schindler. 2016. MOT16: A benchmark for multi-object tracking. arXiv:1603.00831"},{"key":"e_1_2_1_57_1","unstructured":"W. Luo J. Xing A. Milan X. Zhang W. Liu X. Zhao and T. K. Kim. 2014. Multiple object tracking: A literature review. arXiv:1409.7618  W. Luo J. Xing A. Milan X. Zhang W. Liu X. Zhao and T. K. Kim. 2014. Multiple object tracking: A literature review. arXiv:1409.7618"},{"key":"e_1_2_1_58_1","unstructured":"Y. Zhang D. Wang L. Wang J. Qi and H. Lu. 2018. Learning regression and verification networks for long-term visual tracking. arXiv:1809.04320  Y. Zhang D. Wang L. Wang J. Qi and H. Lu. 2018. Learning regression and verification networks for long-term visual tracking. arXiv:1809.04320"},{"key":"e_1_2_1_59_1","volume-title":"Proceedings of ICCV. 300\u2013311","author":"Sadeghian A.","unstructured":"A. Sadeghian , A. Alahi , and S. Savarese . 2017. Tracking the untrackable: Learning to track multiple cues with long-term dependencies . In Proceedings of ICCV. 300\u2013311 . A. Sadeghian, A. Alahi, and S. Savarese. 2017. Tracking the untrackable: Learning to track multiple cues with long-term dependencies. In Proceedings of ICCV. 300\u2013311."},{"key":"e_1_2_1_60_1","volume-title":"Proceedings of CVPR. 6768\u20136777","author":"Yin J.","unstructured":"J. Yin , W. Wang , Q. Meng , R. Yang , and J. Shen . 2020. A unified object motion and affinity model for online multi-object tracking . In Proceedings of CVPR. 6768\u20136777 . J. Yin, W. Wang, Q. Meng, R. Yang, and J. Shen. 2020. A unified object motion and affinity model for online multi-object tracking. In Proceedings of CVPR. 6768\u20136777."},{"key":"e_1_2_1_61_1","volume-title":"Proceedings of IEEE WACV. 161\u2013170","author":"Chu P.","unstructured":"P. Chu , H. Fan , C. C. Tan , and H. Ling . 2019. Online multi-object tracking with instance-aware tracker and dynamic model refreshment . In Proceedings of IEEE WACV. 161\u2013170 . P. Chu, H. Fan, C. C. Tan, and H. Ling. 2019. Online multi-object tracking with instance-aware tracker and dynamic model refreshment. In Proceedings of IEEE WACV. 161\u2013170."},{"key":"e_1_2_1_62_1","volume-title":"Proceedings of ICCV. 6172\u20136181","author":"Chu P.","unstructured":"P. Chu and H. Ling . 2019. FAMNet: Joint learning of feature, affinity and multi-dimensional assignment for online multiple object tracking . In Proceedings of ICCV. 6172\u20136181 . P. Chu and H. Ling. 2019. FAMNet: Joint learning of feature, affinity and multi-dimensional assignment for online multiple object tracking. In Proceedings of ICCV. 6172\u20136181."},{"key":"e_1_2_1_63_1","volume-title":"Anomalies Detection and Tracking Using Siamese Neural Networks. Master's Thesis","author":"An N.","unstructured":"N. An . 2020. Anomalies Detection and Tracking Using Siamese Neural Networks. Master's Thesis . Auckland University of Technology , New Zealand . N. An. 2020. Anomalies Detection and Tracking Using Siamese Neural Networks. Master's Thesis. Auckland University of Technology, New Zealand."},{"key":"e_1_2_1_64_1","volume-title":"Computational Methods for Deep Learning","author":"Yan W.","unstructured":"W. Yan . 2020. Computational Methods for Deep Learning . Springer . W. Yan. 2020. Computational Methods for Deep Learning. Springer."},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.5555\/3350817"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/linproxy.fan.workers.dev:443\/https\/dl.acm.org\/doi\/10.1145\/3441656","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/linproxy.fan.workers.dev:443\/https\/dl.acm.org\/doi\/pdf\/10.1145\/3441656","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:24:30Z","timestamp":1750195470000},"score":1,"resource":{"primary":{"URL":"https:\/\/linproxy.fan.workers.dev:443\/https\/dl.acm.org\/doi\/10.1145\/3441656"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,5,18]]},"references-count":64,"journal-issue":{"issue":"2s","published-print":{"date-parts":[[2021,6,21]]}},"alternative-id":["10.1145\/3441656"],"URL":"https:\/\/linproxy.fan.workers.dev:443\/https\/doi.org\/10.1145\/3441656","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,5,18]]},"assertion":[{"value":"2020-07-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-12-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-05-18","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}