2D and 3D object detection algorithms from images: A Survey

Object detection is a crucial branch of computer vision that aims to locate and classify objects in images. Using deep convolutional neural networks (CNNs) as the primary framework for object detection can efficiently extract features, which is closer to real-time performance than the traditional mo...

Full description

Saved in:
Bibliographic Details
Published in:Array (New York) Vol. 19; p. 100305
Main Authors: Chen, Wei, Li, Yan, Tian, Zijian, Zhang, Fan
Format: Journal Article
Language:English
Published: Elsevier Inc 01.09.2023
Elsevier
Subjects:
ISSN:2590-0056, 2590-0056
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract Object detection is a crucial branch of computer vision that aims to locate and classify objects in images. Using deep convolutional neural networks (CNNs) as the primary framework for object detection can efficiently extract features, which is closer to real-time performance than the traditional model that extracts features manually. In recent years, the rise of Transformer with powerful self-attention mechanisms has further enhanced performance to a new level. However, when it comes to specific vision tasks in the real world, it is necessary to obtain 3D information about the spatial coordinates, orientation, and velocity of objects, which makes research on object detection in 3D scenes more active. Although LiDAR-based 3D object detection algorithms have excellent performance, they are difficult to popularize in practical applications due to their high price. Hence, we summarize the development process, different frameworks, contributions, advantages, disadvantages, and development trends of image-based 2D and 3D object detection algorithms in recent years to help more researchers better understand this field. Besides, representative datasets,evaluation metrics,related techniques and applications are introduced, and some valuable research directions are discussed.
AbstractList Object detection is a crucial branch of computer vision that aims to locate and classify objects in images. Using deep convolutional neural networks (CNNs) as the primary framework for object detection can efficiently extract features, which is closer to real-time performance than the traditional model that extracts features manually. In recent years, the rise of Transformer with powerful self-attention mechanisms has further enhanced performance to a new level. However, when it comes to specific vision tasks in the real world, it is necessary to obtain 3D information about the spatial coordinates, orientation, and velocity of objects, which makes research on object detection in 3D scenes more active. Although LiDAR-based 3D object detection algorithms have excellent performance, they are difficult to popularize in practical applications due to their high price. Hence, we summarize the development process, different frameworks, contributions, advantages, disadvantages, and development trends of image-based 2D and 3D object detection algorithms in recent years to help more researchers better understand this field. Besides, representative datasets,evaluation metrics,related techniques and applications are introduced, and some valuable research directions are discussed.
ArticleNumber 100305
Author Chen, Wei
Tian, Zijian
Zhang, Fan
Li, Yan
Author_xml – sequence: 1
  givenname: Wei
  surname: Chen
  fullname: Chen, Wei
  email: chenwdavior@163.com
  organization: School of Mechanical, Electrical & Information Engineering, China University of Mining and Technology (Beijing), Beijing, 100083, China
– sequence: 2
  givenname: Yan
  surname: Li
  fullname: Li, Yan
  email: 18600873522@163.com
  organization: School of Mechanical, Electrical & Information Engineering, China University of Mining and Technology (Beijing), Beijing, 100083, China
– sequence: 3
  givenname: Zijian
  surname: Tian
  fullname: Tian, Zijian
  email: Tianzj0726@126.com
  organization: School of Mechanical, Electrical & Information Engineering, China University of Mining and Technology (Beijing), Beijing, 100083, China
– sequence: 4
  givenname: Fan
  surname: Zhang
  fullname: Zhang, Fan
  email: zf@cumtb.edu.cn
  organization: School of Mechanical, Electrical & Information Engineering, China University of Mining and Technology (Beijing), Beijing, 100083, China
BookMark eNqFkMtKLDEQhoMoeH0CN3mBGSvXnj4HF-IdBBfqOlQnlTHNTEfSrTBvb48jIi509RcF30_Vt8-2u9wRY8cCpgKEPWmnWAquphKkGjegwGyxPWlqmAAYu_1t3mVHfd8CgDRCCDPbY__lBccucHXBc9OSH3igYYyUO46LeS5peF72PJa85GmJc-r_8TP-8FreaHXIdiIuejr6zAP2dHX5eH4zubu_vj0_u5t4LfQwwVmMdVMFHbWNFmoljbRaaxssNrGqIZIKGEVTaZrVVaUbMIGs9lA31IBWB-x20xsytu6ljHeUlcuY3Mcil7nDMiS_IGdkpWYoNSkibUEgWJTKkK8CoGzqsUttunzJfV8ofvUJcGudrnUfOt1ap9voHKn6B-XTgGtJQ8G0-IM93bA0KnpLVFzvE3WeQiqj6PGH9Cv_DrJSkXE
CitedBy_id crossref_primary_10_1109_LSP_2024_3402338
crossref_primary_10_1016_j_array_2025_100469
crossref_primary_10_3390_s24144526
crossref_primary_10_1007_s13349_025_00921_1
crossref_primary_10_3390_s25185884
crossref_primary_10_1016_j_engappai_2025_111113
crossref_primary_10_1016_j_sna_2024_116082
crossref_primary_10_3390_s24217007
crossref_primary_10_1109_ACCESS_2025_3599358
crossref_primary_10_1007_s00170_024_13874_4
crossref_primary_10_1016_j_marpetgeo_2024_106965
crossref_primary_10_1016_j_procs_2024_10_282
crossref_primary_10_3390_jmse11091658
crossref_primary_10_32604_cmc_2024_046501
crossref_primary_10_1080_17483107_2025_2530674
crossref_primary_10_1109_ACCESS_2024_3386826
crossref_primary_10_1016_j_eswa_2025_129652
crossref_primary_10_1016_j_patcog_2025_112347
crossref_primary_10_1109_ACCESS_2024_3514673
crossref_primary_10_3390_agriengineering6020065
crossref_primary_10_3390_app14010249
crossref_primary_10_1109_OJVT_2025_3542213
crossref_primary_10_1057_s41599_025_04503_w
crossref_primary_10_1109_JSEN_2024_3392918
crossref_primary_10_1109_ACCESS_2024_3484933
crossref_primary_10_1016_j_nexres_2025_100424
crossref_primary_10_1016_j_eswa_2023_122212
crossref_primary_10_1371_journal_pone_0315384
crossref_primary_10_3390_s25175264
crossref_primary_10_1109_ACCESS_2024_3431244
Cites_doi 10.1007/s11263-013-0620-5
10.1016/j.neunet.2021.12.003
10.1109/TIP.2020.3002345
10.1109/TPAMI.2015.2389824
10.1023/B:VISI.0000029664.99615.94
10.1007/s11263-009-0275-4
10.1109/TPAMI.2017.2706685
10.1109/CVPR46437.2021.00219
10.1109/ICRA40945.2020.9196660
10.1016/j.eswa.2022.116793
10.1109/CVPR.2019.00507
10.1109/ICSP56322.2022.9965335
10.1109/CVPR.2019.00720
10.1109/TPAMI.2016.2577031
10.1007/978-3-319-46448-0_2
10.1109/CVPR.2005.177
ContentType Journal Article
Copyright 2023
Copyright_xml – notice: 2023
DBID 6I.
AAFTH
AAYXX
CITATION
DOA
DOI 10.1016/j.array.2023.100305
DatabaseName ScienceDirect Open Access Titles
Elsevier:ScienceDirect:Open Access
CrossRef
Directory of Open Access Journals (DOAJ)
DatabaseTitle CrossRef
DatabaseTitleList

Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISSN 2590-0056
ExternalDocumentID oai_doaj_org_article_52738a24e3ee4601a06a235ec7d0a2b9
10_1016_j_array_2023_100305
S2590005623000309
GroupedDBID 0SF
6I.
AAEDW
AAFTH
AALRI
AAXUO
AEXQZ
AITUG
ALMA_UNASSIGNED_HOLDINGS
AMRAJ
EBS
EJD
FDB
GROUPED_DOAJ
M41
M~E
NCXOZ
OK1
ROL
0R~
AAYWO
AAYXX
ACVFH
ADCNI
ADVLN
AEUPX
AFJKZ
AFPUW
AIGII
AKBMS
AKYEP
APXCP
CITATION
ID FETCH-LOGICAL-c414t-a8ff9b7d4f46f609325264446d6abf790fe3daf1b74e89774b05de64c09beb043
IEDL.DBID DOA
ISICitedReferencesCount 35
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001043834800001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 2590-0056
IngestDate Fri Oct 03 12:44:09 EDT 2025
Tue Nov 18 22:34:37 EST 2025
Thu Nov 20 00:39:06 EST 2025
Fri Aug 04 01:17:46 EDT 2023
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Keywords 3D
Transformer
Image
Object detection
CNNs
Language English
License This is an open access article under the CC BY license.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c414t-a8ff9b7d4f46f609325264446d6abf790fe3daf1b74e89774b05de64c09beb043
OpenAccessLink https://doaj.org/article/52738a24e3ee4601a06a235ec7d0a2b9
ParticipantIDs doaj_primary_oai_doaj_org_article_52738a24e3ee4601a06a235ec7d0a2b9
crossref_primary_10_1016_j_array_2023_100305
crossref_citationtrail_10_1016_j_array_2023_100305
elsevier_sciencedirect_doi_10_1016_j_array_2023_100305
PublicationCentury 2000
PublicationDate September 2023
2023-09-00
2023-09-01
PublicationDateYYYYMMDD 2023-09-01
PublicationDate_xml – month: 09
  year: 2023
  text: September 2023
PublicationDecade 2020
PublicationTitle Array (New York)
PublicationYear 2023
Publisher Elsevier Inc
Elsevier
Publisher_xml – name: Elsevier Inc
– name: Elsevier
References Qian, Garg, Wang, You, Belongie, Hariharan (bib102) 2020
Liu, Anguelov, Erhan, Szegedy, Reed, Fu (bib13) 2016
Philion, Kar, Fidler (bib174) 2020
Sun, Chen, Xie, Zhang, Jiang, Zhou (bib85) 2020
He, Zhang, Ren, Sun (bib5) 2016
Li, Liang, Wei, Xu, Feng, Yan (bib134) 2017
Shi, Rajkumar (bib129) 2020
Wang, Wang, Dang, Liu, Hu, Yu (bib54) 2022
Liu, Wang, Liu (bib92) 2021
Inoue, Furuta, Yamasaki, Aizawa (bib127) 2018
Liu, Huang, Wang (bib176) 2017
Hasan, Liao, Li, Akram, Shao (bib157) 2022
Yu, Chang, Lv, Xu, Cui, Ji (bib117) 2021
He, Gkioxari, Dollár, Girshick, Mask (bib24) 2017
Chen, Chen, Liu, Wang, Jia (bib122) 2021
Lin, Dollár, Girshick, He, Hariharan, Belongie (bib8) 2017
Yang, Li, Jiang, Gong, Yuan, Zhao (bib121) 2022
Qin, Wang, Lu (bib71) 2019; 33
Chen, Yang, Zhang, Meng, Pan, Sun (bib113) 2019
He, Zhang, Ren, Sun (bib21) 2015; 37
Zhang, Ye, Zhang, Liu, Zhang, Tian (bib161) 2022
Hwang, Benz, Kim (bib159) 2022
Peng, Zhu, Wang, Ma (bib87) 2022
Wang, Zhang, Yang, Sun (bib62) 2022; 36
Liu, Jiang, Zhu, xu (bib162) 2023
Jeong, Park, Kwak (bib33) 2017
Dooley, Wei, Goldstein, Dickerson (bib148) 2022
Simonelli, Bulò, Porzi, Lopez-Antequera, Kontschieder (bib70) 2019
Wang, Min, Ge, Li, Li, Yang (bib108) 2022
Xu, Sun, Yang, Miao, Yang, H2Fa (bib128) 2022
Brazil, Liu (bib74) 2019
Krizhevsky, Sutskever, Hinton (bib4) 2012; 25
Chang, Wang, Yang, Yu, Yu, Xia (bib179) 2022
Redmon, Divvala, Girshick, Farhadi (bib12) 2016
Li, Bao, Ge, Yang, Sun, Li (bib107) 2022
Dai, Jiang, Wu, Bao, Wang, Liu (bib123) 2021
Gilroy, Mullins, Jones, Parsi, Glavin (bib156) 2022
Li, Mao, Girshick, He (bib66) 2022
Shamsolmoali, Zareapoor, Granger, Chanussot, Yang (bib164) 2022
Zhu, Chen, Shen, Savvides (bib49) 2020
Ren, He, Girshick, Sun, Faster (bib23) 2017; 39
Lowe (bib3) 2004; 60
Cai, Vasconcelos (bib25) 2018
Pon, Ku, Li, Waslander (bib99) 2020
Mao, Yang, Dally (bib173) 2019
Roh, Shin, Shin, Kim (bib59) 2021
Chen, Li, Sakaridis, Dai, Gool (bib124) 2018
Cen, Yun, Cai, Wang, Liu (bib182) 2021
Xu, Zhang, Ye, Tan, Yang, Wen (bib101) 2020; 34
Chopra, Khurana (bib20) 2023
Wang, Bashir, Khan, Ullah, Wang, Song (bib163) 2022; 197
Zhou, Zhuo, Krähenbühl (bib45) 2019
Yang, Chen, Tian, Tao, Zhu, Zhang (bib110) 2022
Luo, Dai, Shao, Ding (bib76) 2021
Long, Deng, Wang, Zhang, Dang, Gao (bib30) 2020
Wang, Xie, Li, Fan, Song, Liang (bib64) 2022; vol. 8
Yan, Nie, Cai, Han, Xu, Yang (bib80) 2022
Park, Xu, Yang, Keutzer, Kitani, Tomizuka (bib111) 2022
Deng, Dong, Socher, Li, Kai, Li (bib137) 2009
Najibi, Samangouei, Chellappa, Davis (bib177) 2017
Tian, Shen, Chen, He (bib46) 2019
Chen, Huang, Liu, Yu, Jia (bib90) 2023; 45
Garg, Wang, Hariharan, Weinberger, Chao (bib100) 2020
Zheng, Fu, Zhao (bib34) 2018
Huang, Huang, Zheng, Du (bib105) 2021
Wang, Shrivastava, Gupta (bib133) 2017
Chandio, Gui, Kumar, Ullah, Ranjbarzadeh, Roy (bib37) 2022
Deng, Qi, Najibi, Funkhouser, Zhou, Anguelov (bib175) 2021
Girshick, Fast (bib22) 2015
Chang, Chen (bib88) 2018
Shen, Liao, Nie, Zheng, Zhao (bib81) 2022
Uijlings, van de Sande, Gevers, Smeulders (bib19) 2013; 104
Dosovitskiy, Beyer, Kolesnikov, Weissenborn, Zhai, Unterthiner (bib17) 2020
Fu, Liu, Ranga, Tyagi, Berg (bib32) 2017
You, Wang, Chao, Garg, Pleiss, Hariharan (bib97) 2019
Bochkovskiy, Wang, Liao (bib9) 2020
Law, Teng, Russakovsky, Deng (bib42) 2019
Carion, Massa, Synnaeve, Usunier, Kirillov, Zagoruyko (bib11) 2020; 2020
Muhammad, Romaissa, Oussalah (bib152) 2023
Wang, Yin, Kong, Jiang, Li, Shen (bib95) 2020; 34
Huang, Wang, Lv, Bai, Long, Deng (bib31) 2021
Dosovitskiy, Ros, Codevilla, López, Koltun (bib144) 2017
Dalal, Triggs (bib2) 2005; 1
Girshick, Donahue, Darrell, Malik (bib18) 2014
Wang, Chao, Garg, Hariharan, Weinberger (bib93) 2018
Wang, Yuan, Zhang, Feng (bib119) 2019
Liu, Wu, Tóth (bib77) 2020
Li, Liu, Bai, Lin, Ling (bib169) 2022; 60
Duan, Bai, Xie, Qi, Huang, Tian (bib44) 2019
Man, Weng, Sivakumar, O'Toole, Kitani (bib131) 2021
Wang, Bochkovskiy, Liao (bib28) 2022
Manhardt, Kehl, Gaidon (bib69) 2019
Wang, Li, Wu, Xu, Shen, Yang (bib149) 2023
Li, Ge, Yu, Yang, Wang, Shi (bib106) 2022
Gao, Zheng, Wang, Dai, Li (bib56) 2021
Liang, Wang, Tang, Hu, Ling (bib114) 2021
Xu, Wang, Lv, Chang, Cui, Deng (bib15) 2022
Wang, Yang, Hu, Liang, Urtasun (bib103) 2021
Lu, Ma, Yang, Zhang, Liu, Chu (bib73) 2021
Fang, Huber, Damer (bib154) 2023
Kumar, Brazil, Liu (bib75) 2021
Li, Du, Zhang, Wen, Luo, Wu (bib126) 2020
Everingham, Van Gool, Williams, Winn, Zisserman (bib136) 2010
Bai, Zhang, Ding, Ghanem (bib135) 2018
Lin, Pei, Chen, Zhang, Lu (bib158) 2022
Kong, Li, Wang (bib155) 2023
Guo, Shi, Wang, Li (bib91) 2021
Sun, Cao, Yang, Kitani (bib60) 2021
Zhang, Li, Wang, Lu (bib170) 2020
Liu, Chen, Wang (bib167) 2022
Lin, Maire, Belongie, Hays, Perona, Ramanan (bib138) 2014
Ge, Liu, Wang, Li, Sun (bib51) 2021
Zheng, Gao, Wang, Li, Dong (bib55) 2020
Li, Ouyang, Sheng, Zeng, Wang (bib68) 2019
Dwivedi, Kumar, Chopra, Kothari, Singh (bib153) 2023
Li, Ku, Waslander (bib98) 2020
Zou, Wu, Zhou, Huang (bib52) 2022
Dai, Chang, Savva, Halber, Funkhouser, Nießner (bib145) 2017
Chen, Wang, Yang, Zhang, Cheng, Sun (bib10) 2021
Zhang, Lu, Zhou (bib78) 2021
Ye, Wang, Zhou, Lei, Fan, Qin (bib166) 2023
Philion, Fidler (bib104) 2020
Sun, Kretzschmar, Dotiwalla, Chouard, Patnaik, Tsui (bib141) 2020
Rashwan, Kalra, Poupart (bib43) 2019
Law, Deng (bib41) 2018
Hinton, Vinyals, Dean (bib118) 2015
Joseph, Khan, Khan, Balasubramanian (bib180) 2021
Dai, Li, He, Sun (bib14) 2016
Yi, Wu, Metaxas (bib36) 2019
Qin, Wang, Lu (bib84) 2019
Yang, Wang (bib35) 2019
Zhu, He, Savvides (bib47) 2019
Zhu, Su, Lu, Li, Wang, Dai (bib58) 2020
Liu, Lin, Cao, Hu, Wei, Zhang (bib6) 2021
Guo, Han, Wang, Zhang, Yang, Wu (bib116) 2020
Liu, Li, Zhang, Yang, Qi, Su (bib61) 2022
Ghiasi, Lin, Le (bib115) 2019
Weng, Man, Cheng, Park, O'Toole, Kitani (bib143) 2020
Lin, Sun, Liu, Bian, Cen, Zhou (bib168) 2022
Li, Li, Jiang, Weng, Geng, Li (bib53) 2022
Zong, Jiang, Song, Xue, Su, Li (bib112) 2023
Wang, Zhu, Pang, Lin (bib79) 2021
Liu, Anguelov, Erhan, Szegedy, Reed, Fu (bib178) 2016
Nie, Anwer, Cholakkal, Khan, Pang, Shao (bib40) 2019
Chen, Kundu, Zhu, Ma, Fidler, Urtasun (bib82) 2018; 40
Beal, Kim, Tzeng, Park, Zhai, Kislyuk (bib63) 2020
Piland, Czajka, Sweet (bib151) 2023
Bai, Xia (bib172) 2022; 1
Qiu, Li, Wu, Cui, Song, Wang (bib50) 2021
Guo, Han, Wang, Wu, Chen, Xu (bib120) 2021
Krasin, Duerig, Alldrin, Veit, Abu-El-Haija, Belongie (bib139) 2016
Yan, Wan, Zhang (bib165) 2022; 2023
Redmon, Farhadi (bib26) 2017
Song, Lichtenberg, Xiao, Sun (bib146) 2015
Wu, Zhao, Zhang (bib16) 2021
Silberman, Hoiem, Kohli, Fergus (bib147) 2012
Shuvo (bib171) 2023
Chen, Liu, Shen, Jia (bib89) 2020
Peng, Pan, Liu, Sun (bib86) 2020
Ansari, Meraz, Chakraborty, Javed (bib132) 2022
Kong, Sun, Liu, Jiang, Li, Shi (bib48) 2020; 29
Zhu, Pang, Yang, Shi, Lin (bib125) 2019
Ye, Du, Shi, Li, Tan, Feng (bib96) 2020
Gupta, Narayan, Joseph, Khan, Khan, Shah (bib181) 2022
Zheng, Li, Hong, Petersson, Barnes (bib183) 2022
Geiger, Lenz, Urtasun (bib140) 2012
Shi, Ye, Chen, Chen, Chen, Kim (bib72) 2021
Li, Wang, Li, Xie, Sima, Lu (bib109) 2022
Wang, Xie, Li, Fan, Song, Liang (bib7) 2021
Jocher (bib29) 2020
Lin, Goyal, Girshick, He, Dollár (bib38) 2017
Zhang, Wen, Bian, Lei, Li (bib39) 2018
Boyd, Tinsley, Bowyer, Czajka (bib150) 2021
Li, Chen, Shen (bib83) 2019
Lienhart, Maydt (bib1) 2002
Redmon, Farhadi (bib27) 2018
Ma, Wang, Li, Zhang, Ouyang, Fan (bib94) 2019
Vora, Dutta, Jain, Karthik, Gandhi (bib160) 2023
Najibi, Lai, Kundu, Lu, Rathod, Funkhouser (bib130) 2020
Caesar, Bankiti, Lang, Vora, Liong, Xu (bib142) 2020
Chen, Kundu, Zhang, Ma, Fidler, Urtasun (bib67) 2016
Dai, Chen, Yang, Zhang, Yuan, Zhang (bib57) 2021
Liu, Hu, Lin, Yao, Xie, Wei (bib65) 2021
Lu (10.1016/j.array.2023.100305_bib73) 2021
Long (10.1016/j.array.2023.100305_bib30) 2020
Yang (10.1016/j.array.2023.100305_bib121) 2022
Uijlings (10.1016/j.array.2023.100305_bib19) 2013; 104
Song (10.1016/j.array.2023.100305_bib146) 2015
Dosovitskiy (10.1016/j.array.2023.100305_bib144) 2017
Gao (10.1016/j.array.2023.100305_bib56) 2021
Philion (10.1016/j.array.2023.100305_bib174) 2020
Li (10.1016/j.array.2023.100305_bib68) 2019
Tian (10.1016/j.array.2023.100305_bib46) 2019
Redmon (10.1016/j.array.2023.100305_bib26) 2017
Dooley (10.1016/j.array.2023.100305_bib148) 2022
Li (10.1016/j.array.2023.100305_bib134) 2017
Chen (10.1016/j.array.2023.100305_bib67) 2016
Shi (10.1016/j.array.2023.100305_bib72) 2021
Liu (10.1016/j.array.2023.100305_bib162) 2023
Liu (10.1016/j.array.2023.100305_bib178) 2016
Chen (10.1016/j.array.2023.100305_bib82) 2018; 40
Yu (10.1016/j.array.2023.100305_bib117) 2021
Shen (10.1016/j.array.2023.100305_bib81) 2022
Peng (10.1016/j.array.2023.100305_bib86) 2020
Li (10.1016/j.array.2023.100305_bib126) 2020
Fang (10.1016/j.array.2023.100305_bib154) 2023
Huang (10.1016/j.array.2023.100305_bib31) 2021
Weng (10.1016/j.array.2023.100305_bib143) 2020
Law (10.1016/j.array.2023.100305_bib41) 2018
Philion (10.1016/j.array.2023.100305_bib104) 2020
Yan (10.1016/j.array.2023.100305_bib80) 2022
Chopra (10.1016/j.array.2023.100305_bib20) 2023
Kumar (10.1016/j.array.2023.100305_bib75) 2021
Zhang (10.1016/j.array.2023.100305_bib78) 2021
Xu (10.1016/j.array.2023.100305_bib128) 2022
Lienhart (10.1016/j.array.2023.100305_bib1) 2002
Deng (10.1016/j.array.2023.100305_bib137) 2009
Li (10.1016/j.array.2023.100305_bib98) 2020
Li (10.1016/j.array.2023.100305_bib109) 2022
Dwivedi (10.1016/j.array.2023.100305_bib153) 2023
Lowe (10.1016/j.array.2023.100305_bib3) 2004; 60
He (10.1016/j.array.2023.100305_bib5) 2016
Zhu (10.1016/j.array.2023.100305_bib58) 2020
Liu (10.1016/j.array.2023.100305_bib92) 2021
Li (10.1016/j.array.2023.100305_bib106) 2022
Boyd (10.1016/j.array.2023.100305_bib150) 2021
Zhu (10.1016/j.array.2023.100305_bib125) 2019
Zhang (10.1016/j.array.2023.100305_bib161) 2022
Ansari (10.1016/j.array.2023.100305_bib132) 2022
Li (10.1016/j.array.2023.100305_bib107) 2022
Zhang (10.1016/j.array.2023.100305_bib39) 2018
Ma (10.1016/j.array.2023.100305_bib94) 2019
Guo (10.1016/j.array.2023.100305_bib120) 2021
Dai (10.1016/j.array.2023.100305_bib123) 2021
Caesar (10.1016/j.array.2023.100305_bib142) 2020
Girshick (10.1016/j.array.2023.100305_bib18) 2014
Li (10.1016/j.array.2023.100305_bib83) 2019
Li (10.1016/j.array.2023.100305_bib169) 2022; 60
Jocher (10.1016/j.array.2023.100305_bib29) 2020
Chen (10.1016/j.array.2023.100305_bib124) 2018
Wang (10.1016/j.array.2023.100305_bib133) 2017
Shi (10.1016/j.array.2023.100305_bib129) 2020
Zhang (10.1016/j.array.2023.100305_bib170) 2020
Qiu (10.1016/j.array.2023.100305_bib50) 2021
Rashwan (10.1016/j.array.2023.100305_bib43) 2019
Geiger (10.1016/j.array.2023.100305_bib140) 2012
Liu (10.1016/j.array.2023.100305_bib167) 2022
Luo (10.1016/j.array.2023.100305_bib76) 2021
Wang (10.1016/j.array.2023.100305_bib28) 2022
Guo (10.1016/j.array.2023.100305_bib91) 2021
Nie (10.1016/j.array.2023.100305_bib40) 2019
Inoue (10.1016/j.array.2023.100305_bib127) 2018
Sun (10.1016/j.array.2023.100305_bib141) 2020
Girshick (10.1016/j.array.2023.100305_bib22) 2015
Hwang (10.1016/j.array.2023.100305_bib159) 2022
Zou (10.1016/j.array.2023.100305_bib52) 2022
Roh (10.1016/j.array.2023.100305_bib59) 2021
Shuvo (10.1016/j.array.2023.100305_bib171) 2023
Cai (10.1016/j.array.2023.100305_bib25) 2018
Joseph (10.1016/j.array.2023.100305_bib180) 2021
Ye (10.1016/j.array.2023.100305_bib166) 2023
Qin (10.1016/j.array.2023.100305_bib84) 2019
Garg (10.1016/j.array.2023.100305_bib100) 2020
Li (10.1016/j.array.2023.100305_bib53) 2022
Liu (10.1016/j.array.2023.100305_bib65) 2021
Zheng (10.1016/j.array.2023.100305_bib55) 2020
Yan (10.1016/j.array.2023.100305_bib165) 2022; 2023
Wang (10.1016/j.array.2023.100305_bib64) 2022; vol. 8
Beal (10.1016/j.array.2023.100305_bib63) 2020
Dosovitskiy (10.1016/j.array.2023.100305_bib17) 2020
Dalal (10.1016/j.array.2023.100305_bib2) 2005; 1
Guo (10.1016/j.array.2023.100305_bib116) 2020
Yi (10.1016/j.array.2023.100305_bib36) 2019
Chen (10.1016/j.array.2023.100305_bib113) 2019
Liu (10.1016/j.array.2023.100305_bib77) 2020
Sun (10.1016/j.array.2023.100305_bib85) 2020
Dai (10.1016/j.array.2023.100305_bib14) 2016
Krasin (10.1016/j.array.2023.100305_bib139) 2016
Liu (10.1016/j.array.2023.100305_bib6) 2021
Lin (10.1016/j.array.2023.100305_bib138) 2014
Kong (10.1016/j.array.2023.100305_bib155) 2023
Bai (10.1016/j.array.2023.100305_bib172) 2022; 1
Law (10.1016/j.array.2023.100305_bib42) 2019
Carion (10.1016/j.array.2023.100305_bib11) 2020; 2020
Hinton (10.1016/j.array.2023.100305_bib118) 2015
Wang (10.1016/j.array.2023.100305_bib62) 2022; 36
Wang (10.1016/j.array.2023.100305_bib149) 2023
Najibi (10.1016/j.array.2023.100305_bib130) 2020
Chen (10.1016/j.array.2023.100305_bib122) 2021
Man (10.1016/j.array.2023.100305_bib131) 2021
Lin (10.1016/j.array.2023.100305_bib8) 2017
Wang (10.1016/j.array.2023.100305_bib119) 2019
Everingham (10.1016/j.array.2023.100305_bib136) 2010
Ghiasi (10.1016/j.array.2023.100305_bib115) 2019
Wang (10.1016/j.array.2023.100305_bib7) 2021
Mao (10.1016/j.array.2023.100305_bib173) 2019
Bai (10.1016/j.array.2023.100305_bib135) 2018
Dai (10.1016/j.array.2023.100305_bib57) 2021
Li (10.1016/j.array.2023.100305_bib66) 2022
Dai (10.1016/j.array.2023.100305_bib145) 2017
Chen (10.1016/j.array.2023.100305_bib10) 2021
Zhu (10.1016/j.array.2023.100305_bib49) 2020
Ren (10.1016/j.array.2023.100305_bib23) 2017; 39
Redmon (10.1016/j.array.2023.100305_bib27) 2018
Wu (10.1016/j.array.2023.100305_bib16) 2021
Lin (10.1016/j.array.2023.100305_bib168) 2022
Redmon (10.1016/j.array.2023.100305_bib12) 2016
Park (10.1016/j.array.2023.100305_bib111) 2022
Piland (10.1016/j.array.2023.100305_bib151) 2023
You (10.1016/j.array.2023.100305_bib97) 2019
Deng (10.1016/j.array.2023.100305_bib175) 2021
Liang (10.1016/j.array.2023.100305_bib114) 2021
Sun (10.1016/j.array.2023.100305_bib60) 2021
Wang (10.1016/j.array.2023.100305_bib93) 2018
He (10.1016/j.array.2023.100305_bib24) 2017
Liu (10.1016/j.array.2023.100305_bib13) 2016
Wang (10.1016/j.array.2023.100305_bib79) 2021
Zhu (10.1016/j.array.2023.100305_bib47) 2019
Yang (10.1016/j.array.2023.100305_bib110) 2022
Hasan (10.1016/j.array.2023.100305_bib157) 2022
Wang (10.1016/j.array.2023.100305_bib103) 2021
Jeong (10.1016/j.array.2023.100305_bib33) 2017
Zheng (10.1016/j.array.2023.100305_bib34) 2018
Wang (10.1016/j.array.2023.100305_bib95) 2020; 34
Silberman (10.1016/j.array.2023.100305_bib147) 2012
Gupta (10.1016/j.array.2023.100305_bib181) 2022
Xu (10.1016/j.array.2023.100305_bib15) 2022
Chen (10.1016/j.array.2023.100305_bib90) 2023; 45
Liu (10.1016/j.array.2023.100305_bib176) 2017
Cen (10.1016/j.array.2023.100305_bib182) 2021
Bochkovskiy (10.1016/j.array.2023.100305_bib9) 2020
Chandio (10.1016/j.array.2023.100305_bib37) 2022
Liu (10.1016/j.array.2023.100305_bib61) 2022
Pon (10.1016/j.array.2023.100305_bib99) 2020
Chang (10.1016/j.array.2023.100305_bib88) 2018
Vora (10.1016/j.array.2023.100305_bib160) 2023
Yang (10.1016/j.array.2023.100305_bib35) 2019
Qian (10.1016/j.array.2023.100305_bib102) 2020
Ge (10.1016/j.array.2023.100305_bib51) 2021
Najibi (10.1016/j.array.2023.100305_bib177) 2017
He (10.1016/j.array.2023.100305_bib21) 2015; 37
Gilroy (10.1016/j.array.2023.100305_bib156) 2022
Huang (10.1016/j.array.2023.100305_bib105) 2021
Shamsolmoali (10.1016/j.array.2023.100305_bib164) 2022
Simonelli (10.1016/j.array.2023.100305_bib70) 2019
Manhardt (10.1016/j.array.2023.100305_bib69) 2019
Xu (10.1016/j.array.2023.100305_bib101) 2020; 34
Kong (10.1016/j.array.2023.100305_bib48) 2020; 29
Zheng (10.1016/j.array.2023.100305_bib183) 2022
Wang (10.1016/j.array.2023.100305_bib163) 2022; 197
Fu (10.1016/j.array.2023.100305_bib32) 2017
Ye (10.1016/j.array.2023.100305_bib96) 2020
Chang (10.1016/j.array.2023.100305_bib179) 2022
Muhammad (10.1016/j.array.2023.100305_bib152) 2023
Qin (10.1016/j.array.2023.100305_bib71) 2019; 33
Duan (10.1016/j.array.2023.100305_bib44) 2019
Lin (10.1016/j.array.2023.100305_bib158) 2022
Lin (10.1016/j.array.2023.100305_bib38) 2017
Brazil (10.1016/j.array.2023.100305_bib74) 2019
Wang (10.1016/j.array.2023.100305_bib108) 2022
Zong (10.1016/j.array.2023.100305_bib112) 2023
Wang (10.1016/j.array.2023.100305_bib54) 2022
Zhou (10.1016/j.array.2023.100305_bib45) 2019
Peng (10.1016/j.array.2023.100305_bib87) 2022
Chen (10.1016/j.array.2023.100305_bib89) 2020
Krizhevsky (10.1016/j.array.2023.100305_bib4) 2012; 25
References_xml – volume: 2020
  start-page: 213
  year: 2020
  end-page: 229
  ident: bib11
  article-title: End-to-End object detection with transformers
  publication-title: Computer Vision – ECCV
– start-page: 11402
  year: 2020
  end-page: 11411
  ident: bib116
  article-title: Hit-detector: hierarchical trinity architecture search for object detection
  publication-title: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– start-page: 2968
  year: 2021
  end-page: 2977
  ident: bib57
  article-title: Dynamic DETR: end-to-end object detection with dynamic attention
  publication-title: 2021 IEEE/CVF international conference on computer vision (ICCV)
– start-page: 1
  year: 2022
  ident: bib158
  article-title: Pedestrian detection by exemplar-guided contrastive learning
  publication-title: IEEE Trans Image Process
– start-page: 567
  year: 2015
  end-page: 576
  ident: bib146
  article-title: A RGB-D scene understanding benchmark suite
  publication-title: 2015 IEEE conference on computer vision and pattern recognition (CVPR)
– year: 2021
  ident: bib150
  article-title: CYBORG: blending human saliency into the loss improves deep learning
– year: 2021
  ident: bib65
  article-title: Swin transformer V2: scaling up capacity and resolution
– start-page: 225
  year: 2022
  end-page: 234
  ident: bib87
  article-title: SIDE: center-based stereo 3D detector with structure-aware instance depth estimation
  publication-title: 2022 IEEE/CVF winter conference on applications of computer vision (WACV)
– volume: vol. 8
  start-page: 1
  year: 2022
  end-page: 10
  ident: bib64
  publication-title: PVT v2: improved baselines with pyramid vision transformer
– year: 2020
  ident: bib100
  article-title: Wasserstein distances for stereo disparity estimation
– start-page: 8383
  year: 2020
  end-page: 8389
  ident: bib99
  article-title: Object-centric stereo matching for 3D object detection
  publication-title: 2020 IEEE International Conference on Robotics and Automation (ICRA)
– start-page: 746
  year: 2012
  end-page: 760
  ident: bib147
  article-title: Indoor segmentation and support inference from RGBD images
  publication-title: Computer vision – ECCV 2012
– year: 2023
  ident: bib155
  article-title: Enhancing general face forgery detection via vision transformer with low-rank adaptation
– start-page: 3339
  year: 2018
  end-page: 3348
  ident: bib124
  article-title: Domain adaptive faster R-CNN for object detection in the wild
  publication-title: 2018 IEEE/CVF conference on computer vision and pattern recognition
– start-page: 4885
  year: 2017
  end-page: 4894
  ident: bib177
  article-title: SSH: single stage headless face detector
  publication-title: 2017 IEEE international conference on computer vision (ICCV)
– start-page: 913
  year: 2021
  end-page: 922
  ident: bib79
  article-title: FCOS3D: fully convolutional one-stage monocular 3D object detection
  publication-title: IEEE/CVF International Conference on Computer Vision Workshops (ICCVW)2021
– start-page: 1
  year: 2022
  end-page: 18
  ident: bib109
  article-title: BEVFormer: learning bird’s-eye-view representation from multi-camera images via spatiotemporal transformers
– start-page: 573
  year: 2019
  end-page: 582
  ident: bib173
  article-title: A delay metric for video object detection: what average precision fails to tell
  publication-title: 2019 IEEE/CVF international conference on computer vision (ICCV)
– start-page: 58
  year: 2023
  end-page: 73
  ident: bib20
  article-title: Support vector machine
– start-page: 3091
  year: 2021
  end-page: 3101
  ident: bib73
  article-title: Geometry uncertainty projection network for monocular 3D object detection
  publication-title: 2021 IEEE/CVF international conference on computer vision (ICCV)
– year: 2021
  ident: bib175
  article-title: Revisiting 3D object detection from an egocentric perspective
– volume: 40
  start-page: 1259
  year: 2018
  end-page: 1272
  ident: bib82
  article-title: 3D object proposals using stereo imagery for accurate object class detection
  publication-title: IEEE Trans Pattern Anal Mach Intell
– start-page: 4928
  year: 2019
  end-page: 4937
  ident: bib119
  article-title: Distilling object detectors with fine-grained feature imitation
  publication-title: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
– start-page: 4203
  year: 2018
  end-page: 4212
  ident: bib39
  article-title: Single-shot refinement neural network for object detection
  publication-title: 2018 IEEE/CVF conference on computer vision and pattern recognition
– year: 2020
  ident: bib143
  article-title: All-in-one drive: a large-scale comprehensive perception dataset with high-density long-range point clouds
– year: 2017
  ident: bib33
  article-title: Enhancement of SSD by concatenating feature maps for object detection
– start-page: 11618
  year: 2020
  end-page: 11628
  ident: bib142
  article-title: nuScenes: a multimodal dataset for autonomous driving
  publication-title: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– year: 2022
  ident: bib37
  article-title: Precise single-stage detector
– start-page: 13012
  year: 2020
  end-page: 13021
  ident: bib86
  article-title: IDA-3D: instance-depth-aware 3D object detection from stereo vision for autonomous driving
  publication-title: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– start-page: 5001
  year: 2018
  end-page: 5009
  ident: bib127
  article-title: Cross-domain weakly-supervised object detection through progressive domain adaptation
  publication-title: IEEE/CVF conference on computer vision and pattern Recognition2018
– volume: 104
  start-page: 154
  year: 2013
  end-page: 171
  ident: bib19
  article-title: Selective search for object recognition
  publication-title: Int J Comput Vis
– start-page: 2025
  year: 2019
  end-page: 2028
  ident: bib43
  article-title: Matrix nets: a new deep architecture for object detection
  publication-title: 2019 IEEE/CVF international conference on computer vision workshop (ICCVW)
– year: 2023
  ident: bib112
  article-title: Temporal enhanced training of multi-view 3D object detector via historical object prediction
– year: 2022
  ident: bib15
  article-title: PP-YOLOE: an evolved version of YOLO
– start-page: 3133
  year: 2021
  end-page: 3143
  ident: bib91
  article-title: LIGA-stereo: learning LiDAR geometry aware representations for stereo-based 3D detector
  publication-title: 2021 IEEE/CVF international conference on computer vision (ICCV)
– start-page: 3288
  year: 2021
  end-page: 3297
  ident: bib78
  article-title: Objects are different: flexible monocular 3D object detection
  publication-title: 2021 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– start-page: 25
  year: 2022
  end-page: 36
  ident: bib168
  article-title: Attention guided network for salient object detection in optical remote sensing images
  publication-title: Artificial Neural Networks and Machine Learning – ICANN
– start-page: 2999
  year: 2017
  end-page: 3007
  ident: bib38
  article-title: Focal loss for dense object detection
  publication-title: 2017 IEEE international conference on computer vision (ICCV)
– start-page: 7029
  year: 2019
  end-page: 7038
  ident: bib115
  article-title: NAS-FPN: learning scalable feature pyramid architecture for object detection
  publication-title: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
– start-page: 8969
  year: 2021
  end-page: 8979
  ident: bib75
  article-title: GrooMeD-NMS: grouped mathematically differentiable NMS for monocular 3D object detection
  publication-title: 2021 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– start-page: 2766
  year: 2019
  end-page: 2770
  ident: bib35
  article-title: Feature fusion and enhancement for single shot multibox detector. 2019 Chinese automation congress
– start-page: I
  year: 2002
  ident: bib1
  article-title: An extended set of Haar-like features for rapid object detection
  publication-title: Proceedings international conference on image processing
– start-page: 17122
  year: 2022
  end-page: 17131
  ident: bib80
  article-title: ONCE-3DLanes: building monocular 3D lane detection
  publication-title: 2022 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– year: 2021
  ident: bib51
  article-title: YOLOX: exceeding YOLO series in 2021
– start-page: 526
  year: 2022
  end-page: 543
  ident: bib179
  article-title: RFLA: Gaussian receptive field based label assignment for tiny object detection
– start-page: 15152
  year: 2021
  end-page: 15161
  ident: bib72
  article-title: Geometry-based distance decomposition for monocular 3D object detection
  publication-title: IEEE/CVF international conference on computer vision (ICCV)2021
– start-page: 840
  year: 2019
  end-page: 849
  ident: bib47
  article-title: Feature selective anchor-free module for single-shot object detection
  publication-title: 2019 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– year: 2022
  ident: bib54
  article-title: PP-YOLOE-R: an efficient anchor-free rotated object detector
– year: 2021
  ident: bib105
  article-title: BEVDet: high-performance multi-camera 3D object detection in bird-eye-view
– start-page: 1019
  year: 2019
  end-page: 1028
  ident: bib68
  article-title: An efficient 3D object detection framework for autonomous driving
  publication-title: 2019 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– year: 2022
  ident: bib167
  article-title: LSNet: extremely light-weight siamese network for change detection in remote sensing image
– year: 2018
  ident: bib93
  article-title: Pseudo-LiDAR from visual depth estimation: bridging the gap in 3D object detection for autonomous driving
– volume: 37
  start-page: 1904
  year: 2015
  end-page: 1916
  ident: bib21
  article-title: Spatial pyramid pooling in deep convolutional networks for visual recognition
  publication-title: IEEE Trans Pattern Anal Mach Intell
– year: 2021
  ident: bib117
  article-title: PP-PicoDet: a better real-time object detector on mobile devices
– year: 2023
  ident: bib154
  article-title: SynthASpoof: developing face presentation attack detection based on privacy-friendly synthetic data
– start-page: 850
  year: 2019
  end-page: 859
  ident: bib45
  article-title: Bottom-up object detection by grouping extreme and center points
  publication-title: 2019 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– year: 2019
  ident: bib113
  article-title: DetNAS: neural architecture search on object detection
– volume: 33
  start-page: 8851
  year: 2019
  end-page: 8858
  ident: bib71
  article-title: MonoGRNet: a geometric reasoning network for monocular 3D object localization
  publication-title: Proc AAAI Conf Artif Intell
– start-page: 4339
  year: 2021
  end-page: 4348
  ident: bib122
  article-title: Deep structured instance graph for distilling object detectors
  publication-title: IEEE/CVF international conference on computer vision (ICCV)2021
– start-page: 9536
  year: 2019
  end-page: 9545
  ident: bib40
  article-title: Enriched feature guided refinement network for object detection
  publication-title: 2019 IEEE/CVF international conference on computer vision (ICCV)
– start-page: 248
  year: 2009
  end-page: 255
  ident: bib137
  article-title: ImageNet: a large-scale hierarchical image database
  publication-title: 2009 IEEE conference on computer vision and pattern recognition
– volume: 36
  start-page: 2567
  year: 2022
  end-page: 2575
  ident: bib62
  article-title: Anchor DETR: query design for transformer-based detector
  publication-title: Proc AAAI Conf Artif Intell
– year: 2023
  ident: bib171
  article-title: An automated end-to-end deep learning-based framework for lung cancer diagnosis by detecting and classifying the lung nodules
– start-page: 779
  year: 2016
  end-page: 788
  ident: bib12
  article-title: You only look once: unified, real-time object detection
  publication-title: 2016 IEEE conference on computer vision and pattern recognition (CVPR)
– start-page: 4633
  year: 2022
  end-page: 4642
  ident: bib121
  article-title: Focal and global knowledge distillation for detectors
  publication-title: 2022 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– year: 2018
  ident: bib27
  article-title: YOLOv3: an incremental improvement
– start-page: 7636
  year: 2019
  end-page: 7644
  ident: bib83
  article-title: Stereo R-CNN based 3D object detection for autonomous driving
  publication-title: 2019 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– year: 2021
  ident: bib59
  article-title: Sparse DETR: efficient end-to-end object detection with learnable sparsity
– start-page: 9286
  year: 2019
  end-page: 9295
  ident: bib74
  article-title: M3D-RPN: monocular 3D region proposal network for object detection
  publication-title: 2019 IEEE/CVF international conference on computer vision (ICCV)
– year: 2016
  ident: bib14
  article-title: Object detection via region-based fully convolutional networks
– start-page: 5880
  year: 2020
  end-page: 5889
  ident: bib102
  article-title: End-to-End pseudo-LiDAR for image-based 3D object detection
  publication-title: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– volume: 60
  start-page: 1
  year: 2022
  end-page: 12
  ident: bib169
  article-title: Lightweight salient object detection in optical remote sensing images via feature correlation
  publication-title: IEEE Trans Geosci Rem Sens
– start-page: 869
  year: 2021
  end-page: 878
  ident: bib182
  article-title: Open-set 3D object detection
  publication-title: International Conference on 3D Vision
– year: 2019
  ident: bib97
  article-title: Pseudo-LiDAR++: accurate depth for 3D object detection in autonomous driving
– start-page: 6517
  year: 2017
  end-page: 6525
  ident: bib26
  article-title: YOLO9000: better, faster, stronger. 2017 IEEE conference on computer vision and pattern recognition (CVPR)
– start-page: 3601
  year: 2021
  end-page: 3610
  ident: bib56
  article-title: Fast convergence of DETR with spatially modulated Co-attention
  publication-title: 2021 IEEE/CVF international conference on computer vision (ICCV)
– year: 2022
  ident: bib106
  article-title: BEVDepth: acquisition of reliable depth for multi-view 3D object detection
– start-page: 2154
  year: 2021
  end-page: 2164
  ident: bib120
  article-title: Distilling object detectors via decoupled features
  publication-title: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
– start-page: 3591
  year: 2021
  end-page: 3600
  ident: bib60
  article-title: Rethinking transformer-based set prediction for object detection
  publication-title: 2021 IEEE/CVF international conference on computer vision (ICCV)
– year: 2023
  ident: bib153
  article-title: An efficient ensemble explainable AI (XAI) approach for morphed face detection
– start-page: 11910
  year: 2020
  end-page: 11919
  ident: bib130
  article-title: DOPS: learning to detect 3D objects and predict their 3D shapes
  publication-title: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– start-page: 6154
  year: 2018
  end-page: 6162
  ident: bib25
  article-title: Cascade R-CNN: delving into high quality object detection
  publication-title: 2018 IEEE/CVF conference on computer vision and pattern recognition
– start-page: 7838
  year: 2021
  end-page: 7847
  ident: bib123
  article-title: General instance distillation for object detection
  publication-title: 2021 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– year: 2022
  ident: bib161
  article-title: Feature calibration network for occluded pedestrian detection
– volume: 45
  start-page: 4416
  year: 2023
  end-page: 4429
  ident: bib90
  article-title: DSGN++: exploiting visual-spatial relation for stereo-based 3D detectors
  publication-title: IEEE Trans Pattern Anal Mach Intell
– start-page: 110
  year: 2023
  end-page: 119
  ident: bib160
  article-title: Bringing generalization to deep multi-view pedestrian detection
– start-page: 195
  year: 2022
  end-page: 211
  ident: bib81
  article-title: PanoFormer: panorama transformer for indoor 360$$^{\circ }$$ depth estimation
– year: 2017
  ident: bib176
  article-title: Receptive field block net for accurate and Fast object detection
– start-page: 9992
  year: 2021
  end-page: 10002
  ident: bib6
  article-title: Swin transformer: hierarchical vision transformer using shifted windows
  publication-title: 2021 IEEE/CVF international conference on computer vision (ICCV)
– year: 2016
  ident: bib139
  article-title: OpenImages: a public dataset for large-scale multi-label and multi-class image classification
– year: 2023
  ident: bib152
  article-title: Domain generalization via ensemble stacking for face presentation attack detection
– start-page: 936
  year: 2017
  end-page: 944
  ident: bib8
  article-title: Feature pyramid networks for object detection
  publication-title: 2017 IEEE conference on computer vision and pattern recognition (CVPR)
– start-page: 1708
  year: 2020
  end-page: 1716
  ident: bib129
  article-title: Point-GNN: graph neural network for 3D object detection in a point cloud
  publication-title: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– year: 2020
  ident: bib85
  article-title: Disp R-CNN: stereo 3D object detection via shape prior guided instance disparity estimation
– year: 2022
  ident: bib148
  article-title: Robustness disparities in face detection
– volume: 2023
  start-page: 75
  year: 2022
  end-page: 92
  ident: bib165
  article-title: Fully transformer network for change detection of remote sensing images
  publication-title: Computer Vision – ACCV
– volume: 39
  start-page: 1137
  year: 2017
  end-page: 1149
  ident: bib23
  article-title: Towards real-time object detection with region proposal networks
  publication-title: IEEE Trans Pattern Anal Mach Intell
– start-page: 770
  year: 2016
  end-page: 778
  ident: bib5
  article-title: Deep residual learning for image recognition
  publication-title: 2016 IEEE conference on computer vision and pattern recognition (CVPR)
– start-page: 580
  year: 2014
  end-page: 587
  ident: bib18
  article-title: Rich feature hierarchies for accurate object detection and semantic segmentation
  publication-title: 2014 IEEE conference on computer vision and pattern recognition
– volume: 34
  start-page: 12257
  year: 2020
  end-page: 12264
  ident: bib95
  article-title: Task-aware monocular depth estimation for 3D object detection
  publication-title: Proc AAAI Conf Artif Intell
– start-page: 765
  year: 2018
  end-page: 781
  ident: bib41
  article-title: CornerNet: detecting objects as paired keypoints
  publication-title: Computer vision – ECCV 2018
– year: 2022
  ident: bib111
  article-title: Time will tell: new outlooks and A baseline for temporal multi-view 3D object detection
– year: 2022
  ident: bib164
  article-title: Enhanced single-shot detector for small object detection in remote sensing images
– year: 2017
  ident: bib32
  article-title: Dssd : deconvolutional single shot detector
– year: 2017
  ident: bib133
  article-title: A-Fast-RCNN: hard positive generation via adversary for object detection
– start-page: 1951
  year: 2017
  end-page: 1959
  ident: bib134
  article-title: Perceptual generative adversarial networks for small object detection
  publication-title: 2017 IEEE conference on computer vision and pattern recognition (CVPR)
– year: 2020
  ident: bib30
  article-title: PP-YOLO: an effective and efficient implementation of object detector
– year: 2022
  ident: bib52
  article-title: YOLOX-PAI: an improved YOLOX version by PAI
– start-page: 481
  year: 2020
  end-page: 497
  ident: bib126
  article-title: Spatial attention pyramid network for unsupervised domain adaptation
  publication-title: Computer vision – ECCV 2020
– start-page: 7607
  year: 2019
  end-page: 7615
  ident: bib84
  article-title: Triangulation learning network: from monocular to stereo 3D object detection
  publication-title: 2019 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– year: 2020
  ident: bib58
  article-title: Deformable DETR: deformable transformers for end-to-end object detection
– start-page: 280
  year: 2022
  end-page: 296
  ident: bib66
  article-title: Exploring Plain vision transformer backbones for object detection
– year: 2023
  ident: bib151
  article-title: Improving model's focus improves performance of deep learning-based synthetic face detectors
– start-page: 17
  year: 2020
  end-page: 34
  ident: bib96
  article-title: Monocular 3D object detection via feature domain adaptation
– start-page: 21
  year: 2016
  end-page: 37
  ident: bib13
  article-title: SSD: single shot MultiBox detector
  publication-title: Computer Vision – ECCV 2016
– year: 2020
  ident: bib17
  article-title: An image is worth 16x16 words: transformers for image recognition at scale
– year: 2022
  ident: bib159
  article-title: Booster-SHOT: boosting stacked homography transformations for multiview pedestrian detection with attention
– start-page: 14309
  year: 2022
  end-page: 14319
  ident: bib128
  article-title: Holistic and hierarchical feature alignment for cross-domain weakly supervised object detection
  publication-title: 2022 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– start-page: 1160
  year: 2020
  end-page: 1164
  ident: bib170
  article-title: A novel and efficient tumor detection framework for pancreatic cancer via CT images
  publication-title: 42nd annual international conference of the IEEE engineering in medicine & biology society
– start-page: 419
  year: 2022
  end-page: 432
  ident: bib132
  article-title: Angle-based feature learning in GNN for 3D object detection using point cloud
– year: 2023
  ident: bib149
  article-title: EfficientFace: an efficient deep network with feature enhancement for accurate face detection
– start-page: 3960
  year: 2022
  end-page: 3969
  ident: bib183
  article-title: Towards open-set object detection and Discovery
  publication-title: 2022 IEEE/CVF conference on computer vision and pattern recognition workshops (CVPRW)
– volume: 25
  year: 2012
  ident: bib4
  article-title: ImageNet classification with deep convolutional neural networks
  publication-title: Neural Information Processing Systems
– year: 2022
  ident: bib108
  article-title: STS: surround-view temporal stereo for multi-view 3D detection
– volume: 34
  start-page: 12557
  year: 2020
  end-page: 12564
  ident: bib101
  article-title: ZoomNet: Part-aware adaptive zooming neural network for 3D object detection
  publication-title: Proc AAAI Conf Artif Intell
– start-page: 3743
  year: 2021
  end-page: 3752
  ident: bib131
  article-title: Multi-echo LiDAR for 3D object detection
  publication-title: 2021 IEEE/CVF international conference on computer vision (ICCV)
– year: 2020
  ident: bib55
  article-title: End-to-End object detection with adaptive clustering transformer
– start-page: 3354
  year: 2012
  end-page: 3361
  ident: bib140
  article-title: Are we ready for autonomous driving? The KITTI vision benchmark suite
  publication-title: 2012 IEEE conference on computer vision and pattern recognition
– start-page: 3175
  year: 2021
  end-page: 3184
  ident: bib50
  article-title: CrossDet: crossline representation for object detection
  publication-title: 2021 IEEE/CVF international conference on computer vision (ICCV)
– year: 2019
  ident: bib36
  article-title: ASSD: attentive single shot multibox detector
– year: 2020
  ident: bib98
  article-title: Confidence guided stereo 3D object detection with split depth estimation
– start-page: 3383
  year: 2021
  end-page: 3390
  ident: bib103
  article-title: PLUMENet: efficient 3D object detection from stereo images
  publication-title: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
– start-page: 210
  year: 2018
  end-page: 226
  ident: bib135
  publication-title: SOD-MTGAN: Small Object Detection via Multi-Task Generative Adversarial Network: 15th European Conference
– start-page: 13018
  year: 2021
  end-page: 13024
  ident: bib92
  article-title: YOLOStereo3D: a step back to 2D for efficient stereo 3D detection
  publication-title: IEEE International Conference on Robotics and Automation (ICRA)2021
– start-page: 9225
  year: 2022
  end-page: 9234
  ident: bib181
  article-title: OW-DETR: open-world detection transformer
  publication-title: 2022 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– volume: 1
  start-page: 886
  year: 2005
  end-page: 893
  ident: bib2
  article-title: Histograms of oriented gradients for human detection
  publication-title: IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05)2005
– start-page: 5410
  year: 2018
  end-page: 5418
  ident: bib88
  article-title: Pyramid stereo matching network
  publication-title: 2018 IEEE/CVF conference on computer vision and pattern recognition
– year: 2017
  ident: bib144
  article-title: CARLA: an open urban driving simulator
– start-page: 14052
  year: 2020
  end-page: 14061
  ident: bib174
  article-title: Learning to evaluate perception models using planner-centric metrics
  publication-title: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– year: 2021
  ident: bib31
  article-title: PP-YOLOv2: a practical object detector
– start-page: 687
  year: 2019
  end-page: 696
  ident: bib125
  article-title: Adapting object detectors via selective cross-domain alignment
  publication-title: 2019 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– start-page: 2432
  year: 2017
  end-page: 2443
  ident: bib145
  article-title: ScanNet: richly-annotated 3D reconstructions of indoor scenes
  publication-title: 2017 IEEE conference on computer vision and pattern recognition (CVPR)
– start-page: 9626
  year: 2019
  end-page: 9635
  ident: bib46
  article-title: FCOS: fully convolutional one-stage object detection
– start-page: 5826
  year: 2021
  end-page: 5836
  ident: bib180
  article-title: Towards open world object detection
  publication-title: 2021 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– year: 2019
  ident: bib42
  article-title: CornerNet-lite: efficient keypoint based object detection
– start-page: 6141
  year: 2021
  end-page: 6150
  ident: bib76
  article-title: M3DSSD: monocular 3D single stage object detector
  publication-title: 2021 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– start-page: 21
  year: 2016
  end-page: 37
  ident: bib178
  article-title: SSD: single shot MultiBox detector
  publication-title: Computer vision – ECCV 2016
– start-page: 91
  year: 2020
  end-page: 107
  ident: bib49
  article-title: Soft anchor-point object detection
– volume: 1
  start-page: 411
  year: 2022
  end-page: 415
  ident: bib172
  article-title: An end-to-end framework for universal lesion detection with missing annotations
  publication-title: 2022 16th IEEE International Conference on Signal Processing (ICSP)
– start-page: 303
  year: 2010
  end-page: 338
  ident: bib136
  article-title: The pascal visual object classes (VOC) challenge
  publication-title: Int J Comput Vis
– start-page: 1440
  year: 2015
  end-page: 1448
  ident: bib22
  article-title: IEEE international conference on computer vision (ICCV)2015
– start-page: 6850
  year: 2019
  end-page: 6859
  ident: bib94
  article-title: Accurate monocular 3D object detection via color-embedded 3D reconstruction for autonomous driving
– start-page: 2980
  year: 2017
  end-page: 2988
  ident: bib24
  article-title: IEEE international conference on computer vision (ICCV)2017
– year: 2015
  ident: bib118
  article-title: Distilling the knowledge in a neural network
– start-page: 740
  year: 2014
  end-page: 755
  ident: bib138
  article-title: Microsoft COCO: common objects in context
  publication-title: Computer vision – ECCV 2014
– year: 2023
  ident: bib162
  article-title: VLPD: context-aware pedestrian detection via vision-language semantic self-supervision
– start-page: 10190
  year: 2021
  end-page: 10198
  ident: bib114
  article-title: OPANAS: one-shot path aggregation network architecture search for object detection
  publication-title: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)2021
– year: 2022
  ident: bib156
  article-title: The impact of partial occlusion on pedestrian detectability
– volume: 29
  start-page: 7389
  year: 2020
  end-page: 7398
  ident: bib48
  article-title: FoveaBox: beyound anchor-based object detection
  publication-title: IEEE Trans Image Process
– year: 2020
  ident: bib63
  article-title: Toward transformer-based object detection
– year: 2022
  ident: bib28
  article-title: YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
– year: 2022
  ident: bib107
  article-title: BEVStereo: enhancing depth estimation in multi-view 3D object detection with dynamic temporal stereo
– volume: 60
  start-page: 91
  year: 2004
  end-page: 110
  ident: bib3
  article-title: Distinctive image features from scale-invariant keypoints
  publication-title: Int J Comput Vis
– start-page: 4289
  year: 2020
  end-page: 4298
  ident: bib77
  article-title: SMOKE: single-stage monocular 3D object detection via keypoint estimation
  publication-title: IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)2020
– year: 2022
  ident: bib53
  article-title: YOLOv6: a single-stage object detection framework for industrial applications
– year: 2021
  ident: bib16
  article-title: Not all attention is all you need
– start-page: 12533
  year: 2020
  end-page: 12542
  ident: bib89
  article-title: DSGN: deep stereo geometry network for 3D object detection
  publication-title: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– volume: 197
  year: 2022
  ident: bib163
  article-title: Remote sensing image super-resolution and object detection: benchmark and state of the art
  publication-title: Expert Syst Appl
– start-page: 2064
  year: 2019
  end-page: 2073
  ident: bib69
  article-title: ROI-10D: monocular lifting of 2D detection to 6D pose and metric shape
  publication-title: 2019 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– year: 2023
  ident: bib166
  article-title: Adjacent-level feature cross-fusion with 3D CNN for remote sensing image change detection
– start-page: 548
  year: 2021
  end-page: 558
  ident: bib7
  article-title: Pyramid vision transformer: a versatile backbone for dense prediction without convolutions
– year: 2022
  ident: bib61
  article-title: DAB-DETR: dynamic anchor boxes are better queries for DETR
– start-page: 1991
  year: 2019
  end-page: 1999
  ident: bib70
  article-title: Disentangling monocular 3D object detection
  publication-title: IEEE/CVF International Conference on Computer Vision (ICCV)2019
– start-page: 6568
  year: 2019
  end-page: 6577
  ident: bib44
  article-title: CenterNet: keypoint triplets for object detection
  publication-title: 2019 IEEE/CVF international conference on computer vision (ICCV)
– start-page: 2147
  year: 2016
  end-page: 2156
  ident: bib67
  article-title: Monocular 3D object detection for autonomous driving
  publication-title: 2016 IEEE conference on computer vision and pattern recognition (CVPR)
– year: 2021
  ident: bib10
  article-title: You only look one-level feature
– year: 2020
  ident: bib104
  article-title: Lift, splat, shoot: encoding images from arbitrary camera rigs by implicitly unprojecting to 3D
– year: 2020
  ident: bib29
  publication-title: YOLOv5
– year: 2022
  ident: bib157
  article-title: Pedestrian detection: domain generalization, CNNs, transformers and beyond
– year: 2020
  ident: bib9
  article-title: YOLOv4: optimal speed and accuracy of object detection
– start-page: 141
  year: 2018
  ident: bib34
  article-title: Extend the shallow part of single shot multibox detector via convolutional neural network
– start-page: 2443
  year: 2020
  end-page: 2451
  ident: bib141
  article-title: Scalability in perception for autonomous driving: Waymo open dataset
  publication-title: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
– year: 2022
  ident: bib110
  article-title: BEVFormer v2: adapting modern image backbones to bird's-eye-view recognition via perspective supervision
– year: 2020
  ident: 10.1016/j.array.2023.100305_bib104
– start-page: 13018
  year: 2021
  ident: 10.1016/j.array.2023.100305_bib92
  article-title: YOLOStereo3D: a step back to 2D for efficient stereo 3D detection
  publication-title: IEEE International Conference on Robotics and Automation (ICRA)2021
– year: 2020
  ident: 10.1016/j.array.2023.100305_bib63
– start-page: 6517
  year: 2017
  ident: 10.1016/j.array.2023.100305_bib26
– volume: 36
  start-page: 2567
  year: 2022
  ident: 10.1016/j.array.2023.100305_bib62
  article-title: Anchor DETR: query design for transformer-based detector
  publication-title: Proc AAAI Conf Artif Intell
– year: 2017
  ident: 10.1016/j.array.2023.100305_bib133
– start-page: 1
  year: 2022
  ident: 10.1016/j.array.2023.100305_bib158
  article-title: Pedestrian detection by exemplar-guided contrastive learning
  publication-title: IEEE Trans Image Process
– start-page: 9626
  year: 2019
  ident: 10.1016/j.array.2023.100305_bib46
– start-page: 8969
  year: 2021
  ident: 10.1016/j.array.2023.100305_bib75
  article-title: GrooMeD-NMS: grouped mathematically differentiable NMS for monocular 3D object detection
– year: 2023
  ident: 10.1016/j.array.2023.100305_bib171
– volume: 104
  start-page: 154
  issue: 2
  year: 2013
  ident: 10.1016/j.array.2023.100305_bib19
  article-title: Selective search for object recognition
  publication-title: Int J Comput Vis
  doi: 10.1007/s11263-013-0620-5
– start-page: 17
  year: 2020
  ident: 10.1016/j.array.2023.100305_bib96
– start-page: 4885
  year: 2017
  ident: 10.1016/j.array.2023.100305_bib177
  article-title: SSH: single stage headless face detector
– year: 2019
  ident: 10.1016/j.array.2023.100305_bib36
– start-page: 25
  year: 2022
  ident: 10.1016/j.array.2023.100305_bib168
  article-title: Attention guided network for salient object detection in optical remote sensing images
  publication-title: Artificial Neural Networks and Machine Learning – ICANN
  doi: 10.1016/j.neunet.2021.12.003
– volume: 33
  start-page: 8851
  year: 2019
  ident: 10.1016/j.array.2023.100305_bib71
  article-title: MonoGRNet: a geometric reasoning network for monocular 3D object localization
  publication-title: Proc AAAI Conf Artif Intell
– volume: 29
  start-page: 7389
  year: 2020
  ident: 10.1016/j.array.2023.100305_bib48
  article-title: FoveaBox: beyound anchor-based object detection
  publication-title: IEEE Trans Image Process
  doi: 10.1109/TIP.2020.3002345
– start-page: 12533
  year: 2020
  ident: 10.1016/j.array.2023.100305_bib89
  article-title: DSGN: deep stereo geometry network for 3D object detection
– year: 2020
  ident: 10.1016/j.array.2023.100305_bib85
– year: 2023
  ident: 10.1016/j.array.2023.100305_bib151
– year: 2021
  ident: 10.1016/j.array.2023.100305_bib59
– year: 2022
  ident: 10.1016/j.array.2023.100305_bib37
– start-page: 5410
  year: 2018
  ident: 10.1016/j.array.2023.100305_bib88
  article-title: Pyramid stereo matching network
– volume: 34
  start-page: 12557
  year: 2020
  ident: 10.1016/j.array.2023.100305_bib101
  article-title: ZoomNet: Part-aware adaptive zooming neural network for 3D object detection
  publication-title: Proc AAAI Conf Artif Intell
– year: 2020
  ident: 10.1016/j.array.2023.100305_bib29
  publication-title: YOLOv5
– start-page: 7607
  year: 2019
  ident: 10.1016/j.array.2023.100305_bib84
  article-title: Triangulation learning network: from monocular to stereo 3D object detection
– start-page: 779
  year: 2016
  ident: 10.1016/j.array.2023.100305_bib12
  article-title: You only look once: unified, real-time object detection
– year: 2018
  ident: 10.1016/j.array.2023.100305_bib93
– year: 2022
  ident: 10.1016/j.array.2023.100305_bib110
– year: 2023
  ident: 10.1016/j.array.2023.100305_bib112
– year: 2022
  ident: 10.1016/j.array.2023.100305_bib15
– start-page: 840
  year: 2019
  ident: 10.1016/j.array.2023.100305_bib47
  article-title: Feature selective anchor-free module for single-shot object detection
– start-page: 9225
  year: 2022
  ident: 10.1016/j.array.2023.100305_bib181
  article-title: OW-DETR: open-world detection transformer
– year: 2021
  ident: 10.1016/j.array.2023.100305_bib117
– start-page: 573
  year: 2019
  ident: 10.1016/j.array.2023.100305_bib173
  article-title: A delay metric for video object detection: what average precision fails to tell
– year: 2017
  ident: 10.1016/j.array.2023.100305_bib33
– start-page: 4203
  year: 2018
  ident: 10.1016/j.array.2023.100305_bib39
  article-title: Single-shot refinement neural network for object detection
– start-page: 280
  year: 2022
  ident: 10.1016/j.array.2023.100305_bib66
– start-page: 17122
  year: 2022
  ident: 10.1016/j.array.2023.100305_bib80
  article-title: ONCE-3DLanes: building monocular 3D lane detection
– volume: 37
  start-page: 1904
  issue: 9
  year: 2015
  ident: 10.1016/j.array.2023.100305_bib21
  article-title: Spatial pyramid pooling in deep convolutional networks for visual recognition
  publication-title: IEEE Trans Pattern Anal Mach Intell
  doi: 10.1109/TPAMI.2015.2389824
– start-page: 9992
  year: 2021
  ident: 10.1016/j.array.2023.100305_bib6
  article-title: Swin transformer: hierarchical vision transformer using shifted windows
– volume: 2023
  start-page: 75
  year: 2022
  ident: 10.1016/j.array.2023.100305_bib165
  article-title: Fully transformer network for change detection of remote sensing images
  publication-title: Computer Vision – ACCV
– start-page: 936
  year: 2017
  ident: 10.1016/j.array.2023.100305_bib8
  article-title: Feature pyramid networks for object detection
– start-page: 13012
  year: 2020
  ident: 10.1016/j.array.2023.100305_bib86
  article-title: IDA-3D: instance-depth-aware 3D object detection from stereo vision for autonomous driving
– start-page: 91
  year: 2020
  ident: 10.1016/j.array.2023.100305_bib49
– start-page: 5880
  year: 2020
  ident: 10.1016/j.array.2023.100305_bib102
  article-title: End-to-End pseudo-LiDAR for image-based 3D object detection
– start-page: 3383
  year: 2021
  ident: 10.1016/j.array.2023.100305_bib103
  article-title: PLUMENet: efficient 3D object detection from stereo images
  publication-title: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
– volume: 60
  start-page: 91
  issue: 2
  year: 2004
  ident: 10.1016/j.array.2023.100305_bib3
  article-title: Distinctive image features from scale-invariant keypoints
  publication-title: Int J Comput Vis
  doi: 10.1023/B:VISI.0000029664.99615.94
– start-page: I
  year: 2002
  ident: 10.1016/j.array.2023.100305_bib1
  article-title: An extended set of Haar-like features for rapid object detection
– start-page: 2766
  year: 2019
  ident: 10.1016/j.array.2023.100305_bib35
– year: 2022
  ident: 10.1016/j.array.2023.100305_bib157
– year: 2022
  ident: 10.1016/j.array.2023.100305_bib108
– start-page: 110
  year: 2023
  ident: 10.1016/j.array.2023.100305_bib160
– start-page: 11618
  year: 2020
  ident: 10.1016/j.array.2023.100305_bib142
  article-title: nuScenes: a multimodal dataset for autonomous driving
– year: 2021
  ident: 10.1016/j.array.2023.100305_bib10
– start-page: 4339
  year: 2021
  ident: 10.1016/j.array.2023.100305_bib122
  article-title: Deep structured instance graph for distilling object detectors
– start-page: 14052
  year: 2020
  ident: 10.1016/j.array.2023.100305_bib174
  article-title: Learning to evaluate perception models using planner-centric metrics
– start-page: 5826
  year: 2021
  ident: 10.1016/j.array.2023.100305_bib180
  article-title: Towards open world object detection
– start-page: 3960
  year: 2022
  ident: 10.1016/j.array.2023.100305_bib183
  article-title: Towards open-set object detection and Discovery
– start-page: 15152
  year: 2021
  ident: 10.1016/j.array.2023.100305_bib72
  article-title: Geometry-based distance decomposition for monocular 3D object detection
– start-page: 1160
  year: 2020
  ident: 10.1016/j.array.2023.100305_bib170
  article-title: A novel and efficient tumor detection framework for pancreatic cancer via CT images
– year: 2021
  ident: 10.1016/j.array.2023.100305_bib175
– year: 2020
  ident: 10.1016/j.array.2023.100305_bib100
– year: 2022
  ident: 10.1016/j.array.2023.100305_bib111
– year: 2021
  ident: 10.1016/j.array.2023.100305_bib16
– year: 2019
  ident: 10.1016/j.array.2023.100305_bib113
– start-page: 303
  year: 2010
  ident: 10.1016/j.array.2023.100305_bib136
  article-title: The pascal visual object classes (VOC) challenge
  publication-title: Int J Comput Vis
  doi: 10.1007/s11263-009-0275-4
– start-page: 2968
  year: 2021
  ident: 10.1016/j.array.2023.100305_bib57
  article-title: Dynamic DETR: end-to-end object detection with dynamic attention
– year: 2021
  ident: 10.1016/j.array.2023.100305_bib65
– volume: 40
  start-page: 1259
  issue: 5
  year: 2018
  ident: 10.1016/j.array.2023.100305_bib82
  article-title: 3D object proposals using stereo imagery for accurate object class detection
  publication-title: IEEE Trans Pattern Anal Mach Intell
  doi: 10.1109/TPAMI.2017.2706685
– year: 2023
  ident: 10.1016/j.array.2023.100305_bib153
– year: 2017
  ident: 10.1016/j.array.2023.100305_bib176
– start-page: 687
  year: 2019
  ident: 10.1016/j.array.2023.100305_bib125
  article-title: Adapting object detectors via selective cross-domain alignment
– start-page: 419
  year: 2022
  ident: 10.1016/j.array.2023.100305_bib132
– start-page: 3354
  year: 2012
  ident: 10.1016/j.array.2023.100305_bib140
  article-title: Are we ready for autonomous driving? The KITTI vision benchmark suite
– year: 2022
  ident: 10.1016/j.array.2023.100305_bib161
– start-page: 913
  year: 2021
  ident: 10.1016/j.array.2023.100305_bib79
  article-title: FCOS3D: fully convolutional one-stage monocular 3D object detection
  publication-title: IEEE/CVF International Conference on Computer Vision Workshops (ICCVW)2021
– year: 2022
  ident: 10.1016/j.array.2023.100305_bib107
– year: 2020
  ident: 10.1016/j.array.2023.100305_bib98
– year: 2022
  ident: 10.1016/j.array.2023.100305_bib148
– year: 2017
  ident: 10.1016/j.array.2023.100305_bib32
– start-page: 195
  year: 2022
  ident: 10.1016/j.array.2023.100305_bib81
– start-page: 3339
  year: 2018
  ident: 10.1016/j.array.2023.100305_bib124
  article-title: Domain adaptive faster R-CNN for object detection in the wild
– start-page: 2980
  year: 2017
  ident: 10.1016/j.array.2023.100305_bib24
– volume: 60
  start-page: 1
  year: 2022
  ident: 10.1016/j.array.2023.100305_bib169
  article-title: Lightweight salient object detection in optical remote sensing images via feature correlation
  publication-title: IEEE Trans Geosci Rem Sens
– year: 2020
  ident: 10.1016/j.array.2023.100305_bib9
– start-page: 3288
  year: 2021
  ident: 10.1016/j.array.2023.100305_bib78
  article-title: Objects are different: flexible monocular 3D object detection
– year: 2020
  ident: 10.1016/j.array.2023.100305_bib143
– year: 2021
  ident: 10.1016/j.array.2023.100305_bib51
– start-page: 6850
  year: 2019
  ident: 10.1016/j.array.2023.100305_bib94
– start-page: 3743
  year: 2021
  ident: 10.1016/j.array.2023.100305_bib131
  article-title: Multi-echo LiDAR for 3D object detection
– start-page: 580
  year: 2014
  ident: 10.1016/j.array.2023.100305_bib18
  article-title: Rich feature hierarchies for accurate object detection and semantic segmentation
– year: 2022
  ident: 10.1016/j.array.2023.100305_bib54
– start-page: 7838
  year: 2021
  ident: 10.1016/j.array.2023.100305_bib123
  article-title: General instance distillation for object detection
– volume: 34
  start-page: 12257
  year: 2020
  ident: 10.1016/j.array.2023.100305_bib95
  article-title: Task-aware monocular depth estimation for 3D object detection
  publication-title: Proc AAAI Conf Artif Intell
– year: 2022
  ident: 10.1016/j.array.2023.100305_bib28
– start-page: 2154
  year: 2021
  ident: 10.1016/j.array.2023.100305_bib120
  article-title: Distilling object detectors via decoupled features
  publication-title: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
  doi: 10.1109/CVPR46437.2021.00219
– year: 2023
  ident: 10.1016/j.array.2023.100305_bib155
– start-page: 14309
  year: 2022
  ident: 10.1016/j.array.2023.100305_bib128
  article-title: Holistic and hierarchical feature alignment for cross-domain weakly supervised object detection
– year: 2019
  ident: 10.1016/j.array.2023.100305_bib42
– year: 2020
  ident: 10.1016/j.array.2023.100305_bib30
– start-page: 6141
  year: 2021
  ident: 10.1016/j.array.2023.100305_bib76
  article-title: M3DSSD: monocular 3D single stage object detector
– year: 2022
  ident: 10.1016/j.array.2023.100305_bib106
– start-page: 526
  year: 2022
  ident: 10.1016/j.array.2023.100305_bib179
– start-page: 3175
  year: 2021
  ident: 10.1016/j.array.2023.100305_bib50
  article-title: CrossDet: crossline representation for object detection
– start-page: 8383
  year: 2020
  ident: 10.1016/j.array.2023.100305_bib99
  article-title: Object-centric stereo matching for 3D object detection
  publication-title: 2020 IEEE International Conference on Robotics and Automation (ICRA)
  doi: 10.1109/ICRA40945.2020.9196660
– year: 2021
  ident: 10.1016/j.array.2023.100305_bib150
– volume: 197
  year: 2022
  ident: 10.1016/j.array.2023.100305_bib163
  article-title: Remote sensing image super-resolution and object detection: benchmark and state of the art
  publication-title: Expert Syst Appl
  doi: 10.1016/j.eswa.2022.116793
– year: 2022
  ident: 10.1016/j.array.2023.100305_bib167
– start-page: 740
  year: 2014
  ident: 10.1016/j.array.2023.100305_bib138
  article-title: Microsoft COCO: common objects in context
– year: 2022
  ident: 10.1016/j.array.2023.100305_bib159
– year: 2023
  ident: 10.1016/j.array.2023.100305_bib152
– start-page: 9286
  year: 2019
  ident: 10.1016/j.array.2023.100305_bib74
  article-title: M3D-RPN: monocular 3D region proposal network for object detection
– start-page: 1708
  year: 2020
  ident: 10.1016/j.array.2023.100305_bib129
  article-title: Point-GNN: graph neural network for 3D object detection in a point cloud
– year: 2016
  ident: 10.1016/j.array.2023.100305_bib14
– year: 2017
  ident: 10.1016/j.array.2023.100305_bib144
– start-page: 2432
  year: 2017
  ident: 10.1016/j.array.2023.100305_bib145
  article-title: ScanNet: richly-annotated 3D reconstructions of indoor scenes
– start-page: 869
  issue: 3DV
  year: 2021
  ident: 10.1016/j.array.2023.100305_bib182
  article-title: Open-set 3D object detection
  publication-title: International Conference on 3D Vision
– start-page: 1951
  year: 2017
  ident: 10.1016/j.array.2023.100305_bib134
  article-title: Perceptual generative adversarial networks for small object detection
– start-page: 9536
  year: 2019
  ident: 10.1016/j.array.2023.100305_bib40
  article-title: Enriched feature guided refinement network for object detection
– volume: 45
  start-page: 4416
  issue: 4
  year: 2023
  ident: 10.1016/j.array.2023.100305_bib90
  article-title: DSGN++: exploiting visual-spatial relation for stereo-based 3D detectors
  publication-title: IEEE Trans Pattern Anal Mach Intell
– start-page: 248
  year: 2009
  ident: 10.1016/j.array.2023.100305_bib137
  article-title: ImageNet: a large-scale hierarchical image database
– start-page: 4633
  year: 2022
  ident: 10.1016/j.array.2023.100305_bib121
  article-title: Focal and global knowledge distillation for detectors
– start-page: 4928
  year: 2019
  ident: 10.1016/j.array.2023.100305_bib119
  article-title: Distilling object detectors with fine-grained feature imitation
  publication-title: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
  doi: 10.1109/CVPR.2019.00507
– start-page: 567
  year: 2015
  ident: 10.1016/j.array.2023.100305_bib146
  article-title: A RGB-D scene understanding benchmark suite
– start-page: 746
  year: 2012
  ident: 10.1016/j.array.2023.100305_bib147
  article-title: Indoor segmentation and support inference from RGBD images
– year: 2023
  ident: 10.1016/j.array.2023.100305_bib149
– year: 2022
  ident: 10.1016/j.array.2023.100305_bib164
– year: 2022
  ident: 10.1016/j.array.2023.100305_bib52
– year: 2020
  ident: 10.1016/j.array.2023.100305_bib17
– start-page: 1
  year: 2022
  ident: 10.1016/j.array.2023.100305_bib109
– start-page: 4289
  year: 2020
  ident: 10.1016/j.array.2023.100305_bib77
  article-title: SMOKE: single-stage monocular 3D object detection via keypoint estimation
  publication-title: IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)2020
– year: 2023
  ident: 10.1016/j.array.2023.100305_bib162
– start-page: 2443
  year: 2020
  ident: 10.1016/j.array.2023.100305_bib141
  article-title: Scalability in perception for autonomous driving: Waymo open dataset
– start-page: 765
  year: 2018
  ident: 10.1016/j.array.2023.100305_bib41
  article-title: CornerNet: detecting objects as paired keypoints
– start-page: 58
  year: 2023
  ident: 10.1016/j.array.2023.100305_bib20
– start-page: 1440
  year: 2015
  ident: 10.1016/j.array.2023.100305_bib22
– start-page: 7636
  year: 2019
  ident: 10.1016/j.array.2023.100305_bib83
  article-title: Stereo R-CNN based 3D object detection for autonomous driving
– start-page: 6154
  year: 2018
  ident: 10.1016/j.array.2023.100305_bib25
  article-title: Cascade R-CNN: delving into high quality object detection
– start-page: 141
  year: 2018
  ident: 10.1016/j.array.2023.100305_bib34
– year: 2016
  ident: 10.1016/j.array.2023.100305_bib139
– start-page: 850
  year: 2019
  ident: 10.1016/j.array.2023.100305_bib45
  article-title: Bottom-up object detection by grouping extreme and center points
– start-page: 10190
  year: 2021
  ident: 10.1016/j.array.2023.100305_bib114
  article-title: OPANAS: one-shot path aggregation network architecture search for object detection
  publication-title: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)2021
– year: 2023
  ident: 10.1016/j.array.2023.100305_bib154
– volume: 1
  start-page: 411
  year: 2022
  ident: 10.1016/j.array.2023.100305_bib172
  article-title: An end-to-end framework for universal lesion detection with missing annotations
  publication-title: 2022 16th IEEE International Conference on Signal Processing (ICSP)
  doi: 10.1109/ICSP56322.2022.9965335
– start-page: 11402
  year: 2020
  ident: 10.1016/j.array.2023.100305_bib116
  article-title: Hit-detector: hierarchical trinity architecture search for object detection
– start-page: 1991
  year: 2019
  ident: 10.1016/j.array.2023.100305_bib70
  article-title: Disentangling monocular 3D object detection
  publication-title: IEEE/CVF International Conference on Computer Vision (ICCV)2019
– start-page: 3591
  year: 2021
  ident: 10.1016/j.array.2023.100305_bib60
  article-title: Rethinking transformer-based set prediction for object detection
– start-page: 21
  year: 2016
  ident: 10.1016/j.array.2023.100305_bib178
  article-title: SSD: single shot MultiBox detector
– start-page: 2999
  year: 2017
  ident: 10.1016/j.array.2023.100305_bib38
  article-title: Focal loss for dense object detection
– year: 2020
  ident: 10.1016/j.array.2023.100305_bib55
– volume: 25
  year: 2012
  ident: 10.1016/j.array.2023.100305_bib4
  article-title: ImageNet classification with deep convolutional neural networks
  publication-title: Neural Information Processing Systems
– year: 2019
  ident: 10.1016/j.array.2023.100305_bib97
– start-page: 7029
  year: 2019
  ident: 10.1016/j.array.2023.100305_bib115
  article-title: NAS-FPN: learning scalable feature pyramid architecture for object detection
  publication-title: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
  doi: 10.1109/CVPR.2019.00720
– start-page: 2064
  year: 2019
  ident: 10.1016/j.array.2023.100305_bib69
  article-title: ROI-10D: monocular lifting of 2D detection to 6D pose and metric shape
– year: 2023
  ident: 10.1016/j.array.2023.100305_bib166
– volume: 39
  start-page: 1137
  issue: 6
  year: 2017
  ident: 10.1016/j.array.2023.100305_bib23
  article-title: Towards real-time object detection with region proposal networks
  publication-title: IEEE Trans Pattern Anal Mach Intell
  doi: 10.1109/TPAMI.2016.2577031
– start-page: 11910
  year: 2020
  ident: 10.1016/j.array.2023.100305_bib130
  article-title: DOPS: learning to detect 3D objects and predict their 3D shapes
– start-page: 3133
  year: 2021
  ident: 10.1016/j.array.2023.100305_bib91
  article-title: LIGA-stereo: learning LiDAR geometry aware representations for stereo-based 3D detector
– volume: 2020
  start-page: 213
  year: 2020
  ident: 10.1016/j.array.2023.100305_bib11
  article-title: End-to-End object detection with transformers
  publication-title: Computer Vision – ECCV
– start-page: 770
  year: 2016
  ident: 10.1016/j.array.2023.100305_bib5
  article-title: Deep residual learning for image recognition
– year: 2021
  ident: 10.1016/j.array.2023.100305_bib31
– year: 2022
  ident: 10.1016/j.array.2023.100305_bib61
– year: 2015
  ident: 10.1016/j.array.2023.100305_bib118
– start-page: 21
  year: 2016
  ident: 10.1016/j.array.2023.100305_bib13
  article-title: SSD: single shot MultiBox detector
  publication-title: Computer Vision – ECCV 2016
  doi: 10.1007/978-3-319-46448-0_2
– year: 2018
  ident: 10.1016/j.array.2023.100305_bib27
– volume: vol. 8
  start-page: 1
  year: 2022
  ident: 10.1016/j.array.2023.100305_bib64
– start-page: 225
  year: 2022
  ident: 10.1016/j.array.2023.100305_bib87
  article-title: SIDE: center-based stereo 3D detector with structure-aware instance depth estimation
– start-page: 1019
  year: 2019
  ident: 10.1016/j.array.2023.100305_bib68
  article-title: An efficient 3D object detection framework for autonomous driving
– start-page: 3091
  year: 2021
  ident: 10.1016/j.array.2023.100305_bib73
  article-title: Geometry uncertainty projection network for monocular 3D object detection
– start-page: 6568
  year: 2019
  ident: 10.1016/j.array.2023.100305_bib44
  article-title: CenterNet: keypoint triplets for object detection
– start-page: 2147
  year: 2016
  ident: 10.1016/j.array.2023.100305_bib67
  article-title: Monocular 3D object detection for autonomous driving
– year: 2022
  ident: 10.1016/j.array.2023.100305_bib156
– volume: 1
  start-page: 886
  year: 2005
  ident: 10.1016/j.array.2023.100305_bib2
  article-title: Histograms of oriented gradients for human detection
  publication-title: IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05)2005
  doi: 10.1109/CVPR.2005.177
– year: 2022
  ident: 10.1016/j.array.2023.100305_bib53
– year: 2021
  ident: 10.1016/j.array.2023.100305_bib105
– start-page: 548
  year: 2021
  ident: 10.1016/j.array.2023.100305_bib7
– year: 2020
  ident: 10.1016/j.array.2023.100305_bib58
– start-page: 5001
  year: 2018
  ident: 10.1016/j.array.2023.100305_bib127
  article-title: Cross-domain weakly-supervised object detection through progressive domain adaptation
– start-page: 481
  year: 2020
  ident: 10.1016/j.array.2023.100305_bib126
  article-title: Spatial attention pyramid network for unsupervised domain adaptation
– start-page: 210
  year: 2018
  ident: 10.1016/j.array.2023.100305_bib135
  publication-title: SOD-MTGAN: Small Object Detection via Multi-Task Generative Adversarial Network: 15th European Conference
– start-page: 2025
  year: 2019
  ident: 10.1016/j.array.2023.100305_bib43
  article-title: Matrix nets: a new deep architecture for object detection
– start-page: 3601
  year: 2021
  ident: 10.1016/j.array.2023.100305_bib56
  article-title: Fast convergence of DETR with spatially modulated Co-attention
SSID ssj0002511158
Score 2.4816787
SecondaryResourceType review_article
Snippet Object detection is a crucial branch of computer vision that aims to locate and classify objects in images. Using deep convolutional neural networks (CNNs) as...
SourceID doaj
crossref
elsevier
SourceType Open Website
Enrichment Source
Index Database
Publisher
StartPage 100305
SubjectTerms CNNs
Object detection
Transformer
Title 2D and 3D object detection algorithms from images: A Survey
URI https://dx.doi.org/10.1016/j.array.2023.100305
https://doaj.org/article/52738a24e3ee4601a06a235ec7d0a2b9
Volume 19
WOSCitedRecordID wos001043834800001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVAON
  databaseName: DOAJ Directory of Open Access Journals
  customDbUrl:
  eissn: 2590-0056
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0002511158
  issn: 2590-0056
  databaseCode: DOA
  dateStart: 20190101
  isFulltext: true
  titleUrlDefault: https://www.doaj.org/
  providerName: Directory of Open Access Journals
– providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources
  customDbUrl:
  eissn: 2590-0056
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0002511158
  issn: 2590-0056
  databaseCode: M~E
  dateStart: 20190101
  isFulltext: true
  titleUrlDefault: https://road.issn.org
  providerName: ISSN International Centre
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV07T8MwELYQYmDhjSgveWAkkMSuncBUXmKhQgKkbpEdn6GopChtkbrw2_E5SZUJFpYMkXOOLhf7O-vu-wg50WnEcxnbQMbYkgMyD5KEGQfkjEu1lUUQ68UmZL-fDAbpY0vqC2vCKnrgynHnniBMxRwYAHfZgwqFilkXcmlCFWvfuudQTyuZwjUYgXPkxTkdvMfW6a5oKId8cZcqSzU_Q-lwLBNgKF7X2pY8e39rd2rtOHcbZK2GirRXveImWYJii6w3Mgy0_iu3yWV8Q1VhKLuhY43HKtTA1FdYFVSNXscu_X_7mFBsJKHDD7d-TC5ojz7Nyi-Y75CXu9vn6_ug1kQIch7xaaASa1MtDbdcWBE69NVFSMOFEUpbmYYWmFE20pJDgthOh10DgudhqkGHnO2S5WJcwB6hSPRiuTMYWcEjAMVEqCNg3Fk3TMYdEjcuyfKaMBx1K0ZZUxn2nnk_ZujHrPJjh5wuHvqs-DJ-H36Fvl4MRbJrf8OFQFaHQPZXCHSIaL5UVuOGCg84U8PfZt__j9kPyCqarOrODsnytJzBEVnJv6bDSXnsw9JdH75vfwA9z-Lt
linkProvider Directory of Open Access Journals
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=2D+and+3D+object+detection+algorithms+from+images%3A+A+Survey&rft.jtitle=Array+%28New+York%29&rft.au=Chen%2C+Wei&rft.au=Li%2C+Yan&rft.au=Tian%2C+Zijian&rft.au=Zhang%2C+Fan&rft.date=2023-09-01&rft.pub=Elsevier+Inc&rft.issn=2590-0056&rft.eissn=2590-0056&rft.volume=19&rft_id=info:doi/10.1016%2Fj.array.2023.100305&rft.externalDocID=S2590005623000309
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2590-0056&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2590-0056&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2590-0056&client=summon