Search Results - "Computer Science - Computer Vision and Pattern Recognition"

1

Loading…

DINOv2: Learning Robust Visual Features without Supervision by Oquab, Maxime, Darcet, Timothée, Moutakanni, Théo, Vo, Huy, Szafraniec, Marc, Khalidov, Vasil, Fernandez, Pierre, Haziza, Daniel, Massa, Francisco, El-Nouby, Alaaeldin, Assran, Mahmoud, Ballas, Nicolas, Galuba, Wojciech, Howes, Russell, Huang, Po-Yao, Li, Shang-Wen, Misra, Ishan, Rabbat, Michael, Sharma, Vasu, Synnaeve, Gabriel, Xu, Hu, Jegou, Hervé, Mairal, Julien, Labatut, Patrick, Joulin, Armand, Bojanowski, Piotr

ISSN: 2835-8856

Published: [Amherst Massachusetts]: OpenReview.net, 2022 2024

Published in Transactions on Machine Learning Research Journal (2024)
“…The recent breakthroughs in natural language processing for model pretraining on large quantities of data have opened the way for similar foundation models in…”

Get full text

Journal Article

Save to List

Saved in:
2

Loading…

Analysis of Classifier-Free Guidance Weight Schedulers by Wang, Xi, Dufour, Nicolas, Andreou, Nefeli, Cani, Marie-Paule, Fernández Abrevaya, Victoria, Picard, David, Kalogeiton, Vicky

ISSN: 2835-8856

Published: [Amherst Massachusetts]: OpenReview.net, 2022 2024

Published in Transactions on Machine Learning Research Journal (2024)
“…Classifier-Free Guidance (CFG) enhances the quality and condition adherence of text-toimage diffusion models. It operates by combining the conditional and…”

Get full text

Journal Article

Save to List

Saved in:
3

Loading…

Quantifying Societal Bias Amplification in Image Captioning by Hirota, Yusuke, Nakashima, Yuta, Garcia, Noa

ISSN: 1063-6919

Published: IEEE 01.06.2022

Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2022)
“…We study societal bias amplification in image captioning. Image captioning models have been shown to perpetuate gender and racial biases, however, metrics to…”

Get full text

Conference Proceeding

Save to List

Saved in:
4

Loading…

Learning Self-Prior for Mesh Inpainting Using Self-Supervised Graph Convolutional Networks by Shota Hattori, Tatsuya Yatagawa, Yutaka Ohtake, Hiromasa Suzuki

ISSN: 1077-2626, 2160-9306

Published: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2024

Published in IEEE Transactions on Visualization and Computer Graphics (01.01.2024)

Get full text

Journal Article

Save to List

Saved in:
5

Loading…

HFGaussian: Learning Generalizable Gaussian Human with Integrated Human Features by Dey, Arnab, Lu, Cheng-You, Comport, Andrew I., Sridhar, Srinath, Lin, Chin-Teng, Martinet, Jean

ISSN: 2691-4581

Published: IEEE 09.12.2025

Published in IEEE transactions on artificial intelligence (09.12.2025)
“…Recent advancements in radiance field rendering show promising results in 3D scene representation, where Gaussian splatting-based techniques emerge as…”

Get full text

Journal Article

Save to List

Saved in:
6

Loading…

SoccerNet 2023 challenges results by Cioppa, Anthony, Giancola, Silvio, Somers, Vladimir, Magera, Floriane, Zhou, Xin, Mkhallati, Hassan, Deliège, Adrien, Held, Jan, Hinojosa, Carlos, Mansourian, Amir M., Miralles, Pierre, Barnich, Olivier, De Vleeschouwer, Christophe, Alahi, Alexandre, Ghanem, Bernard, Van Droogenbroeck, Marc, Kamal, Abdullah, Maglo, Adrien, Clapés, Albert, Abdelaziz, Amr, Xarles, Artur, Orcesi, Astrid, Scott, Atom, Liu, Bin, Lim, Byoungkwon, Chen, Chen, Deuser, Fabian, Yan, Feng, Yu, Fufu, Shitrit, Gal, Wang, Guanshuo, Choi, Gyusik, Kim, Hankyul, Guo, Hao, Fahrudin, Hasby, Koguchi, Hidenari, Ardö, Håkan, Salah, Ibrahim, Yerushalmy, Ido, Muhammad, Iftikar, Uchida, Ikuma, Be’ery, Ishay, Rabarisoa, Jaonary, Lee, Jeongae, Fu, Jiajun, Yin, Jianqin, Xu, Jinghang, Nang, Jongho, Denize, Julien, Li, Junjie, Zhang, Junpei, Kim, Juntae, Synowiec, Kamil, Kobayashi, Kenji, Zhang, Kexin, Habel, Konrad, Nakajima, Kota, Jiao, Licheng, Ma, Lin, Wang, Lizhi, Wang, Luping, Li, Menglong, Zhou, Mengying, Nasr, Mohamed, Abdelwahed, Mohamed, Liashuha, Mykola, Falaleev, Nikolay, Oswald, Norbert, Jia, Qiong, Pham, Quoc-Cuong, Song, Ran, Hérault, Romain, Peng, Rui, Chen, Ruilong, Liu, Ruixuan, Baikulov, Ruslan, Fukushima, Ryuto, Escalera, Sergio, Lee, Seungcheon, Chen, Shimin, Ding, Shouhong, Someya, Taiga, Moeslund, Thomas B., Li, Tianjiao, Shen, Wei, Zhang, Wei, Li, Wei, Dai, Wei, Luo, Weixin, Zhao, Wending, Zhang, Wenjie, Yang, Xinquan, Ma, Yanbiao, Joo, Yeeun, Zeng, Yingsen, Gan, Yiyang, Zhu, Yongqiang, Zhong, Yujie, Ruan, Zheng, Li, Zhiheng

ISSN: 1369-7072, 1460-2687, 1460-2687

Published: Heidelberg Springer Nature B.V 01.12.2024

Published in Sports engineering (01.12.2024)
“…The SoccerNet 2023 challenges were the third annual video understanding challenges organized by the SoccerNet team. For this third edition, the challenges were…”

Get full text

Journal Article

Save to List

Saved in:
7

Loading…

Reproducibility Study on Adversarial Attacks Against Robust Transformer Trackers by Nokabadi, Fatemeh Nourilenjan, Lalonde, Jean-François, Gagné, Christian

ISSN: 2835-8856

Published: [Amherst Massachusetts]: OpenReview.net, 2022 03.06.2024

Published in Transactions on Machine Learning Research Journal (03.06.2024)
“…New transformer networks have been integrated into object tracking pipelines and have demonstrated strong performance on the latest benchmarks. This paper…”

Get full text

Journal Article

Save to List

Saved in:
8

Loading…

CNN-based real-time 2D-3D deformable registration from a single X-ray projection by Lecomte, Francois, Dillenseger, Jean-Louis, Cotin, Stéphane

ISSN: 2077-0383, 2077-0383

Published: MDPI 01.03.2024

Published in Journal of clinical medicine (01.03.2024)
“…Purpose: The purpose of this paper is to present a method for realtime 2D-3D non-rigid registration using a single fluoroscopic image. Such a method can find…”

Get full text

Journal Article

Save to List

Saved in:
9

Loading…

CSE: Surface Anomaly Detection with Contrastively Selected Embedding by Thomine, Simon, Snoussi, Hichem

ISSN: 2510-523X

Published: Hindawi/SpringerOpen 04.03.2024

Published in EURASIP Journal on Information Security (04.03.2024)
“…Detecting surface anomalies of industrial materials poses a significant challenge within a myriad of industrial manufacturing processes. In recent times,…”

Get full text

Journal Article

Save to List

Saved in:
10

Loading…

Incorporating Crowdsourced Annotator Distributions into Ensemble Modeling to Improve Classification Trustworthiness for Ancient Greek Papyri by West, Graham, Swindall, Matthew I., Keener, Ben, Player, Timothy, Williams, Alex C., Brusuelas, James H., Wallin, John F.

ISSN: 2416-5999, 2416-5999

Published: Nicolas Turenne 07.02.2024

Published in Journal of data mining and digital humanities (07.02.2024)
“…Performing classification on noisy, crowdsourced image datasets can prove challenging even for the best neural networks. Two issues which complicate the…”

Get full text

Journal Article

Save to List

Saved in:
11

Loading…

The Challenges of HTR Model Training: Feedback from the Project Donner le gout de l'archive a l'ere numerique by Couture, Beatrice, Verret, Farah, Gohier, Maxime, Deslandres, Dominique

ISSN: 2416-5999, 2416-5999

Published: Nicolas Turenne 06.12.2023

Published in Journal of data mining and digital humanities (06.12.2023)
“…The arrival of handwriting recognition technologies offers new possibilities for research in heritage studies. However, it is now necessary to reflect on the…”

Get full text

Journal Article

Save to List

Saved in:
12

Loading…

Combining Morphological and Histogram based Text Line Segmentation in the OCR Context by Schneider, Pit

ISSN: 2416-5999, 2416-5999

Published: Nicolas Turenne 04.11.2021

Published in Journal of data mining and digital humanities (04.11.2021)
“…Text line segmentation is one of the pre-stages of modern optical character recognition systems. The algorithmic approach proposed by this paper has been…”

Get full text

Journal Article

Save to List

Saved in:
13

Loading…

Deep Learning of Crystalline Defects from TEM images: A Solution for the Problem of" Never Enough Training Data" by Govind, Kishan, Oliveros, Daniela, Dlouhy, Antonin, Legros, Marc, Sandfeld, Stefan

ISSN: 2632-2153

Published: IOP Publishing Ltd 12.07.2023

Published in Machine learning: science and technology (12.07.2023)
“…Crystalline defects, such as line-like dislocations, play an important role for the performance and reliability of many metallic devices. Their interaction and…”

Get full text

Journal Article

Save to List

Saved in:
14

Loading…

AVI-Talking: Learning Audio-Visual Instructions for Expressive 3D Talking Face Generation by Sun, Yasheng, Chu, Wenqing, Zhou, Hang, Wang, Kaisiyuan, Koike, Hideki

ISSN: 2169-3536, 2169-3536

Published: Piscataway IEEE 2024

Published in IEEE Access (2024)
“…While considerable progress has been made in achieving accurate lip synchronization for 3D speech-driven talking face generation, the task of incorporating…”

Get full text

Journal Article

Save to List

Saved in:
15

Loading…

Eigenlanes: Data-Driven Lane Descriptors for Structurally Diverse Lanes by Jin, Dongkwon, Park, Wonhui, Jeong, Seong-Gyun, Kwon, Heeyeon, Kim, Chang-Su

ISSN: 1063-6919

Published: IEEE 01.06.2022

Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2022)
“…A novel algorithm to detect road lanes in the eigen-lane space is proposed in this paper. First, we introduce the notion of eigenlanes, which are data-driven…”

Get full text

Conference Proceeding

Save to List

Saved in:
16

Loading…

Physical Adversarial Attack Meets Computer Vision: A Decade Survey by Hui Wei, Hao Tang, Xuemei Jia, Zhixiang Wang, Hanxun Yu, Zhubo Li, Shin’ichi Satoh, Luc Van Gool, Zheng Wang

ISSN: 0162-8828, 1939-3539

Published: Institute of Electrical and Electronics Engineers (IEEE) 01.12.2024

Published in IEEE Transactions on Pattern Analysis and Machine Intelligence (01.12.2024)

Get full text

Journal Article

Save to List

Saved in:
17

Loading…

Dual-Pixel Raindrop Removal by Yizhou Li, Yusuke Monno, Masatoshi Okutomi

ISSN: 0162-8828, 1939-3539

Published: Institute of Electrical and Electronics Engineers (IEEE) 01.12.2024

Published in IEEE Transactions on Pattern Analysis and Machine Intelligence (01.12.2024)

Get full text

Journal Article

Save to List

Saved in:
18

Loading…

Computer Analysis of Architecture Using Automatic Image Understanding by Wei, Fan, Li, Yuan, Shamir, Lior

ISSN: 2416-5999, 2416-5999

Published: Nicolas Turenne 22.01.2019

Published in Journal of data mining and digital humanities (22.01.2019)
“…In the past few years, computer vision and pattern recognition systems have been becoming increasingly more powerful, expanding the range of automatic tasks…”

Get full text

Journal Article

Save to List

Saved in:
19

Loading…

Computer Analysis of Architecture Using Automatic Image Understanding by Fan Wei, Yuan Li, Lior Shamir

ISSN: 2416-5999, 2416-5999

Published: Nicolas Turenne 01.01.2019

Published in Journal of data mining and digital humanities (01.01.2019)
“…In the past few years, computer vision and pattern recognition systems have been becoming increasingly more powerful, expanding the range of automatic tasks…”

Get full text

Journal Article

Save to List

Saved in:
20

Loading…

SpectralGPT: Spectral Remote Sensing Foundation Model by Danfeng Hong, Bing Zhang, Xuyang Li, Yuxuan Li, Chenyu Li, Jing Yao, Naoto Yokoya, Hao Li, Pedram Ghamisi, Xiuping Jia, Antonio Plaza, Paolo Gamba, Jon Atli Benediktsson, Jocelyn Chanussot

ISSN: 0162-8828, 1939-3539

Published: Institute of Electrical and Electronics Engineers (IEEE) 01.08.2024

Published in IEEE Transactions on Pattern Analysis and Machine Intelligence (01.08.2024)

Get full text

Journal Article

Save to List

Saved in:

Search Results - "Computer Science - Computer Vision and Pattern Recognition"

Analysis of Classifier-Free Guidance Weight Schedulers by Wang, Xi, Dufour, Nicolas, Andreou, Nefeli, Cani, Marie-Paule, Fernández Abrevaya, Victoria, Picard, David, Kalogeiton, Vicky

Quantifying Societal Bias Amplification in Image Captioning by Hirota, Yusuke, Nakashima, Yuta, Garcia, Noa

Learning Self-Prior for Mesh Inpainting Using Self-Supervised Graph Convolutional Networks by Shota Hattori, Tatsuya Yatagawa, Yutaka Ohtake, Hiromasa Suzuki

HFGaussian: Learning Generalizable Gaussian Human with Integrated Human Features by Dey, Arnab, Lu, Cheng-You, Comport, Andrew I., Sridhar, Srinath, Lin, Chin-Teng, Martinet, Jean

Reproducibility Study on Adversarial Attacks Against Robust Transformer Trackers by Nokabadi, Fatemeh Nourilenjan, Lalonde, Jean-François, Gagné, Christian

CNN-based real-time 2D-3D deformable registration from a single X-ray projection by Lecomte, Francois, Dillenseger, Jean-Louis, Cotin, Stéphane

CSE: Surface Anomaly Detection with Contrastively Selected Embedding by Thomine, Simon, Snoussi, Hichem

Incorporating Crowdsourced Annotator Distributions into Ensemble Modeling to Improve Classification Trustworthiness for Ancient Greek Papyri by West, Graham, Swindall, Matthew I., Keener, Ben, Player, Timothy, Williams, Alex C., Brusuelas, James H., Wallin, John F.

The Challenges of HTR Model Training: Feedback from the Project Donner le gout de l'archive a l'ere numerique by Couture, Beatrice, Verret, Farah, Gohier, Maxime, Deslandres, Dominique

Combining Morphological and Histogram based Text Line Segmentation in the OCR Context by Schneider, Pit

Deep Learning of Crystalline Defects from TEM images: A Solution for the Problem of" Never Enough Training Data" by Govind, Kishan, Oliveros, Daniela, Dlouhy, Antonin, Legros, Marc, Sandfeld, Stefan

AVI-Talking: Learning Audio-Visual Instructions for Expressive 3D Talking Face Generation by Sun, Yasheng, Chu, Wenqing, Zhou, Hang, Wang, Kaisiyuan, Koike, Hideki

Eigenlanes: Data-Driven Lane Descriptors for Structurally Diverse Lanes by Jin, Dongkwon, Park, Wonhui, Jeong, Seong-Gyun, Kwon, Heeyeon, Kim, Chang-Su

Physical Adversarial Attack Meets Computer Vision: A Decade Survey by Hui Wei, Hao Tang, Xuemei Jia, Zhixiang Wang, Hanxun Yu, Zhubo Li, Shin’ichi Satoh, Luc Van Gool, Zheng Wang

Dual-Pixel Raindrop Removal by Yizhou Li, Yusuke Monno, Masatoshi Okutomi

Computer Analysis of Architecture Using Automatic Image Understanding by Wei, Fan, Li, Yuan, Shamir, Lior

Computer Analysis of Architecture Using Automatic Image Understanding by Fan Wei, Yuan Li, Lior Shamir

SpectralGPT: Spectral Remote Sensing Foundation Model by Danfeng Hong, Bing Zhang, Xuyang Li, Yuxuan Li, Chenyu Li, Jing Yao, Naoto Yokoya, Hao Li, Pedram Ghamisi, Xiuping Jia, Antonio Plaza, Paolo Gamba, Jon Atli Benediktsson, Jocelyn Chanussot

Search Tools:

Refine Results

Format

Subject Area

Topic

Language

Year of Publication