Search Results - "Computer Science - Computer Vision and Pattern Recognition"

  1. 1
  2. 2

    Analysis of Classifier-Free Guidance Weight Schedulers by Wang, Xi, Dufour, Nicolas, Andreou, Nefeli, Cani, Marie-Paule, Fernández Abrevaya, Victoria, Picard, David, Kalogeiton, Vicky

    ISSN: 2835-8856
    Published: [Amherst Massachusetts]: OpenReview.net, 2022 2024
    “…Classifier-Free Guidance (CFG) enhances the quality and condition adherence of text-to-image diffusion models. It operates by combining the conditional and…”
    Journal Article
  3. 3

    Quantifying Societal Bias Amplification in Image Captioning by Hirota, Yusuke, Nakashima, Yuta, Garcia, Noa

    ISSN: 1063-6919
    Published: IEEE 01.06.2022
    “…We study societal bias amplification in image captioning. Image captioning models have been shown to perpetuate gender and racial biases, however, metrics to…”
    Conference Proceeding
  4. 4

    Learning Self-Prior for Mesh Inpainting Using Self-Supervised Graph Convolutional Networks by Shota Hattori, Tatsuya Yatagawa, Yutaka Ohtake, Hiromasa Suzuki

    ISSN: 1077-2626, 2160-9306
    Published: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2024
    Journal Article
  5. 5

    Reproducibility Study on Adversarial Attacks Against Robust Transformer Trackers by Nokabadi, Fatemeh Nourilenjan, Lalonde, Jean-François, Gagné, Christian

    ISSN: 2835-8856
    Published: [Amherst Massachusetts]: OpenReview.net, 2022 03.06.2024
    “…New transformer networks have been integrated into object tracking pipelines and have demonstrated strong performance on the latest benchmarks. This paper…”
    Journal Article
  6. 6

    CNN-based real-time 2D-3D deformable registration from a single X-ray projection by Lecomte, Francois, Dillenseger, Jean-Louis, Cotin, Stéphane

    ISSN: 2077-0383
    Published: MDPI 01.03.2024
    Published in Journal of clinical medicine (01.03.2024)
    “…Purpose: The purpose of this paper is to present a method for real-time 2D-3D non-rigid registration using a single fluoroscopic image. Such a method can find…”
    Journal Article
  7. 7

    Incorporating Crowdsourced Annotator Distributions into Ensemble Modeling to Improve Classification Trustworthiness for Ancient Greek Papyri by West, Graham, Swindall, Matthew I., Keener, Ben, Player, Timothy, Williams, Alex C., Brusuelas, James H., Wallin, John F.

    ISSN: 2416-5999
    Published: Nicolas Turenne 07.02.2024
    “…Performing classification on noisy, crowdsourced image datasets can prove challenging even for the best neural networks. Two issues which complicate the…”
    Journal Article
  8. 8

    The Challenges of HTR Model Training: Feedback from the Project Donner le gout de l'archive a l'ere numerique by Couture, Beatrice, Verret, Farah, Gohier, Maxime, Deslandres, Dominique

    ISSN: 2416-5999
    Published: Nicolas Turenne 06.12.2023
    “…The arrival of handwriting recognition technologies offers new possibilities for research in heritage studies. However, it is now necessary to reflect on the…”
    Journal Article
  9. 9

    Combining Morphological and Histogram based Text Line Segmentation in the OCR Context by Schneider, Pit

    ISSN: 2416-5999
    Published: Nicolas Turenne 04.11.2021
    “…Text line segmentation is one of the pre-stages of modern optical character recognition systems. The algorithmic approach proposed by this paper has been…”
    Journal Article
  10. 10

    SoccerNet 2023 challenges results by Cioppa, Anthony, Giancola, Silvio, Somers, Vladimir, Magera, Floriane, Zhou, Xin, Mkhallati, Hassan, Deliège, Adrien, Held, Jan, Hinojosa, Carlos, Mansourian, Amir M., Miralles, Pierre, Barnich, Olivier, De Vleeschouwer, Christophe, Alahi, Alexandre, Ghanem, Bernard, Van Droogenbroeck, Marc, Kamal, Abdullah, Maglo, Adrien, Clapés, Albert, Abdelaziz, Amr, Xarles, Artur, Orcesi, Astrid, Scott, Atom, Liu, Bin, Lim, Byoungkwon, Chen, Chen, Deuser, Fabian, Yan, Feng, Yu, Fufu, Shitrit, Gal, Wang, Guanshuo, Choi, Gyusik, Kim, Hankyul, Guo, Hao, Fahrudin, Hasby, Koguchi, Hidenari, Ardö, Håkan, Salah, Ibrahim, Yerushalmy, Ido, Muhammad, Iftikar, Uchida, Ikuma, Be’ery, Ishay, Rabarisoa, Jaonary, Lee, Jeongae, Fu, Jiajun, Yin, Jianqin, Xu, Jinghang, Nang, Jongho, Denize, Julien, Li, Junjie, Zhang, Junpei, Kim, Juntae, Synowiec, Kamil, Kobayashi, Kenji, Zhang, Kexin, Habel, Konrad, Nakajima, Kota, Jiao, Licheng, Ma, Lin, Wang, Lizhi, Wang, Luping, Li, Menglong, Zhou, Mengying, Nasr, Mohamed, Abdelwahed, Mohamed, Liashuha, Mykola, Falaleev, Nikolay, Oswald, Norbert, Jia, Qiong, Pham, Quoc-Cuong, Song, Ran, Hérault, Romain, Peng, Rui, Chen, Ruilong, Liu, Ruixuan, Baikulov, Ruslan, Fukushima, Ryuto, Escalera, Sergio, Lee, Seungcheon, Chen, Shimin, Ding, Shouhong, Someya, Taiga, Moeslund, Thomas B., Li, Tianjiao, Shen, Wei, Zhang, Wei, Li, Wei, Dai, Wei, Luo, Weixin, Zhao, Wending, Zhang, Wenjie, Yang, Xinquan, Ma, Yanbiao, Joo, Yeeun, Zeng, Yingsen, Gan, Yiyang, Zhu, Yongqiang, Zhong, Yujie, Ruan, Zheng, Li, Zhiheng

    ISSN: 1369-7072, 1460-2687
    Published: Heidelberg Springer Nature B.V 01.12.2024
    Published in Sports engineering (01.12.2024)
    “…The SoccerNet 2023 challenges were the third annual video understanding challenges organized by the SoccerNet team. For this third edition, the challenges were…”
    Journal Article
  11. 11

    Deep Learning of Crystalline Defects from TEM images: A Solution for the Problem of "Never Enough Training Data" by Govind, Kishan, Oliveros, Daniela, Dlouhy, Antonin, Legros, Marc, Sandfeld, Stefan

    ISSN: 2632-2153
    Published: IOP Publishing Ltd 12.07.2023
    Published in Machine learning: science and technology (12.07.2023)
    “…Crystalline defects, such as line-like dislocations, play an important role for the performance and reliability of many metallic devices. Their interaction and…”
    Journal Article
  12. 12
  13. 13

    Dual-Pixel Raindrop Removal by Yizhou Li, Yusuke Monno, Masatoshi Okutomi

    ISSN: 0162-8828, 1939-3539
    Published: Institute of Electrical and Electronics Engineers (IEEE) 01.12.2024
    Journal Article
  14. 14

    Computer Analysis of Architecture Using Automatic Image Understanding by Wei, Fan, Li, Yuan, Shamir, Lior

    ISSN: 2416-5999
    Published: Nicolas Turenne 22.01.2019
    “…In the past few years, computer vision and pattern recognition systems have been becoming increasingly more powerful, expanding the range of automatic tasks…”
    Journal Article
  15. 15

    Computer Analysis of Architecture Using Automatic Image Understanding by Fan Wei, Yuan Li, Lior Shamir

    ISSN: 2416-5999
    Published: Nicolas Turenne 01.01.2019
    “…In the past few years, computer vision and pattern recognition systems have been becoming increasingly more powerful, expanding the range of automatic tasks…”
    Journal Article
  16. 16

    Eigenlanes: Data-Driven Lane Descriptors for Structurally Diverse Lanes by Jin, Dongkwon, Park, Wonhui, Jeong, Seong-Gyun, Kwon, Heeyeon, Kim, Chang-Su

    ISSN: 1063-6919
    Published: IEEE 01.06.2022
    “…A novel algorithm to detect road lanes in the eigen-lane space is proposed in this paper. First, we introduce the notion of eigenlanes, which are data-driven…”
    Conference Proceeding
  17. 17
  18. 18

    Instrument-Tissue Interaction Detection Framework for Surgical Video Understanding by Wenjun Lin, Yan Hu, Huazhu Fu, Mingming Yang, Chin-Boon Chng, Ryo Kawasaki, Cheekong Chui, Jiang Liu

    ISSN: 0278-0062, 1558-254X
    Published: Institute of Electrical and Electronics Engineers (IEEE) 01.08.2024
    Published in IEEE Transactions on Medical Imaging (01.08.2024)
    Journal Article
  19. 19
  20. 20

    AVI-Talking: Learning Audio-Visual Instructions for Expressive 3D Talking Face Generation by Sun, Yasheng, Chu, Wenqing, Zhou, Hang, Wang, Kaisiyuan, Koike, Hideki

    ISSN: 2169-3536
    Published: Piscataway IEEE 2024
    Published in IEEE Access (2024)
    “…While considerable progress has been made in achieving accurate lip synchronization for 3D speech-driven talking face generation, the task of incorporating…”
    Journal Article