Search Results - "Computer Science - Computer Vision and Pattern Recognition"
-
1
DINOv2: Learning Robust Visual Features without Supervision
ISSN: 2835-8856
Published: [Amherst Massachusetts]: OpenReview.net, 2022 2024
Published in Transactions on Machine Learning Research Journal (2024)
“…The recent breakthroughs in natural language processing for model pretraining on large quantities of data have opened the way for similar foundation models in…”
Journal Article -
2
Analysis of Classifier-Free Guidance Weight Schedulers
ISSN: 2835-8856
Published: [Amherst Massachusetts]: OpenReview.net, 2022 2024
Published in Transactions on Machine Learning Research Journal (2024)
“…Classifier-Free Guidance (CFG) enhances the quality and condition adherence of text-to-image diffusion models. It operates by combining the conditional and…”
Journal Article -
3
Quantifying Societal Bias Amplification in Image Captioning
ISSN: 1063-6919
Published: IEEE 01.06.2022
Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2022)
“…We study societal bias amplification in image captioning. Image captioning models have been shown to perpetuate gender and racial biases, however, metrics to…”
Conference Proceeding -
4
Learning Self-Prior for Mesh Inpainting Using Self-Supervised Graph Convolutional Networks
ISSN: 1077-2626, 2160-9306
Published: Institute of Electrical and Electronics Engineers (IEEE) 01.01.2024
Published in IEEE Transactions on Visualization and Computer Graphics (01.01.2024)
Journal Article -
5
Reproducibility Study on Adversarial Attacks Against Robust Transformer Trackers
ISSN: 2835-8856
Published: [Amherst Massachusetts]: OpenReview.net, 2022 03.06.2024
Published in Transactions on Machine Learning Research Journal (03.06.2024)
“…New transformer networks have been integrated into object tracking pipelines and have demonstrated strong performance on the latest benchmarks. This paper…”
Journal Article -
6
CNN-based real-time 2D-3D deformable registration from a single X-ray projection
ISSN: 2077-0383, 2077-0383
Published: MDPI 01.03.2024
Published in Journal of clinical medicine (01.03.2024)
“…Purpose: The purpose of this paper is to present a method for real-time 2D-3D non-rigid registration using a single fluoroscopic image. Such a method can find…”
Journal Article -
7
Incorporating Crowdsourced Annotator Distributions into Ensemble Modeling to Improve Classification Trustworthiness for Ancient Greek Papyri
ISSN: 2416-5999, 2416-5999
Published: Nicolas Turenne 07.02.2024
Published in Journal of data mining and digital humanities (07.02.2024)
“…Performing classification on noisy, crowdsourced image datasets can prove challenging even for the best neural networks. Two issues which complicate the…”
Journal Article -
8
The Challenges of HTR Model Training: Feedback from the Project Donner le goût de l'archive à l'ère numérique
ISSN: 2416-5999, 2416-5999
Published: Nicolas Turenne 06.12.2023
Published in Journal of data mining and digital humanities (06.12.2023)
“…The arrival of handwriting recognition technologies offers new possibilities for research in heritage studies. However, it is now necessary to reflect on the…”
Journal Article -
9
Combining Morphological and Histogram based Text Line Segmentation in the OCR Context
ISSN: 2416-5999, 2416-5999
Published: Nicolas Turenne 04.11.2021
Published in Journal of data mining and digital humanities (04.11.2021)
“…Text line segmentation is one of the pre-stages of modern optical character recognition systems. The algorithmic approach proposed by this paper has been…”
Journal Article -
10
SoccerNet 2023 challenges results
ISSN: 1369-7072, 1460-2687, 1460-2687
Published: Heidelberg Springer Nature B.V 01.12.2024
Published in Sports engineering (01.12.2024)
“…The SoccerNet 2023 challenges were the third annual video understanding challenges organized by the SoccerNet team. For this third edition, the challenges were…”
Journal Article -
11
Deep Learning of Crystalline Defects from TEM images: A Solution for the Problem of "Never Enough Training Data"
ISSN: 2632-2153
Published: IOP Publishing Ltd 12.07.2023
Published in Machine learning: science and technology (12.07.2023)
“…Crystalline defects, such as line-like dislocations, play an important role for the performance and reliability of many metallic devices. Their interaction and…”
Journal Article -
12
Physical Adversarial Attack Meets Computer Vision: A Decade Survey
ISSN: 0162-8828, 1939-3539
Published: Institute of Electrical and Electronics Engineers (IEEE) 01.12.2024
Published in IEEE Transactions on Pattern Analysis and Machine Intelligence (01.12.2024)
Journal Article -
13
Dual-Pixel Raindrop Removal
ISSN: 0162-8828, 1939-3539
Published: Institute of Electrical and Electronics Engineers (IEEE) 01.12.2024
Published in IEEE Transactions on Pattern Analysis and Machine Intelligence (01.12.2024)
Journal Article -
14
Computer Analysis of Architecture Using Automatic Image Understanding
ISSN: 2416-5999, 2416-5999
Published: Nicolas Turenne 22.01.2019
Published in Journal of data mining and digital humanities (22.01.2019)
“…In the past few years, computer vision and pattern recognition systems have been becoming increasingly more powerful, expanding the range of automatic tasks…”
Journal Article -
15
Computer Analysis of Architecture Using Automatic Image Understanding
ISSN: 2416-5999, 2416-5999
Published: Nicolas Turenne 01.01.2019
Published in Journal of data mining and digital humanities (01.01.2019)
“…In the past few years, computer vision and pattern recognition systems have been becoming increasingly more powerful, expanding the range of automatic tasks…”
Journal Article -
16
Eigenlanes: Data-Driven Lane Descriptors for Structurally Diverse Lanes
ISSN: 1063-6919
Published: IEEE 01.06.2022
Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2022)
“…A novel algorithm to detect road lanes in the eigen-lane space is proposed in this paper. First, we introduce the notion of eigenlanes, which are data-driven…”
Conference Proceeding -
17
SpectralGPT: Spectral Remote Sensing Foundation Model
ISSN: 0162-8828, 1939-3539
Published: Institute of Electrical and Electronics Engineers (IEEE) 01.08.2024
Published in IEEE Transactions on Pattern Analysis and Machine Intelligence (01.08.2024)
Journal Article -
18
Instrument-Tissue Interaction Detection Framework for Surgical Video Understanding
ISSN: 0278-0062, 1558-254X
Published: Institute of Electrical and Electronics Engineers (IEEE) 01.08.2024
Published in IEEE Transactions on Medical Imaging (01.08.2024)
Journal Article -
19
DisCO: Portrait Distortion Correction with Perspective-Aware 3D GANs
ISSN: 0920-5691, 1573-1405
Published: Springer Science and Business Media LLC 17.06.2024
Published in International Journal of Computer Vision (17.06.2024)
Journal Article -
20
AVI-Talking: Learning Audio-Visual Instructions for Expressive 3D Talking Face Generation
ISSN: 2169-3536, 2169-3536
Published: Piscataway IEEE 2024
Published in IEEE Access (2024)
“…While considerable progress has been made in achieving accurate lip synchronization for 3D speech-driven talking face generation, the task of incorporating…”
Journal Article