ScanBank: A Benchmark Dataset for Figure Extraction from Scanned Electronic Theses and Dissertations

We focus on electronic theses and dissertations (ETDs), aiming to improve access and expand their utility, since more than 6 million are publicly available, and they constitute an important corpus to aid research and education across disciplines. The corpus is growing as new born-digital documents a...

Bibliographic Details
Published in: 2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL), pp. 180-191
Main Authors: Kahu, Sampanna Yashwant; Ingram, William A.; Fox, Edward A.; Wu, Jian
Format: Conference Proceeding
Language: English
Published: IEEE 01.09.2021