TorchIO: A Python library for efficient loading, preprocessing, augmentation and patch-based sampling of medical images in deep learning

•Open-source Python library for preprocessing, augmentation and sampling of medical images for deep learning.•Support for 2D, 3D and 4D images such as X-ray, histopathology, CT, ultrasound and diffusion MRI.•Modular design inspired by the deep learning framework PyTorch.•Focus on reproducibility and...

Full description

Saved in:

Bibliographic Details
Published in:	Computer methods and programs in biomedicine Vol. 208; p. 106236
Main Authors:	Pérez-García, Fernando, Sparks, Rachel, Ourselin, Sébastien
Format:	Journal Article
Language:	English
Published:	Elsevier B.V 01.09.2021 Elsevier Scientific Publishers
Subjects:	Data augmentation Deep learning Medical image computing Preprocessing Deep learning Medical image computing Data augmentation Preprocessing
ISSN:	0169-2607, 1872-7565, 1872-7565
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	•Open-source Python library for preprocessing, augmentation and sampling of medical images for deep learning.•Support for 2D, 3D and 4D images such as X-ray, histopathology, CT, ultrasound and diffusion MRI.•Modular design inspired by the deep learning framework PyTorch.•Focus on reproducibility and traceability to encourage open-science practices.•Compatible with related frameworks for medical image processing with deep learning. Processing of medical images such as MRI or CT presents different challenges compared to RGB images typically used in computer vision. These include a lack of labels for large datasets, high computational costs, and the need of metadata to describe the physical properties of voxels. Data augmentation is used to artificially increase the size of the training datasets. Training with image subvolumes or patches decreases the need for computational power. Spatial metadata needs to be carefully taken into account in order to ensure a correct alignment and orientation of volumes. We present TorchIO, an open-source Python library to enable efficient loading, preprocessing, augmentation and patch-based sampling of medical images for deep learning. TorchIO follows the style of PyTorch and integrates standard medical image processing libraries to efficiently process images during training of neural networks. TorchIO transforms can be easily composed, reproduced, traced and extended. Most transforms can be inverted, making the library suitable for test-time augmentation and estimation of aleatoric uncertainty in the context of segmentation. We provide multiple generic preprocessing and augmentation operations as well as simulation of MRI-specific artifacts. Source code, comprehensive tutorials and extensive documentation for TorchIO can be found at http://torchio.rtfd.io/. The package can be installed from the Python Package Index (PyPI) running pip install torchio. It includes a command-line interface which allows users to apply transforms to image files without using Python. Additionally, we provide a graphical user interface within a TorchIO extension in 3D Slicer to visualize the effects of transforms. TorchIO was developed to help researchers standardize medical image processing pipelines and allow them to focus on the deep learning experiments. It encourages good open-science practices, as it supports experiment reproducibility and is version-controlled so that the software can be cited precisely. Due to its modularity, the library is compatible with other frameworks for deep learning with medical images.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	0169-2607 1872-7565 1872-7565
DOI:	10.1016/j.cmpb.2021.106236