Elsevier

Medical Image Analysis

Volume 53, April 2019, Pages 142-155
Medical Image Analysis

Iterative fully convolutional neural networks for automatic vertebra segmentation and identification

https://doi.org/10.1016/j.media.2019.02.005Get rights and content

Highlights

  • Vertebrae are segmented with an iterative instance segmentation algorithm.

  • The method does not make assumptions about the number of visible vertebrae.

  • Detected vertebrae are anatomically labeled using a global probabilistic model.

  • A fully convolutional neural network performs both segmentation and identification.

  • Vertebra segmentations and identifications are evaluated on five CT and MR datasets.

Abstract

Precise segmentation and anatomical identification of the vertebrae provides the basis for automatic analysis of the spine, such as detection of vertebral compression fractures or other abnormalities. Most dedicated spine CT and MR scans as well as scans of the chest, abdomen or neck cover only part of the spine. Segmentation and identification should therefore not rely on the visibility of certain vertebrae or a certain number of vertebrae. We propose an iterative instance segmentation approach that uses a fully convolutional neural network to segment and label vertebrae one after the other, independently of the number of visible vertebrae. This instance-by-instance segmentation is enabled by combining the network with a memory component that retains information about already segmented vertebrae. The network iteratively analyzes image patches, using information from both image and memory to search for the next vertebra. To efficiently traverse the image, we include the prior knowledge that the vertebrae are always located next to each other, which is used to follow the vertebral column. The network concurrently performs multiple tasks, which are segmentation of a vertebra, regression of its anatomical label and prediction whether the vertebra is completely visible in the image, which allows to exclude incompletely visible vertebrae from further analyses. The predicted anatomical labels of the individual vertebrae are additionally refined with a maximum likelihood approach, choosing the overall most likely labeling if all detected vertebrae are taken into account. This method was evaluated with five diverse datasets, including multiple modalities (CT and MR), various fields of view and coverages of different sections of the spine, and a particularly challenging set of low-dose chest CT scans. For vertebra segmentation, the average Dice score was 94.9 ± 2.1% with an average absolute symmetric surface distance of 0.2 ± 10.1mm. The anatomical identification had an accuracy of 93%, corresponding to a single case with mislabeled vertebrae. Vertebrae were classified as completely or incompletely visible with an accuracy of 97%. The proposed iterative segmentation method compares favorably with state-of-the-art methods and is fast, flexible and generalizable.

Introduction

Segmentation and identification of the vertebrae is often a prerequisite for automatic analysis of the spine, such as detection of vertebral fractures (Yao et al., 2012), assessment of spinal deformities (Forsberg et al., 2013), or computer-assisted surgical interventions (Knez et al., 2016). Automatic spine analysis can be performed with a large variety of tomographic scans, including dedicated spine scans but also scans of the neck, chest or abdomen that incidentally cover part of the spine. A generic vertebra segmentation algorithm therefore needs to be robust with respect to different image resolutions and different coverages of the spine. This especially means that no assumptions should be made about the number of visible vertebrae and their anatomical identity, i.e., to which section of the spine they belong. Vertebra segmentation is therefore essentially an instance segmentation problem with an a priori unknown number of instances (i.e. vertebrae). However, in contrast to generic instance segmentation the individual instances are not independent of each other. The instances are known to be located in close proximity to each other in the image, forming together the vertebral column. We propose to approach vertebra segmentation with an instance segmentation algorithm that explicitly incorporates this prior knowledge to locate instances, but that makes no further assumptions.

Approaching vertebra segmentation as an instance segmentation problem entails treating all vertebrae as instances of the same class of objects. However, an anatomical identification of the segmented vertebrae is often also needed, for instance, for further analysis steps or for reporting purposes. Especially in images originally not intended for spine imaging, anatomical labeling of the vertebrae can be challenging due to variations in the field of view. These variations lead to variable coverage of the spine and also of structures that provide anatomical cues for identification of the vertebrae, such as the ribs or the sacrum. Additionally, neighboring vertebrae often have similar shape and appearance so that independent labeling of each vertebra may result in mistakes. Vertebra identification therefore requires a global rather than a per-instance approach to ensure an overall plausible, anatomically correct labeling.

Another challenge inherent to an instance segmentation approach is the identification of partially visible instances. While occlusion is a typical problem in two-dimensional but not in three-dimensional images, some vertebrae may be only partially visible due to the limited field of view of the scan. If these incompletely visible vertebrae are included in subsequent analyses that are based on the obtained vertebra segmentations, such as measurement of vertebral heights for detection and classification of vertebral compression fractures (Grigoryan et al., 2003), their results may be unreliable. Therefore, incompletely visible instances need to be either ignored or explicitly identified as incomplete so that they can be excluded from subsequent analyses.

In this paper, we propose an iterative instance-by-instance segmentation approach for vertebra segmentation based on a fully convolutional neural network. This network performs vertebra detection, segmentation, anatomical identification and classification of their completeness concurrently and therefore presents an entirely supervised approach that can be trained end-to-end. While we propose to attempt a per-instance identification of the individual vertebrae together with the segmentation, the labeling is subsequently adjusted taking all segmented vertebrae into account. In contrast to previous approaches, the presented method can be used for any imaging modality, any field of view and any number and type (cervical, thoracic, lumbar) of visible vertebrae because it avoids explicit modeling of shape and appearance of the vertebrae and the vertebral column. We evaluate these claims using a diverse selection of datasets, including scans from different modalities (CT and MR), various fields of view, cases with severe compression fractures and a particularly challenging set of low-dose chest CT.

Section snippets

Related work

While a few other methods have been published that address both vertebra segmentation and identification (Klinder, Ostermann, Ehm, Franz, Kneser, Lorenz, 2009, Kelm, Wels, Zhou, Seifert, Suehling, Zheng, Comaniciu, 2013, Chu, Belavỳ, Armbrecht, Bansmann, Felsenberg, Zheng, 2015, Suzani, Rasoulian, Seitel, Fels, Rohling, Abolmaesumi, 2015, Sekuboyina, Valentinitsch, Kirschke, Menze), the majority of methods in the literature focused on one of these problems. The existing literature is therefore

Methods

We propose a vertebra segmentation and identification method based on a single fully convolutional neural network (FCN) that performs multiple tasks concurrently. In contrast to existing methods, this avoids a multi-stage process with successive instance detection and segmentation, or segmentation and instance separation steps. Other existing generic instance segmentation methods with 2D deep neural networks often do not generalize well to 3D image volumes because they analyze the entire image

Datasets

We trained and evaluated the method with five sets of CT and MR scans that visualize the spine. Reference segmentation masks for four of these datasets are publicly available, which allowed for a comparison with other publications that used the same data. Examples of images from the datasets are shown in Fig. 3.

The thoracolumbar spine CT dataset consists of 15 dedicated spine CT scans that visualize all thoracic and lumbar vertebrae. It was originally used for the spine segmentation challenge

Experiments and results

We trained modality-specific instances of the network adjusted to the different ground truths, i.e., to perform vertebra segmentation in CT and vertebral body segmentation in MR. The CT training set consisted of 60 scans, of which 10 were thoracolumbar spine CT, 10 lumbar spine CT scans with compression fractures and 40 NLST scans. The CT evaluation set consisted of 30 scans, of which 5 were thoracolumbar spine CT scans, 5 lumbar spine CT with compression fractures, 10 NLST scans, and 10 normal

Discussion

This paper demonstrates that fully convolutional neural networks, which have been widely used for semantic segmentation (Litjens et al., 2017), are also capable of learning a complex instance segmentation task. Vertebra segmentation performed instance-by-instance required the network to learn to infer from an additional memory input which vertebra to segment and to ignore other vertebrae. Additionally, the same network was able to perform multiple tasks concurrently, namely vertebra

Acknowledgements

We would like to thank the organizers of the CSI 2014 spine segmentation challenge, the Laboratory of Imaging Technologies at the University of Ljubljana and the authors of the MR dataset for making scans and reference segmentations publicly available. We are furthermore grateful to the United States National Cancer Institute (NCI) for providing access to NCI’s data collected by the National Lung Screening Trial. The statements contained in this publication are solely ours and do not represent

References (62)

  • P.A. Bromiley et al.

    Fully automatic localisation of vertebrae in CT images using random forest regression voting

    International Workshop on Computational Methods and Clinical Applications for Spine Imaging. LNCS, 10182

    (2016)
  • Y. Cai et al.

    Multi-modality vertebra recognition in arbitrary views using 3D deformable hierarchical model

    IEEE Trans. Med. Imag.

    (2015)
  • I. Castro-Mateos et al.

    Statistical interspace models (SIMs): application to robust 3D spine segmentation

    IEEE Trans. Med. Imag.

    (2015)
  • H. Chen et al.

    Automatic localization and identification of vertebrae in Spine CT via a joint learning model with deep neural networks

    International Conference on Medical Image Computing and Computer-Assisted Intervention. LNCS, 9349

    (2015)
  • C. Chu et al.

    Fully automatic localization and segmentation of 3D vertebral bodies from CT/MR images via a learning-based method

    PLoS ONE

    (2015)
  • Ö. Çiçek et al.

    3D U-Net: Learning Dense Volumetric Segmentation from Sparse Annotation

    International Conference on Medical Image Computing and Computer-Assisted Intervention. LNCS, 9901

    (2016)
  • J. Cohen

    Weighted kappa: nominal scale agreement provision for scaled disagreement or partial credit

    Psychol. Bull.

    (1968)
  • B. De Brabandere et al.

    Semantic instance segmentation for autonomous driving

    Computer Vision and Pattern Recognition Workshops (CVPRW), 2017 IEEE Conference on

    (2017)
  • D. Forsberg et al.

    Fully automatic measurements of axial vertebral rotation for assessment of spinal deformity in idiopathic scoliosis

    Phys. Med. Biol.

    (2013)
  • D. Forsberg et al.

    Detection and labeling of vertebrae in MR images using deep learning with clinical annotations as training data

    J. Digit. Imag.

    (2017)
  • B. Glocker et al.

    Automatic localization and identification of vertebrae in arbitrary field-of-view CT scans

    International Conference on Medical Image Computing and Computer-Assisted Intervention. LNCS, 7512

    (2012)
  • B. Glocker et al.

    Vertebrae localization in pathological spine CT via dense classification from sparse annotations

    International Conference on Medical Image Computing and Computer-Assisted Intervention. LNCS, 8150

    (2013)
  • M. Grigoryan et al.

    Recognizing and reporting osteoporotic vertebral fractures

    Eur. Spine J.

    (2003)
  • K. Hammernik et al.

    Vertebrae segmentation in 3D CT images based on a variational framework

    Recent Advances in Computational Methods and Clinical Applications for Spine Imaging. Lecture Notes in Computational Vision and Biomechanics

    (2015)
  • K. He et al.

    Mask R-CNN

    IEEE International Conference on Computer Vision (ICCV)

    (2017)
  • B. Ibragimov et al.

    Interpolation-based detection of lumbar vertebrae in CT spine images

    Recent Advances in Computational Methods and Clinical Applications for Spine Imaging. Lecture Notes in Computational Vision and Biomechanics

    (2015)
  • B. Ibragimov et al.

    Segmentation of pathological structures by landmark-assisted deformable models

    IEEE Trans. Med. Imag.

    (2017)
  • B. Ibragimov et al.

    Shape representation for efficient landmark-based segmentation in 3-D

    IEEE Trans. Med. Imag.

    (2014)
  • R. Janssens et al.

    Fully automatic segmentation of lumbar vertebrae from CT images using cascaded 3D fully convolutional networks

    IEEE 15th International Symposium on Biomedical Imaging (ISBI)

    (2018)
  • S. Kadoury et al.

    Spine segmentation in medical images using manifold embeddings and higher-order MRFs

    IEEE Trans. Med. Imag.

    (2013)
  • B.M. Kelm et al.

    Spine detection in CT and MR using iterated marginal space learning

    Med. Image Anal.

    (2013)
  • Cited by (0)

    View full text