Parallel Multi-Dimensional LSTM, with Application to Fast Biomedical Volumetric Image Segmentation

Abstract

Convolutional Neural Networks (CNNs) can be shifted across 2D images or 3D videos to segment them. They have a fixed input size and typically perceive only small local contexts of the pixels to be classified as foreground or background. In contrast, Multi-Dimensional Recurrent NNs (MD-RNNs) can perceive the entire spatio-temporal context of each pixel in a few sweeps through all pixels, especially when the RNN is a Long Short-Term Memory (LSTM). Despite these theoretical advantages, however, unlike CNNs, previous MD-LSTM variants were hard to parallelise on GPUs. Here we re-arrange the traditional cuboid order of computations in MD-LSTM in pyramidal fashion. The resulting PyraMiD-LSTM is easy to parallelise, especially for 3D data such as stacks of brain slice images. PyraMiD-LSTM achieved best known pixel-wise brain image segmentation results on MRBrainS13 (and competitive results on EM-ISBI12).

Cite

Text

Stollenga et al. "Parallel Multi-Dimensional LSTM, with Application to Fast Biomedical Volumetric Image Segmentation." Neural Information Processing Systems, 2015.

Markdown

[Stollenga et al. "Parallel Multi-Dimensional LSTM, with Application to Fast Biomedical Volumetric Image Segmentation." Neural Information Processing Systems, 2015.](https://mlanthology.org/neurips/2015/stollenga2015neurips-parallel/)

BibTeX

@inproceedings{stollenga2015neurips-parallel,
  title     = {{Parallel Multi-Dimensional LSTM, with Application to Fast Biomedical Volumetric Image Segmentation}},
  author    = {Stollenga, Marijn F and Byeon, Wonmin and Liwicki, Marcus and Schmidhuber, Jürgen},
  booktitle = {Neural Information Processing Systems},
  year      = {2015},
  pages     = {2998-3006},
  url       = {https://mlanthology.org/neurips/2015/stollenga2015neurips-parallel/}
}