Parallel Multi-Dimensional LSTM, with Application to Fast Biomedical Volumetric Image Segmentation
Abstract
Convolutional Neural Networks (CNNs) can be shifted across 2D images or 3D videos to segment them. They have a fixed input size and typically perceive only small local contexts of the pixels to be classified as foreground or background. In contrast, Multi-Dimensional Recurrent NNs (MD-RNNs) can perceive the entire spatio-temporal context of each pixel in a few sweeps through all pixels, especially when the RNN is a Long Short-Term Memory (LSTM). Despite these theoretical advantages, however, unlike CNNs, previous MD-LSTM variants were hard to parallelise on GPUs. Here we re-arrange the traditional cuboid order of computations in MD-LSTM in pyramidal fashion. The resulting PyraMiD-LSTM is easy to parallelise, especially for 3D data such as stacks of brain slice images. PyraMiD-LSTM achieved best known pixel-wise brain image segmentation results on MRBrainS13 (and competitive results on EM-ISBI12).
Cite
Text
Stollenga et al. "Parallel Multi-Dimensional LSTM, with Application to Fast Biomedical Volumetric Image Segmentation." Neural Information Processing Systems, 2015.Markdown
[Stollenga et al. "Parallel Multi-Dimensional LSTM, with Application to Fast Biomedical Volumetric Image Segmentation." Neural Information Processing Systems, 2015.](https://mlanthology.org/neurips/2015/stollenga2015neurips-parallel/)BibTeX
@inproceedings{stollenga2015neurips-parallel,
title = {{Parallel Multi-Dimensional LSTM, with Application to Fast Biomedical Volumetric Image Segmentation}},
author = {Stollenga, Marijn F and Byeon, Wonmin and Liwicki, Marcus and Schmidhuber, Jürgen},
booktitle = {Neural Information Processing Systems},
year = {2015},
pages = {2998-3006},
url = {https://mlanthology.org/neurips/2015/stollenga2015neurips-parallel/}
}