Use of a Multi-Layer Perceptron to Predict Malignancy in Ovarian Tumors

Abstract

This paper is concerned with the problem of Reinforcement Learn(cid:173) ing (RL) for continuous state space and time stocha.stic control problems. We state the Harnilton-Jacobi-Bellman equation satis(cid:173) fied by the value function and use a Finite-Difference method for designing a convergent approximation scheme. Then we propose a RL algorithm based on this scheme and prove its convergence to the optimal solution.

Cite

Text

Verrelst et al. "Use of a Multi-Layer Perceptron to Predict Malignancy in Ovarian Tumors." Neural Information Processing Systems, 1997.

Markdown

[Verrelst et al. "Use of a Multi-Layer Perceptron to Predict Malignancy in Ovarian Tumors." Neural Information Processing Systems, 1997.](https://mlanthology.org/neurips/1997/verrelst1997neurips-use/)

BibTeX

@inproceedings{verrelst1997neurips-use,
  title     = {{Use of a Multi-Layer Perceptron to Predict Malignancy in Ovarian Tumors}},
  author    = {Verrelst, Herman and Moreau, Yves and Vandewalle, Joos and Timmerman, Dirk},
  booktitle = {Neural Information Processing Systems},
  year      = {1997},
  pages     = {978-984},
  url       = {https://mlanthology.org/neurips/1997/verrelst1997neurips-use/}
}