Multimodal Prescriptive Deep Learning

Abstract

We introduce a multimodal deep learning framework, Prescriptive Neural Networks (PNNs), that combines ideas from optimization and machine learning to perform treatment recommendation, and that, to the best of our knowledge, is among the first prescriptive approaches tested with both structured and unstructured data within a unified model. The PNN is a feedforward neural network trained on embeddings to output an outcome-optimizing prescription. In two real-world multimodal datasets, we demonstrate that PNNs prescribe treatments that are able to greatly improve estimated outcome rewards; by over 40% in transcatheter aortic valve replacement (TAVR) procedures and by 25% in liver trauma injuries. In four real-world, unimodal tabular datasets, we demonstrate that PNNs outperform or perform comparably to other well-known, state-of-the-art prescriptive models; importantly, on tabular datasets, we also recover interpretability through knowledge distillation, fitting interpretable Optimal Classification Tree models onto the PNN prescriptions as classification targets, which is critical for many real-world applications. Finally, we demonstrate that our multimodal PNN models achieve stability across randomized data splits comparable to other prescriptive methods and produce realistic prescriptions across the different datasets.

Cite

Text

Bertsimas et al. "Multimodal Prescriptive Deep Learning." Transactions on Machine Learning Research, 2026.

Markdown

[Bertsimas et al. "Multimodal Prescriptive Deep Learning." Transactions on Machine Learning Research, 2026.](https://mlanthology.org/tmlr/2026/bertsimas2026tmlr-multimodal/)

BibTeX

@article{bertsimas2026tmlr-multimodal,
  title     = {{Multimodal Prescriptive Deep Learning}},
  author    = {Bertsimas, Dimitris and Everest, Lisa and Stoumpou, Vasiliki},
  journal   = {Transactions on Machine Learning Research},
  year      = {2026},
  url       = {https://mlanthology.org/tmlr/2026/bertsimas2026tmlr-multimodal/}
}