The "Softmax" Nonlinearity: Derivation Using Statistical Mechanics and Useful Properties as a Multiterminal Analog Circuit Element
Abstract
We use mean-field theory methods from Statistical Mechanics to derive the "softmax" nonlinearity from the discontinuous winner(cid:173) take-all (WTA) mapping. We give two simple ways of implementing "soft max" as a multiterminal network element. One of these has a number of important network-theoretic properties. It is a recipro(cid:173) cal, passive, incrementally passive, nonlinear, resistive multitermi(cid:173) nal element with a content function having the form of information(cid:173) theoretic entropy. These properties should enable one to use this element in nonlinear RC networks with such other reciprocal el(cid:173) ements as resistive fuses and constraint boxes to implement very high speed analog optimization algorithms using a minimum of hardware.
Cite
Text
Elfadel and Jr.. "The "Softmax" Nonlinearity: Derivation Using Statistical Mechanics and Useful Properties as a Multiterminal Analog Circuit Element." Neural Information Processing Systems, 1993.Markdown
[Elfadel and Jr.. "The "Softmax" Nonlinearity: Derivation Using Statistical Mechanics and Useful Properties as a Multiterminal Analog Circuit Element." Neural Information Processing Systems, 1993.](https://mlanthology.org/neurips/1993/elfadel1993neurips-softmax/)BibTeX
@inproceedings{elfadel1993neurips-softmax,
title = {{The "Softmax" Nonlinearity: Derivation Using Statistical Mechanics and Useful Properties as a Multiterminal Analog Circuit Element}},
author = {Elfadel, I. M. and Jr., J. L. Wyatt},
booktitle = {Neural Information Processing Systems},
year = {1993},
pages = {882-887},
url = {https://mlanthology.org/neurips/1993/elfadel1993neurips-softmax/}
}