Open main menu
Home
Random
Recent changes
Special pages
Community portal
Preferences
About Wikipedia
Disclaimers
Incubator escapee wiki
Search
User menu
Talk
Dark mode
Contributions
Create account
Log in
Editing
Logistic function
(section)
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
==== Neural networks ==== Logistic functions are often used in [[artificial neural network]]s to introduce [[nonlinearity]] in the model or to clamp signals to within a specified [[interval (mathematics)|interval]]. A popular [[artificial neuron|neural net element]] computes a [[linear combination]] of its input signals, and applies a bounded logistic function as the [[activation function]] to the result; this model can be seen as a "smoothed" variant of the classical [[perceptron|threshold neuron]].<!-- A reason for its popularity in neural networks is because the logistic function satisfies the differential equation <math display="block">y' = y(1-y).</math> The right hand side is a low-degree polynomial. Furthermore, the polynomial has factors <math>y</math> and <math>1 β y</math>, both of which are simple to compute. Given <math>y = sig(t)</math> at a particular <math>t</math>, the derivative of the logistic function at that <math>t</math> can be obtained by multiplying the two factors together. --> A common choice for the activation or "squashing" functions, used to clip large magnitudes to keep the response of the neural network bounded,<ref name="Gershenfeld-1999">Gershenfeld 1999, p.Β 150.</ref> is <math display="block">g(h) = \frac{1}{1 + e^{-2 \beta h}},</math> which is a logistic function. These relationships result in simplified implementations of [[artificial neural network]]s with [[artificial neuron]]s. Practitioners caution that sigmoidal functions which are [[Odd functions|antisymmetric]] about the origin (e.g. the [[hyperbolic tangent]]) lead to faster convergence when training networks with [[backpropagation]].<ref name="LeCun-1998">{{cite book | author1 = LeCun, Y. | author2 = Bottou, L. | author3 = Orr, G. | author4 = Muller, K. | editor = Orr, G. | editor2 = Muller, K. | year = 1998 | contribution = Efficient BackProp | title = Neural Networks: Tricks of the trade | isbn = 3-540-65311-2 | publisher = Springer | contribution-url = http://yann.lecun.com/exdb/publis/pdf/lecun-98b.pdf | archive-date = 31 August 2018 | access-date = 16 September 2009 | archive-url = https://web.archive.org/web/20180831075352/http://yann.lecun.com/exdb/publis/pdf/lecun-98b.pdf | url-status = dead }}</ref> The logistic function is itself the derivative of another proposed activation function, the [[softplus]].
Edit summary
(Briefly describe your changes)
By publishing changes, you agree to the
Terms of Use
, and you irrevocably agree to release your contribution under the
CC BY-SA 4.0 License
and the
GFDL
. You agree that a hyperlink or URL is sufficient attribution under the Creative Commons license.
Cancel
Editing help
(opens in new window)