Open main menu
Home
Random
Recent changes
Special pages
Community portal
Preferences
About Wikipedia
Disclaimers
Incubator escapee wiki
Search
User menu
Talk
Dark mode
Contributions
Create account
Log in
Editing
Hidden Markov model
(section)
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
== Structural architecture == The diagram below shows the general architecture of an instantiated HMM. Each oval shape represents a random variable that can adopt any of a number of values. The random variable ''x''(''t'') is the hidden state at time {{mvar|t}} (with the model from the above diagram, {{nowrap|''x''(''t'') β { ''x''<sub>1</sub>, ''x''<sub>2</sub>, ''x''<sub>3</sub> })}}. The random variable ''y''(''t'') is the observation at time {{mvar|t}} (with {{nowrap|''y''(''t'') β { ''y''<sub>1</sub>, ''y''<sub>2</sub>, ''y''<sub>3</sub>, ''y''<sub>4</sub> })}}. The arrows in the diagram (often called a [[Trellis (graph)|trellis diagram]]) denote conditional dependencies. From the diagram, it is clear that the [[conditional probability distribution]] of the hidden variable ''x''(''t'') at time {{mvar|t}}, given the values of the hidden variable {{mvar|x}} at all times, depends ''only'' on the value of the hidden variable {{nowrap|''x''(''t'' β 1)}}; the values at time {{nowrap|''t'' β 2}} and before have no influence. This is called the [[Markov property]]. Similarly, the value of the observed variable ''y''(''t'') depends on only the value of the hidden variable ''x''(''t'') (both at time {{mvar|t}}). In the standard type of hidden Markov model considered here, the state space of the hidden variables is discrete, while the observations themselves can either be discrete (typically generated from a [[categorical distribution]]) or continuous (typically from a [[Gaussian distribution]]). The parameters of a hidden Markov model are of two types, ''transition probabilities'' and ''emission probabilities'' (also known as ''output probabilities''). The transition probabilities control the way the hidden state at time {{mvar|t}} is chosen given the hidden state at time <math>t-1</math>. The hidden state space is assumed to consist of one of {{mvar|N}} possible values, modelled as a categorical distribution. (See the section below on extensions for other possibilities.) This means that for each of the {{mvar|N}} possible states that a hidden variable at time {{mvar|t}} can be in, there is a transition probability from this state to each of the {{mvar|N}} possible states of the hidden variable at time <math>t+1</math>, for a total of <math>N^2</math> transition probabilities. The set of transition probabilities for transitions from any given state must sum to 1. Thus, the <math>N \times N</math> matrix of transition probabilities is a [[Stochastic matrix|Markov matrix]]. Because any transition probability can be determined once the others are known, there are a total of <math>N(N-1)</math> transition parameters. In addition, for each of the {{mvar|N}} possible states, there is a set of emission probabilities governing the distribution of the observed variable at a particular time given the state of the hidden variable at that time. The size of this set depends on the nature of the observed variable. For example, if the observed variable is discrete with {{mvar|M}} possible values, governed by a [[categorical distribution]], there will be <math>M-1</math> separate parameters, for a total of <math>N(M-1)</math> emission parameters over all hidden states. On the other hand, if the observed variable is an {{mvar|M}}-dimensional vector distributed according to an arbitrary [[multivariate Gaussian distribution]], there will be {{mvar|M}} parameters controlling the [[mean]]s and <math>\frac {M(M+1)} 2</math> parameters controlling the [[covariance matrix]], for a total of <math>N \left(M + \frac{M(M+1)}{2}\right) = \frac {NM(M+3)} 2 = O(NM^2)</math> emission parameters. (In such a case, unless the value of {{mvar|M}} is small, it may be more practical to restrict the nature of the covariances between individual elements of the observation vector, e.g. by assuming that the elements are independent of each other, or less restrictively, are independent of all but a fixed number of adjacent elements.) [[File:hmm temporal bayesian net.svg|500px|center|Temporal evolution of a hidden Markov model]]
Edit summary
(Briefly describe your changes)
By publishing changes, you agree to the
Terms of Use
, and you irrevocably agree to release your contribution under the
CC BY-SA 4.0 License
and the
GFDL
. You agree that a hyperlink or URL is sufficient attribution under the Creative Commons license.
Cancel
Editing help
(opens in new window)