Open main menu
Home
Random
Recent changes
Special pages
Community portal
Preferences
About Wikipedia
Disclaimers
Incubator escapee wiki
Search
User menu
Talk
Dark mode
Contributions
Create account
Log in
Editing
Naive Bayes classifier
(section)
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
===Bernoulli naive Bayes=== In the multivariate [[Bernoulli distribution|Bernoulli]] event model, features are independent [[Boolean data type|Boolean variables]] ([[binary data|binary variables]]) describing inputs. Like the multinomial model, this model is popular for document classification tasks,<ref name="mccallum"/> where binary term occurrence features are used rather than term frequencies. If <math>x_i</math> is a Boolean expressing the occurrence or absence of the {{mvar|i}}'th term from the vocabulary, then the likelihood of a document given a class <math>C_k</math> is given by:<ref name="mccallum"/> <math display="block"> p(\mathbf{x} \mid C_k) = \prod_{i=1}^n p_{ki}^{x_i} (1 - p_{ki})^{(1-x_i)} </math> where <math>p_{ki}</math> is the probability of class <math>C_k</math> generating the term <math>x_i</math>. This event model is especially popular for classifying short texts. It has the benefit of explicitly modelling the absence of terms. Note that a naive Bayes classifier with a Bernoulli event model is not the same as a multinomial NB classifier with frequency counts truncated to one.
Edit summary
(Briefly describe your changes)
By publishing changes, you agree to the
Terms of Use
, and you irrevocably agree to release your contribution under the
CC BY-SA 4.0 License
and the
GFDL
. You agree that a hyperlink or URL is sufficient attribution under the Creative Commons license.
Cancel
Editing help
(opens in new window)