Slide 3
Slide 3 text
Naive Bayes classifier
• Estimating the class-conditional probability P(y|x)
◦ x is a vector of term frequencies {x1
, . . . , xn
}
P(x|y) = P(x1, . . . , xn|y)
• “Naive” assumption: features (terms) are independent:
P(x|y) =
n
i=1
P(xi|y)
• Putting our choices together, the probability that x belongs to class y is
estimated using:
P(y|x) ∝ P(y)
n
i=1
P(xi|y)
3 / 8