NLP 20

0. In a corpus C2, the maximum likelihood estimate (MLE) of the bigram probability of "dried berries", P(berries | dried), is 0.3, and the word "dried" occurs 580 times. After add-one smoothing, the probability of "dried berries" is 0.04. What is the vocabulary size of C2?
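
One way to work this out, assuming the MLE is the conditional probability c(dried berries) / c(dried) and that add-one smoothing gives (c(dried berries) + 1) / (c(dried) + V), where V is the vocabulary size: the bigram count is 0.3 × 580 = 174, so (174 + 1) / (580 + V) = 0.04, which gives 580 + V = 4375 and V = 3795. The short sketch below checks that arithmetic; the function name `vocabulary_size` and its parameters are illustrative and not part of the original question.

```python
from fractions import Fraction


def vocabulary_size(mle, unigram_count, smoothed):
    """Solve (c_bigram + 1) / (c_unigram + V) = smoothed for V.

    Assumes mle = c_bigram / c_unigram (the unsmoothed bigram estimate).
    """
    bigram_count = mle * unigram_count  # c(dried berries) = MLE * c(dried)
    return (bigram_count + 1) / smoothed - unigram_count


# Exact fractions avoid floating-point rounding in 0.3 and 0.04.
V = vocabulary_size(Fraction(3, 10), 580, Fraction(1, 25))
print(V)  # 3795
```

Under these assumptions the vocabulary size of C2 is 3795.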
