The “god-father” of LDA, David Blei, recently published a tutorial on Bayesian Non-parametric Models, with one of his student. The whole tutorial is easy-reading and provides very clear overview of Bayesian Non-parametric Models. In particular, Chinese Restaurant Process (CRP) and Indian Buffet Process are discussed in a very intuitive way. For those who are interests in technical details about these models, this tutorial may be just a starting point and the Appendix points out several ways to discuss models more formally, including inference algorithms.
One specific interesting property shown in this tutorial is the “exchangeable” property for CRP, which I wish to re-state as below.
Let be the table assignment of the th customer. A draw from CPR can be generated by sequentially assigning observations to classes with probability:
where is the number of customers sitting at table , and is the number of tables for which . The parameter is called the concentration parameter. The CRP exhibits an important invariance property: The cluster assignments under this distribution are exchangeable. This means is unchanged if the order of customers is shuffled.
Consider the joint distribution of a set of customer assignments . It decomposes according to the chain rule:
where each terms comes from above equation. To show that this distribution is exchangeable, we will introduce some new notation. Let be the number of groups in which these assignments place the customers, which is a number between 1 and . Let be the set of indices of customers assigned to the th group, and let be the number of customers assigned to that group. Now, for a particular group the joint probability of all assignments in this group is:
where each term in the equation represents a customer. The numerator can be re-written as . Therefore, we have:
Finally, notice that the union of across all groups identifies each index once, because each customer is assigned to exactly one group. This simplifies the denominator and let us write the joint as:
This equation only depends on the number of groups and the size of each group .