Poisson Matrix Factorization 1
In recent years, Matrix Factorization (MF) methods play a pivotal role in the research of Collaborative Filtering and Recommender Systems. The basic assumption is that, the data can be formed into a matrix
where
is the number of users and
is the number of items. Each rating
is modeled as a dot-product of the latent factor
for the user
and the latent factor
for the item
. In the classic Probabilistic Matrix Factorization, the rating
is defined as:
(1)
(2)
- It does not prevent that the ratings become negative, which is a natural result of the Gaussian assumption.
- If no heuristics are applied, the model needs to model all zero ratings and therefore, dramatically impacting the predictive performance.
In order to demonstrate the second point, we could see that the log likelihood of the data can be give as:
(3)
data:image/s3,"s3://crabby-images/0b8dd/0b8dd54ea4da42b3529129dd05d73257c7cc6329" alt="Rendered by QuickLaTeX.com C"
data:image/s3,"s3://crabby-images/c992e/c992eaaea1ab8342ae1c88bdaa46b29644bff638" alt="Rendered by QuickLaTeX.com X_{i,j}"
Recently, Prem Gopalan et al. [3, 4, 5, 6] have proposed a new model called Poisson Factorization (PF) to address these two issues. The central idea is to replace Gaussian assumption with Poisson distribution:
(4)
data:image/s3,"s3://crabby-images/afc16/afc161eefdfa67ea5bdfba6241e4b6895fb47377" alt="Rendered by QuickLaTeX.com \theta_{i}"
data:image/s3,"s3://crabby-images/392fa/392fa924613f28f0d4947ebdf57ae691c0baf26b" alt="Rendered by QuickLaTeX.com \phi_{j}"
data:image/s3,"s3://crabby-images/c992e/c992eaaea1ab8342ae1c88bdaa46b29644bff638" alt="Rendered by QuickLaTeX.com X_{i,j}"
(5)
(6)
data:image/s3,"s3://crabby-images/f832a/f832abc45db8f2ab29a85e6b45f79244eee11668" alt="Rendered by QuickLaTeX.com 0!=1"
data:image/s3,"s3://crabby-images/c992e/c992eaaea1ab8342ae1c88bdaa46b29644bff638" alt="Rendered by QuickLaTeX.com X_{i,j}"
data:image/s3,"s3://crabby-images/29273/292732396b267f8ea90bb3ea896605cda233d9fe" alt="Rendered by QuickLaTeX.com 1"
data:image/s3,"s3://crabby-images/c992e/c992eaaea1ab8342ae1c88bdaa46b29644bff638" alt="Rendered by QuickLaTeX.com X_{i,j}"
Another interesting property of PF, which is mentioned in [5, 6], is that, we can rewrite the Poisson observation model as a two stage process where a user first decides on a budget
she has to spend on items, and then spends this budget rating items that she is interested in:
(7)
Reference
- Ruslan Salakhutdinov, Andriy Mnih: Probabilistic Matrix Factorization. NIPS 2007: 1257-1264
- Yifan Hu, Yehuda Koren, Chris Volinsky: Collaborative Filtering for Implicit Feedback Datasets. ICDM 2008: 263-272
-
Prem Gopalan, Laurent Charlin, David M. Blei: Content-based recommendations with Poisson factorization. NIPS 2014: 3176-3184
-
Prem Gopalan, Francisco J. Ruiz, Rajesh Ranganath, David M. Blei: Bayesian Nonparametric Poisson Factorization for Recommendation Systems. AISTATS 2014: 275-283
- Prem Gopalan, Jake M. Hofman, David M. Blei: Scalable Recommendation with Poisson Factorization. CoRR abs/1311.1704 (2013)
- Prem Gopalan, Jake M. Hofman, David M. Blei: Scalable Recommendation with Poisson Factorization. UAI 2015.