[AGH+13]Sanjeev Arora, Rong Ge, Yoni Halpern, David Mimno, Ankur Moitra, David Sontag, Yichen Wu, and Michael Zhu. A practical algorithm for topic modeling with provable guarantees. In Proceedings of the 30th International Conference on Machine Learning. ACM, 2013.
[Bis07]Christopher M. Bishop. Pattern Recognition and Machine Learning. Springer, New York, 2007.
[Ble12]David Blei. Introduction to probabilistic topic models. Communications of the ACM, 55(4):77–84, 2012. doi:10.1145/2133806.2133826.
[BL06]David M. Blei and John D. Lafferty. Dynamic topic models. In Proceedings of the 23rd International Conference on Machine Learning, 113–120. Pittsburgh, PA, 2006. ACM.
[BNJ03]David M. Blei, Andrew Y. Ng, and Michael I. Jordan. Latent dirichlet allocation. Journal of Machine Learning Research, 3:993—1022, 2003. URL:
[BAG11]Michael Brennan, Sadia Afroz, and Rachel Greenstadt. Adversarial stylometry: circumventing authorship recognition to preserve privacy and anonymity. ACM Transactions on Information and System Security, 1(1):1:1–1:21, 2011.
[CB01]George Casella and Roger L. Berger. Statistical Inference. Duxbury Press, 2 edition, 2001.
[CG95]Kenneth W Church and William A Gale. Poisson mixtures. Natural Language Engineering, 1:163—190, 1995. URL:
[Dun93]Ted Dunning. Accurate methods for the statistics of surprise and coincidence. Computational Linguistic, 19:61–74, 1993.
[GH06]Andrew Gelman and Jennifer Hill. Data Analysis Using Regression and Multilevel/Hierarchical Models. Cambridge University Press, 2006.
[Hof09]Peter D. Hoff. A First Course in Bayesian Statistical Methods. Springer, New York, 2009.
[Hoo07]David L. Hoover. Corpus stylistics, stylometry, and the styles of henry james. Style, 2007. URL:
[Kad11]Joseph B. Kadane. Principles of Uncertainty. Chapman & Hall/CRC, 2011.
[Lee04]Peter M. Lee. Bayesian Statistics: An Introduction. Wiley, London, 3 edition, 2004.
[MRS08]Christopher Manning, Prabhakar Raghavan, and Hinrich Schütze. Introduction to information retrieval. Cambridge University Press, 2008. URL:
[PSD00]Jonathan K. Pritchard, Matthew Stephens, and Peter Donnelly. Inference of population structure using multilocus genotype data. Genetics, 155(2):945 –959, June 2000.
[RPW+02]Noah A. Rosenberg, Jonathan K. Pritchard, James L. Weber, Howard M. Cann, Kenneth K. Kidd, Lev A. Zhivotovsky, and Marcus W. Feldman. Genetic structure of human populations. Science, 298(5602):2381 –2385, December 2002. URL:, doi:10.1126/science.1078311.
[MacKay03]David J. C. MacKay. Information Theory, Inference, and Learning Algorithms. Cambridge University Press, Cambridge, 2003. URL: