Data Mining Reading List
Spring 2008
Pre-Processing
Christopher J.C. Burges.
Geometric Methods for Feature Extraction and Dimensional Reduction: A Guided Tour
. Microsoft Research. TR, 2004.
J.B. Tenenbaum, V. de Silva and J. C. Langford.
A global geometric framework for nonlinear dimensionality reduction
. Science, vol. 290, pp. 2319--2323, 2000.
PN Belhumeur, JP Hespanha, DJ Kriegman.
Eigenfaces vs. Fisherfaces: recognition using class specific linear projection,
. TPAMI, 1997
Clustering
J. Bilmes.
A Gentle Tutorial on the EM Algorithm and its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models
.
Arindam Banerjee, Srujana Merugu, Inderjit S. Dhillon, Joydeep Ghosh.
Clustering with Bregman Divergences
. Journal of Machine Learning Research, 6(Oct):1705-1749, 2005.
(Locality sensitive hashing): Taher H. Haveliwala, Aristides Gionis, Piotr Indyk.
Scalable Techniques for Clustering the Web
.
Tilman Lange, Volker Roth, Mikio L. Braun and Joachim M. Buhmann.
Stability-Based Validation of Clustering Solutions
. Neural Computation 16, pp. 1299-1323, MIT Press, 2003.
Classification
Andrew Ng, Michael Jordan.
On Discriminative Vs Generative Classifiers: A Comparison of Logistic Regression and Naive Bayes
.
Nick Littlestone and Manfred Warmuth.
The Weighted Majority Algorithm
.
K.-R. Müller, S. Mika, G. Rätsch, K. Tsuda, and B. Schölkopf.
An introduction to kernel-based learning algorithms
. IEEE Neural Networks, 12(2):181-201, 2001.
Robert E. Schapire, Yoav Freund, Peter Bartlett and Wee Sun Lee.
Boosting the margin: A new explanation for the effectiveness of voting methods
. The Annals of Statistics, 26(5):1651-1686, 1998.
Erin L. Allwein, Robert E. Schapire and Yoram Singer.
Reducing multiclass to binary: A unifying approach for margin classifiers
. Journal of Machine Learning Research, 1:113-141, 2000.
L. Getoor, N. Friedman, D. Koller, A. Pfeffer.
Learning Probabilistic Relational Models.
. Invited contribution to the book Relational Data Mining, S. Dzeroski and N. Lavrac, Eds., Springer-Verlag, 2001.
Web
J. Kleinberg.
Authoritative sources in a hyperlinked environment
. Proc. 9th ACM-SIAM Symposium on Discrete Algorithms, 1998.
Abhinandan Das, Mayur Datar, Ashutosh Garg, Shyam Rajaram.
Google News Personalization: Scalable Online Collaborative Filtering
. Proceedings of WWW 2007, pp. 271-280.