Bibliography

Bar98
P. Bartlett.
The size of the weights is more important than the size of the network.
IEEE Transactions on Information Theory, 44(2):525-536, 1998.

Bel61
R. Bellman.
Adaptive Control Processes: A Guided Tour.
Princeton University Press, 1961.

BGV92
B. Boser, I. Guyon, and V. Vapnik.
Optimal margin classifiers.
In In Fifth Annual Workshop on Computational Learning Theory, pages 144-152, 1992.

CGBNT02
K. Crammer, R. Gilad-Bachrach, A. Navot, and N. Tishby.
Margin analysis of the lvq algorithm.
In Proc. 17'th Conference on Neural Information Processing Systems (NIPS), 2002.

CH67
T.M. Cover and P.E. Hart.
Nearest neighbor pattern classifier.
IEEE Transactions on Information Theory, 13:21-27, 1967.

FH51
E. Fix and j. Hodges.
Discriminatory analysis. nonparametric discrimination: Consistency properties.
Technical Report 4, USAF school of Aviation Medicine, 1951.

FS97
Y. Freund and R. E. Schapire.
A decision-theoretic generalization of on-line learning and an application to boosting.
Journal of Computer and System Sciences, 55(1):119-139, 1997.

GBNT04
R. Gilad-Bachrach, A. Navot, and N. Tishby.
Margin based feature selection - theory and algorithms.
In Proc. 21'st International Conference on Machine Learning (ICML), pages 337-343, 2004.

GBNTar
R. Gilad-Bachrach, A. Navot, and N. Tishby.
Large margin principles for feature selection.
In I. Guyon and S. Gunn, editors, Feature Extraction, Foundations and Applications. Springer, to appear.

GE03
I. Guyon and A. Elisseeff.
An introduction to variable and feature selection.
Journal of Machine Learnig Research, pages 1157-1182, Mar 2003.

GG03
I. Guyon and S. Gunn.
Nips feature selection challenge.
http://www.nipsfsc.ecs.soton.ac.uk/, 2003.

KJ97
R. Kohavi and G.H. John.
Wrapper for feature subset selection.
Artificial Intelligence, 97(1-2):273-324, 1997.

Koh95
T. Kohonen.
Self-Organizing Maps.
Springer-Verlag, 1995.

Kon94
I. Kononenko.
Estimating attributes: Analysis and extensions of RELIEF.
In Proc. European Conference on Machine Learning, pages 171-182, 1994.

KR92
K. Kira and L. Rendell.
A practical approach to feature selection.
In Proc. 9th International Workshop on Machine Learning, pages 249-256, 1992.

MB98
A.M. Martinez and R. Benavente.
The ar face database.
Technical report, CVC Tech. Rep. #24, 1998.
http://rvl1.ecn.purdue.edu/$\sim$aleix/aleix_face_DB.html.

Qui90
J. R. Quinlan.
Induction of decision trees.
In Jude W. Shavlik and Thomas G. Dietterich, editors, Readings in Machine Learning. Morgan Kaufmann, 1990.
Originally published in Machine Learning 1:81-106, 1986.

SFBL98
R. E. Schapire, Y. Freund, P. Bartlett, and W. S. Lee.
Boosting the margin : A new explanation for the effectiveness of voting methods.
Annals of Statistics, 1998.

STBWA98
J. Shawe-Taylor, P.L. Bartlett, R.C. Williamson, and M. Anthony.
Structural risk minimization over data-dependent hierarchies.
IEEE transactions on Information Theory, 44(5):1926-1940, 1998.

WEBS04
J. Weston, A. Elisseeff, G. BakIr, and F. Sinz.
The spider, 2004.
http://www.kyb.tuebingen.mpg.de/bs/people/spider/.

WMC+00
J. Weston, S. Mukherjee, O. Chapelle, M. Pontil, T. Poggio, and V. Vapnik.
Feature selection for SVMs.
In Proc. 15th Conference on Neural Information Processing Systems (NIPS), pages 668-674, 2000.



Ran Gilad-Bachrach 2004-12-07