TY - GEN
T1 - Deep metric learning with data summarization
AU - Wang, Wenlin
AU - Chen, Changyou
AU - Chen, Wenlin
AU - Rai, Piyush
AU - Carin, Lawrence
N1 - Publisher Copyright: © Springer International Publishing AG 2016.
PY - 2016
Y1 - 2016
N2 - We present Deep Stochastic Neighbor Compression (DSNC), a framework to compress training data for instance-based methods (such as k-nearest neighbors). We accomplish this by inferring a smaller set of pseudo-inputs in a new feature space learned by a deep neural network. Our framework can equivalently be seen as jointly learning a nonlinear distance metric (induced by the deep feature space) and learning a compressed version of the training data. In particular, compressing the data in a deep feature space makes DSNC robust against label noise and issues such as within-class multi-modal distributions. This leads to DSNC yielding better accuracies and faster predictions at test time, as compared to other competing methods. We conduct comprehensive empirical evaluations, on both quantitative and qualitative tasks, and on several benchmark datasets, to show its effectiveness as compared to several baselines.
AB - We present Deep Stochastic Neighbor Compression (DSNC), a framework to compress training data for instance-based methods (such as k-nearest neighbors). We accomplish this by inferring a smaller set of pseudo-inputs in a new feature space learned by a deep neural network. Our framework can equivalently be seen as jointly learning a nonlinear distance metric (induced by the deep feature space) and learning a compressed version of the training data. In particular, compressing the data in a deep feature space makes DSNC robust against label noise and issues such as within-class multi-modal distributions. This leads to DSNC yielding better accuracies and faster predictions at test time, as compared to other competing methods. We conduct comprehensive empirical evaluations, on both quantitative and qualitative tasks, and on several benchmark datasets, to show its effectiveness as compared to several baselines.
UR - https://www.scopus.com/pages/publications/84988624518
U2 - 10.1007/978-3-319-46128-1_49
DO - 10.1007/978-3-319-46128-1_49
M3 - Conference contribution
SN - 9783319461274
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 777
EP - 794
BT - Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2016, Proceedings
A2 - Giuseppe, Jilles
A2 - Landwehr, Niels
A2 - Manco, Giuseppe
A2 - Frasconi, Paolo
PB - Springer Verlag
T2 - 15th European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2016
Y2 - 19 September 2016 through 23 September 2016
ER -