Information-theoretic metric learning JV Davis, B Kulis, P Jain, S Sra, IS Dhillon Proceedings of the 24th international conference on Machine learning, 209-216, 2007 | 2659 | 2007 |
Clustering on the Unit Hypersphere using von Mises-Fisher Distributions. A Banerjee, IS Dhillon, J Ghosh, S Sra Journal of Machine Learning Research 6 (9), 2005 | 1168 | 2005 |
Optimization for machine learning S Sra, S Nowozin, SJ Wright Mit Press, 2012 | 993 | 2012 |
Contrastive learning with hard negative samples J Robinson, CY Chuang, S Sra, S Jegelka ICLR 2021, 2021 | 692 | 2021 |
Stochastic Variance Reduction for Nonconvex Optimization SJ Reddi, A Hefny, S Sra, B Póczós, A Smola International Conference on Machine Learning (ICML), 2016 | 652 | 2016 |
Generalized nonnegative matrix approximations with Bregman divergences S Sra, I Dhillon Advances in neural information processing systems 18, 2005 | 649 | 2005 |
Minimum sum-squared residue co-clustering of gene expression data H Cho, IS Dhillon, Y Guan, S Sra Proceedings of the 2004 SIAM international conference on data mining, 114-125, 2004 | 438 | 2004 |
Why gradient clipping accelerates training: A theoretical justification for adaptivity J Zhang, T He, S Sra, A Jadbabaie arXiv:1905.11881 (ICLR 2020), 2019 | 417 | 2019 |
Efficient filter flow for space-variant multiframe blind deconvolution M Hirsch, S Sra, B Schölkopf, S Harmeling 2010 IEEE Computer Society Conference on Computer Vision and Pattern …, 2010 | 307 | 2010 |
First-order methods for geodesically convex optimization H Zhang, S Sra Conference on Learning Theory (COLT), 2016 | 296 | 2016 |
Why are adaptive methods good for attention models? J Zhang, SP Karimireddy, A Veit, S Kim, S Reddi, S Kumar, S Sra Advances in Neural Information Processing Systems 33, 15383-15393, 2020 | 279* | 2020 |
Riemannian SVRG: Fast stochastic optimization on Riemannian manifolds H Zhang, S J Reddi, S Sra Advances in Neural Information Processing Systems 29, 2016 | 269 | 2016 |
Proximal stochastic methods for nonsmooth nonconvex finite-sum optimization SJ Reddi, S Sra, B Poczos, A Smola Advances in Neural Information Processing Systems, 2016 | 239 | 2016 |
Randomized nonlinear component analysis D Lopez-Paz, S Sra, A Smola, Z Ghahramani, B Schölkopf International conference on machine learning, 1359-1367, 2014 | 228 | 2014 |
Geometric Mean Metric Learning PH Zadeh, R Hosseini, S Sra International Conference on Machine Learning (ICML), 2016 | 209 | 2016 |
On variance reduction in stochastic gradient descent and its asynchronous variants SJ Reddi, A Hefny, S Sra, B Poczos, AJ Smola Advances in neural information processing systems, 2647-2655, 2015 | 208 | 2015 |
Entropic metric alignment for correspondence problems J Solomon, G Peyré, VG Kim, S Sra ACM Transactions on Graphics (ToG) 35 (4), 1-13, 2016 | 199 | 2016 |
A short note on parameter approximation for von Mises-Fisher distributions: and a fast implementation of I s (x) S Sra Computational Statistics 27, 177-190, 2012 | 197 | 2012 |
Jensen-bregman logdet divergence with application to efficient similarity search for covariance matrices A Cherian, S Sra, A Banerjee, N Papanikolopoulos IEEE transactions on pattern analysis and machine intelligence 35 (9), 2161-2174, 2012 | 196 | 2012 |
Positive definite matrices and the S-divergence S Sra Proceedings of the American Mathematical Society 144 (7), 2787-2797, 2016 | 186 | 2016 |