Follow
Mor Shpigel Nacson
Mor Shpigel Nacson
PhD Student, Technion
Verified email at campus.technion.ac.il
Title
Cited by
Cited by
Year
The implicit bias of gradient descent on separable data
D Soudry, E Hoffer, MS Nacson, S Gunasekar, N Srebro
The Journal of Machine Learning Research 19 (1), 2822-2878, 2018
8702018
Convergence of gradient descent on separable data
MS Nacson, J Lee, S Gunasekar, PHP Savarese, N Srebro, D Soudry
The 22nd International Conference on Artificial Intelligence and Statistics …, 2019
1452019
Stochastic gradient descent on separable data: Exact convergence with a fixed learning rate
MS Nacson, N Srebro, D Soudry
The 22nd International Conference on Artificial Intelligence and Statistics …, 2019
872019
On the implicit bias of initialization shape: Beyond infinitesimal mirror descent
S Azulay, E Moroshko, MS Nacson, BE Woodworth, N Srebro, ...
International Conference on Machine Learning, 468-477, 2021
582021
Lexicographic and depth-sensitive margins in homogeneous and non-homogeneous deep models
MS Nacson, S Gunasekar, J Lee, N Srebro, D Soudry
International Conference on Machine Learning, 4683-4692, 2019
562019
Implicit bias of the step size in linear diagonal neural networks
MS Nacson, K Ravichandran, N Srebro, D Soudry
International Conference on Machine Learning, 16270-16295, 2022
332022
TAEN: temporal aware embedding network for few-shot action recognition
R Ben-Ari, MS Nacson, O Azulai, U Barzelay, D Rotman
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
202021
At Stability's Edge: How to Adjust Hyperparameters to Preserve Minima Selection in Asynchronous Training of Neural Networks?
N Giladi, MS Nacson, E Hoffer, D Soudry
arXiv preprint arXiv:1909.12340, 2019
192019
Gradient Descent Monotonically Decreases the Sharpness of Gradient Flow Solutions in Scalar Networks and Beyond
I Kreisler, MS Nacson, D Soudry, Y Carmon
arXiv preprint arXiv:2305.13064, 2023
72023
The implicit bias of minima stability in multivariate shallow relu networks
MS Nacson, R Mulayoff, G Ongie, T Michaeli, D Soudry
arXiv preprint arXiv:2306.17499, 2023
32023
Action recognition using limited data
R Ben-Ari, O Azulai, U Barzelay, MS Nacson
US Patent App. 17/219,322, 2022
12022
How Uniform Random Weights Induce Non-uniform Bias: Typical Interpolating Neural Networks Generalize with Narrow Teachers
G Buzaglo, I Harel, MS Nacson, A Brutzkus, N Srebro, D Soudry
arXiv preprint arXiv:2402.06323, 2024
2024
How Learning Rate and Delay Affect Minima Selection in Asynchronous Training of Neural Networks
N Giladi, MS Nacson, E Hoffer, D Soudry
The system can't perform the operation now. Try again later.
Articles 1–13