Huishuai Zhang

Cited by

	All	Since 2019
Citations	2730	2535
h-index	24	22
i10-index	35	33

Citations per year: 2015: 10, 2016: 32, 2017: 64, 2018: 79, 2019: 132, 2020: 158, 2021: 284, 2022: 528, 2023: 801, 2024: 630

Public access

15 articles available
2 articles not available

Based on funding mandates

Co-authors

Tie-Yan Liu - Distinguished Scientist, Microsoft Research AI4Science | IEEE Fellow | ACM Fellow | AAIA Fellow - Verified email at microsoft.com
Yingbin Liang - The Ohio State University - Verified email at osu.edu
Wei Chen (陈薇) - Institute of Computing Technology, Chinese Academy of Sciences - Verified email at ict.ac.cn
Da Yu - Sun Yat-sen University - Verified email at mail2.sysu.edu.cn
Di He - Peking University - Verified email at pku.edu.cn
Yuejie Chi - Carnegie Mellon University - Verified email at cmu.edu
Shuxin Zheng - Principal Researcher, Microsoft Research - Verified email at microsoft.com
Liwei Wang - Professor, Peking University - Verified email at cis.pku.edu.cn
Yi Zhou - University of Utah - Verified email at utah.edu
Janardhan Kulkarni - Microsoft Research, Redmond - Verified email at cs.washington.edu
Lifeng Lai - Professor, University of California, Davis - Verified email at ucdavis.edu
Qi Meng - Chinese Academy of Mathematics and Systems Science, CAS - Verified email at amss.ac.cn
Yin Tat Lee - Paul G. Allen School of Computer Science & Engineering, University of Washington - Verified email at uw.edu
Gautam Kamath - Assistant Professor @ University of Waterloo, Faculty Member @ Vector Institute - Verified email at uwaterloo.ca
Sergey Yekhanin - Microsoft - Verified email at microsoft.com
Caiming Xiong - Salesforce Research - Verified email at salesforce.com
Shlomo Shamai (Shitz) - Distinguished Professor, Technion - Israel Institute of Technology - Verified email at ee.technion.ac.il
Hua Wang - Qualcomm - Verified email at qti.qualcomm.com

Huishuai Zhang

Peking University

Verified email at pku.edu.cn

Deep Learning, Optimization, Information Theory


Title	Authors	Venue	Cited by	Year
On layer normalization in the transformer architecture	R Xiong, Y Yang, D He, K Zheng, S Zheng, C Xing, H Zhang, Y Lan, ...	International Conference on Machine Learning, 10524-10533, 2020	866	2020
A nonconvex approach for phase retrieval: Reshaped Wirtinger flow and incremental algorithms	H Zhang, Y Liang, Y Chi	Journal of Machine Learning Research 18 (141), 1-35, 2017	307*	2017
Differentially private fine-tuning of language models	D Yu, S Naik, A Backurs, S Gopi, HA Inan, G Kamath, J Kulkarni, YT Lee, ...	arXiv preprint arXiv:2110.06500, 2021	241	2021
Provable non-convex phase retrieval with outliers: Median truncated Wirtinger flow	H Zhang, Y Chi, Y Liang	International Conference on Machine Learning, 1022-1031, 2016	140*	2016
Block-diagonal Hessian-free optimization for recurrent and convolutional neural networks	H Zhang, C Xiong	US Patent 11,386,327, 2022	98*	2022
Do not let privacy overbill utility: Gradient embedding perturbation for private learning	D Yu, H Zhang, W Chen, TY Liu	arXiv preprint arXiv:2102.12677, 2021	96	2021
Large scale private learning via low-rank reparametrization	D Yu, H Zhang, W Chen, J Yin, TY Liu	International Conference on Machine Learning, 12208-12218, 2021	82	2021
SGD converges to global minimum in deep learning via star-convex path	Y Zhou, J Yang, H Zhang, Y Liang, V Tarokh	arXiv preprint arXiv:1901.00451, 2019	73	2019
Availability attacks create shortcuts	D Yu, H Zhang, W Chen, J Yin, TY Liu	Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and …, 2022	63*	2022
Adaptive inertia: Disentangling the effects of adaptive learning rate and momentum	Z Xie, X Wang, H Zhang, I Sato, M Sugiyama	International Conference on Machine Learning, 24430-24459, 2022	54*	2022
How does data augmentation affect privacy in machine learning?	D Yu, H Zhang, W Chen, J Yin, TY Liu	Proceedings of the AAAI Conference on Artificial Intelligence 35 (12), 10746 …, 2021	50	2021
Understanding generalization error of SGD in nonconvex optimization	Y Zhou, Y Liang, H Zhang	Machine Learning, 1-31, 2022	45*	2022
Gradient perturbation is underrated for differentially private convex optimization	D Yu, H Zhang, W Chen, TY Liu, J Yin	arXiv preprint arXiv:1911.11363, 2019	41	2019
Exploring the limits of differentially private deep learning with group-wise clipping	J He, X Li, D Yu, H Zhang, J Kulkarni, YT Lee, A Backurs, N Yu, J Bian	arXiv preprint arXiv:2212.01539, 2022	36	2022
Convergence of distributed stochastic variance reduced methods without sampling extra data	S Cen, H Zhang, Y Chi, W Chen, TY Liu	IEEE Transactions on Signal Processing 68, 3976-3989, 2020	31	2020
Non-convex low-rank matrix recovery with arbitrary outliers via median-truncated gradient descent	Y Li, Y Chi, H Zhang, Y Liang	Information and Inference: A Journal of the IMA 9 (2), 289-325, 2020	31	2020
G-SGD: Optimizing ReLU Neural Networks in its Positively Scale-Invariant Space	Q Meng, S Zheng, H Zhang, W Chen, ZM Ma, TY Liu	arXiv preprint arXiv:1802.03713, 2018	30	2018
Normalized/clipped SGD with perturbation for differentially private non-convex optimization	X Yang, H Zhang, W Chen, TY Liu	arXiv preprint arXiv:2206.13033, 2022	29	2022
The capacity region of the source-type model for secret key and private key generation	H Zhang, L Lai, Y Liang, H Wang	IEEE Transactions on Information Theory 60 (10), 6389-6398, 2014	29*	2014
Stabilize deep ResNet with a sharp scaling factor	H Zhang, D Yu, M Yi, W Chen, TY Liu	Machine Learning 111 (9), 3359-3392, 2022	27*	2022


Articles 1–20
