Yi Zhang

Cited by

	All	Since 2019
Citations	5517	4997
h-index	22	22
i10-index	28	28

1900

950

475

1425

20162017201820192020202120222023202438 119 344 447 549 565 532 1878 1001

Public access

View all

7 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Sanjeev AroraProfessor of Computer Science, Princeton UniversityVerified email at cs.princeton.edu
Sebastien BubeckVP GenAI Research, Microsoft AIVerified email at microsoft.com
Ronen EldanWeizmann InstituteVerified email at weizmann.ac.il
Yin Tat LeePaul G. Allen School of Computer Science & Engineering, University of WashingtonVerified email at uw.edu
Yuanzhi LiAssistant Professor at CMUVerified email at andrew.cmu.edu
Rong GeDuke UniversityVerified email at cs.duke.edu
Cyril ZhangMicrosoft Research NYCVerified email at microsoft.com
Eric HorvitzMicrosoftVerified email at microsoft.com
Hamid PalangiMicrosoft Research and University of WashingtonVerified email at microsoft.com
Marco Tulio RibeiroGoogle DeepMindVerified email at cs.washington.edu
Harsha NoriMicrosoft ResearchVerified email at microsoft.com
Scott LundbergGoogle DeepMindVerified email at google.com
Ece KamarMicrosoft ResearchVerified email at microsoft.com
Varun ChandrasekaranUniversity of Illinois Urbana-ChampaignVerified email at illinois.edu
Holden LeeAssistant Professor of Applied Mathematics and Statistics, Johns Hopkins UniversityVerified email at jhu.edu
Elad HazanProfessor at Princeton University and Director Google AI PrincetonVerified email at princeton.edu
Honglak LeeLG AI Research / U. MichiganVerified email at umich.edu
Yuting ZhangAmazon Web ServicesVerified email at amazon.com
Tengyu MAStanford UniversityVerified email at stanford.edu
Behnam NeyshaburSenior Staff Research Scientist, DeepMindVerified email at google.com

Yi Zhang

Senior Researcher at Microsoft Research Redmond

Verified email at microsoft.com - Homepage

Machine Learning Theory of Deep Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Sparks of artificial general intelligence: Early experiments with gpt-4 S Bubeck, V Chandrasekaran, R Eldan, J Gehrke, E Horvitz, E Kamar, ... arXiv preprint arXiv:2303.12712, 2023	1890	2023
Generalization and Equilibrium in Generative Adversarial Nets (GANs) S Arora, R Ge, Y Liang, T Ma, Y Zhang arXiv preprint arXiv:1703.00573, 2017	776	2017
Stronger generalization bounds for deep nets via a compression approach S Arora, R Ge, B Neyshabur, Y Zhang International Conference on Machine Learning, 254-263, 2018	661	2018
Convolutional neural networks with low-rank regularization C Tai, T Xiao, Y Zhang, X Wang arXiv preprint arXiv:1511.06067, 2015	515	2015
Deep visual analogy-making SE Reed, Y Zhang, Y Zhang, H Lee Advances in neural information processing systems 28, 2015	345	2015
Do GANs actually learn the distribution? An empirical study S Arora, Y Zhang arXiv:1706.08224, 2017	190	2017
Textbooks are all you need S Gunasekar, Y Zhang, J Aneja, CCT Mendes, A Del Giorno, S Gopi, ... arXiv preprint arXiv:2306.11644, 2023	184	2023
Do GANs learn the distribution? some theory and empirics S Arora, A Risteski, Y Zhang International Conference on Learning Representations, 2018	171	2018
Spectral filtering for general linear dynamical systems E Hazan, H Lee, K Singh, C Zhang, Y Zhang Advances in Neural Information Processing Systems 31, 2018	94	2018
Explaining Landscape Connectivity of Low-cost Solutions for Multilayer Nets R Kuditipudi, X Wang, H Lee, Y Zhang, Z Li, W Hu, S Arora, R Ge arXiv:1906.06247, 2019	86	2019
Towards Understanding the Invertibility of Convolutional Neural Networks CA Gilbert, Y Zhang, K Lee, Y Zhang, H Lee arXiv preprint arXiv:1705.08664, 2017	75	2017
Efficient full-matrix adaptive regularization N Agarwal, B Bullins, X Chen, E Hazan, K Singh, C Zhang, Y Zhang International Conference on Machine Learning, 102-110, 2019	59	2019
What makes convolutional models great on long sequence modeling? Y Li, T Cai, Y Zhang, D Chen, D Dey arXiv preprint arXiv:2210.09298, 2022	57	2022
Over-parameterized Adversarial Training: An Analysis Overcoming the Curse of Dimensionality Y Zhang, O Plevrakis, SS Du, X Li, Z Song, S Arora arXiv:2002.06668, 2020	49	2020
Unveiling transformers with lego: a synthetic reasoning task Y Zhang, A Backurs, S Bubeck, R Eldan, S Gunasekar, T Wagner arXiv preprint arXiv:2206.04301, 2022	48	2022
Why are convolutional nets more sample-efficient than fully-connected nets? Z Li, Y Zhang, S Arora arXiv preprint arXiv:2010.08515, 2020	48	2020
Calibration, Entropy Rates, and Memory in Language Models M Braverman, X Chen, SM Kakade, K Narasimhan, C Zhang, Y Zhang arXiv preprint arXiv:1906.05664, 2019	35	2019
Phi-2: The surprising power of small language models M Javaheripi, S Bubeck, M Abdin, J Aneja, S Bubeck, CCT Mendes, ... Microsoft Research Blog, 2023	32	2023
Towards provable control for unknown linear dynamical systems S Arora, E Hazan, H Lee, K Singh, C Zhang, Y Zhang	26	2018
Not-So-Random Features B Brian, Z Cyril, Z Yi arXiv:1710.10230, 2017	25*	2017

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors