Anirbit Mukherjee

Cited by

	All	Since 2019
Citations	986	902
h-index	6	6
i10-index	5	5

200

100

150

2017201820192020202120222023202419 63 106 154 163 172 181 126

Public access

View all

3 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Amitabh BasuJohns Hopkins UniversityVerified email at jhu.edu
Enayat UllahResearch Scientist, MetaVerified email at jhu.edu
Soham DeDeepMindVerified email at google.com
Raman AroraDepartment of Computer Science, Johns Hopkins UniversityVerified email at cs.jhu.edu
Poorya MianjyQR at Citadel SecuritiesVerified email at citadelsecurities.com
Sayar KarmakarAssistant Professor, University of FloridaVerified email at ufl.edu
Trac D. TranProsessor of Electrical and Computer Engineering, Johns Hopkins UniversityVerified email at jhu.edu
Akshay RangamaniNew Jersey Institute of TechnologyVerified email at njit.edu
Sang Peter ChinProfessor, Dartmouth EngineeringVerified email at dartmouth.edu
Tejaswini GanapathiVerified email at alumni.utoronto.ca
Theodore PapamarkouThe University of ManchesterVerified email at manchester.ac.uk
Pulkit GopalaniUniversity of Michigan, Ann ArborVerified email at umich.edu
Dibyakanti KumarIIT GuwahatiVerified email at alumni.iitg.ac.in
Phanideep GampaAmazonVerified email at iitbhu.ac.in
Soham DanIBM ResearchVerified email at ibm.com
Amartya RoyApplied ML Engineer at BoschVerified email at in.bosch.com
Angelo CangelosiProfessor of Machine Learning and Robotics, University of ManchesterVerified email at manchester.ac.uk
Hongbo ZhuThe University of ManchesterVerified email at manchester.ac.uk
Procheta SenLecturer/ Assistant Professor (University of Liverpool)Verified email at liverpool.ac.uk

Anirbit Mukherjee

Department of Computer Science, The University of Manchester

Verified email at manchester.ac.uk - Homepage

Deep Learning Theory Differential Equations


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Understanding deep neural networks with rectified linear units R Arora, A Basu, P Mianjy, A Mukherjee The International Conference on Learning Representations (ICLR) 2018, 2016	755	2016
Convergence guarantees for RMSProp and ADAM in non-convex optimization and their comparison to Nesterov acceleration on autoencoders S De, A Mukherjee, E Ullah arXiv preprint arXiv:1807.06766, 2018	149*	2018
Sparse coding and autoencoders A Rangamani, A Mukherjee, A Basu, A Arora, T Ganapathi, S Chin, ... 2018 IEEE International Symposium on Information Theory (ISIT), 36-40, 2018	28*	2018
Lower bounds over Boolean inputs for deep neural networks with ReLU gates A Mukherjee, A Basu arXiv preprint arXiv:1711.03073, 2017	23	2017
Provable training of a ReLU gate with an iterative non-gradient algorithm S Karmakar, A Mukherjee Neural Networks, 2022	11*	2022
Towards Size-Independent Generalization Bounds for Deep Operator Nets P Gopalani, S Karmakar, D Kumar, A Mukherjee arXiv preprint arXiv:2205.11359, 2022	6*	2022
Depth-2 neural networks under a data-poisoning attack S Karmakar, A Mukherjee, T Papamarkou Neurocomputing 532, 56-66, 2023	5	2023
A Study of the Mathematics of Deep Learning A Mukherjee Johns Hopkins University, 2020	5	2020
Global Convergence of SGD On Two Layer Neural Nets P Gopalani, A Mukherjee arXiv preprint arXiv:2210.11452, 2022	2	2022
Size Lowerbounds for Deep Operator Networks A Mukherjee, A Roy Transactions on Machine Learning Research, 2024	1	2024
Investigating the Role of Overparameterization While Solving the Pendulum with DeepONets P Gopalani, A Mukherjee The Symbiosis of Deep Learning and Differential Equations, 2021	1	2021
Investigating the Ability of PINNs to Solve Burgers’ PDE Near Finite-Time Blowup D Kumar, A Mukherjee Machine Learning: Science and Technology 5 (2), 025063, 2024		2024
Regularized Gradient Clipping Provably Trains Wide and Deep Neural Networks M Tucat, A Mukherjee arXiv preprint arXiv:2404.08624, 2024		2024
Global Convergence of SGD For Logistic Loss on Two Layer Neural Nets P Gopalani, S Jha, A Mukherjee Transactions on Machine Learning Research, 2024		2024
LIPEx--Locally Interpretable Probabilistic Explanations--To Look Beyond The True Class H Zhu, A Cangelosi, P Sen, A Mukherjee arXiv preprint arXiv:2310.04856, 2023		2023
An Empirical Study of the Occurrence of Heavy-Tails in Training a ReLU Gate S Karmakar, A Mukherjee arXiv preprint arXiv:2204.12554, 2022		2022
Dynamics of Local Elasticity During Training of Neural Nets S Dan, A Mukherjee, A Das, P Gampa arXiv preprint arXiv:2111.01166, 2021		2021
Identifying stochastic oracles for fast convergence of RMSProp AM Jiayao Zhang Deep Math 2020, 2020		2020
Improving PAC-Bayes bounds on risk of neural nets using geometrical properties of training A Mukherjee, D Roy, P Rastogi, J Yang ICML 2019 Workshop, Understanding and Improving Generalization in Deep Learning, 2019		2019
Renyi entropy of the critical O (N) model A Mukherjee arXiv preprint arXiv:1512.01226, 2015		2015

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors