FedMD: Heterogenous federated learning via model distillation D Li, J Wang arXiv preprint arXiv:1910.03581, 2019 | 821 | 2019 |

Understanding robustness of transformers for image classification S Bhojanapalli, A Chakrabarti, D Glasner, D Li, T Unterthiner, A Veit Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 403 | 2021 |

A proof of the conformal collider bounds DM Hofman, D Li, D Meltzer, D Poland, F Rejon-Barrera Journal of High Energy Physics 2016 (6), 1-40, 2016 | 172 | 2016 |

On information loss in AdS3/CFT2 AL Fitzpatrick, J Kaplan, D Li, J Wang Journal of High Energy Physics 2016 (5), 1-57, 2016 | 151 | 2016 |

Modifying memories in transformer models C Zhu, AS Rawat, M Zaheer, S Bhojanapalli, D Li, F Yu, S Kumar arXiv preprint arXiv:2012.00363, 2020 | 113 | 2020 |

Covariant approaches to superconformal blocks AL Fitzpatrick, J Kaplan, ZU Khandker, D Li, D Poland, D Simmons-Duffin Journal of High Energy Physics 2014 (8), 1-30, 2014 | 106 | 2014 |

Exact Virasoro blocks from Wilson lines and background-independent operators AL Fitzpatrick, J Kaplan, D Li, J Wang Journal of High Energy Physics 2017 (7), 1-51, 2017 | 101 | 2017 |

Conformal bootstrap in the Regge limit D Li, D Meltzer, D Poland Journal of High Energy Physics 2017 (12), 1-40, 2017 | 92 | 2017 |

Conformal collider physics from the lightcone bootstrap D Li, D Meltzer, D Poland Journal of High Energy Physics 2016 (2), 1-52, 2016 | 88 | 2016 |

N = 1 superconformal blocks for general scalar operators ZU Khandker, D Li, D Poland, ... JHEP 1408 (2014), 049, 2014 | 79* | 2014 |

Large language models with controllable working memory D Li, AS Rawat, M Zaheer, X Wang, M Lukasik, A Veit, F Yu, S Kumar arXiv preprint arXiv:2211.05110, 2022 | 75 | 2022 |

A numerical approach to Virasoro blocks and the information paradox H Chen, C Hussong, J Kaplan, D Li Journal of High Energy Physics 2017 (9), 1-39, 2017 | 73 | 2017 |

An exact operator that knows its location N Anand, H Chen, AL Fitzpatrick, J Kaplan, D Li Journal of High Energy Physics 2018 (2), 2018 | 63 | 2018 |

Degenerate operators and the 1/c expansion: Lorentzian resummations, high order computations, and super-Virasoro blocks H Chen, AL Fitzpatrick, J Kaplan, D Li, J Wang Journal of High Energy Physics 2017 (3), 1-47, 2017 | 61 | 2017 |

Bootstrapping mixed correlators in 4D N = 1 SCFTs D Li, D Meltzer, A Stergiou Journal of High Energy Physics 2017 (7), 1-33, 2017 | 58 | 2017 |

The lazy neuron phenomenon: On emergence of activation sparsity in transformers Z Li, C You, S Bhojanapalli, D Li, AS Rawat, SJ Reddi, K Ye, F Chern, ... arXiv preprint arXiv:2210.06313, 2022 | 53 | 2022 |

Non-Abelian binding energies from the lightcone bootstrap D Li, D Meltzer, D Poland Journal of High Energy Physics 2016 (2), 1-35, 2016 | 53 | 2016 |

Probing universalities in d > 2 CFTs: from black holes to shockwaves AL Fitzpatrick, KW Huang, D Li Journal of High Energy Physics 2019 (11), 1-29, 2019 | 47 | 2019 |

Superembedding methods for current superfields WD Goldberger, ZU Khandker, D Li, W Skiba Physical Review D—Particles, Fields, Gravitation, and Cosmology 88 (12), 125010, 2013 | 44 | 2013 |

Two-point functions of conformal primary operators in N = 1 superconformal theories D Li, A Stergiou Journal of High Energy Physics 2014 (10), 1-22, 2014 | 32 | 2014 |