Follow
Quentin Anthony
Quentin Anthony
Verified email at osu.edu - Homepage
Title
Cited by
Cited by
Year
Gpt-neox-20b: An open-source autoregressive language model
S Black, S Biderman, E Hallahan, Q Anthony, L Gao, L Golding, H He, ...
Proceedings of the ACL Workshop on Challenges & Perspectives in Creating …, 2022
538*2022
Pythia: A suite for analyzing large language models across training and scaling
S Biderman, H Schoelkopf, Q Anthony, H Bradley, K O'Brien, E Hallahan, ...
International conference on machine learning (ICML), 2023
3812023
Rwkv: Reinventing rnns for the transformer era
B Peng, E Alcaide, Q Anthony, A Albalak, S Arcadinho, H Cao, X Cheng, ...
arXiv preprint arXiv:2305.13048, 2023
148*2023
Emergent and Predictable Memorization in Large Language Models
S Biderman, US Prashanth, L Sutawika, H Schoelkopf, Q Anthony, ...
https://arxiv.org/pdf/2304.11158.pdf, 2023
502023
Gems: Gpu-enabled memory-aware model-parallelism system for distributed dnn training
A Jain, AA Awan, AM Aljuhani, JM Hashmi, QG Anthony, H Subramoni, ...
SC20: International Conference for High Performance Computing, Networking …, 2020
442020
Performance characterization of dnn training using tensorflow and pytorch on modern clusters
A Jain, AA Awan, Q Anthony, H Subramoni, DKDK Panda
2019 IEEE International Conference on Cluster Computing (CLUSTER), 1-11, 2019
392019
Continual Pre-Training of Large Language Models: How to (re) warm your model?
K Gupta, B Thérien, A Ibrahim, ML Richter, Q Anthony, E Belilovsky, I Rish, ...
172023
GPT-NeoX: Large scale autoregressive language modeling in pytorch
A Andonian, Q Anthony, S Biderman, S Black, P Gali, L Gao, E Hallahan, ...
17*2021
Hypar-flow: Exploiting mpi and keras for scalable hybrid-parallel dnn training using tensorflow
AA Awan, A Jain, Q Anthony, H Subramoni, DK Panda
arXiv preprint arXiv:1911.05146, 2019
14*2019
Adaptive and hierarchical large message all-to-all communication algorithms for large-scale dense gpu systems
KS Khorassani, CH Chu, QG Anthony, H Subramoni, DK Panda
2021 IEEE/ACM 21st International Symposium on Cluster, Cloud and Internet …, 2021
132021
Accelerating mpi all-to-all communication with online compression on modern gpu clusters
Q Zhou, P Kousha, Q Anthony, K Shafie Khorassani, A Shafi, ...
International Conference on High Performance Computing, 3-25, 2022
92022
Efficient training of semantic image segmentation on summit using horovod and mvapich2-gdr
Q Anthony, AA Awan, A Jain, H Subramoni, DKDK Panda
2020 IEEE International Parallel and Distributed Processing Symposium …, 2020
72020
Mcr-dl: Mix-and-match communication runtime for deep learning
Q Anthony, AA Awan, J Rasley, Y He, A Shafi, M Abduljabbar, ...
2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2023
32023
Hy-Fi: Hybrid Five-Dimensional Parallel DNN Training on High-Performance GPU Clusters
A Jain, A Shafi, Q Anthony, P Kousha, H Subramoni, DK Panda
International Conference on High Performance Computing, 109-130, 2022
32022
Scaling single-image super-resolution training on modern hpc clusters: Early experiences
Q Anthony, L Xu, H Subramoni, DKDK Panda
2021 IEEE International Parallel and Distributed Processing Symposium …, 2021
32021
Accelerating GPU-based Machine Learning in Python using MPI Library: A Case Study with MVAPICH2-GDR
SM Ghazimirsaeed, Q Anthony, A Shafi, H Subramoni, DKDK Panda
2020 IEEE/ACM Workshop on Machine Learning in High Performance Computing …, 2020
32020
BlackMamba: Mixture of Experts for State-Space Models
Q Anthony, Y Tokpanov, P Glorioso, B Millidge
arXiv preprint arXiv:2402.01771, 2024
22024
Accelerating distributed deep learning training with compression assisted allgather and reduce-scatter communication
Q Zhou, Q Anthony, L Xu, A Shafi, M Abduljabbar, H Subramoni, ...
2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2023
22023
Highly efficient alltoall and alltoallv communication algorithms for gpu systems
CC Chen, KS Khorassani, QG Anthony, A Shafi, H Subramoni, DK Panda
2022 IEEE International Parallel and Distributed Processing Symposium …, 2022
22022
Evaluating Multi-Level Checkpointing for Distributed Deep Neural Network Training
Q Anthony, D Dai
SC Workshops Supplementary Proceedings (SCWS), 2021
22021
The system can't perform the operation now. Try again later.
Articles 1–20