GLM-130B: An open bilingual pre-trained model A Zeng, X Liu, Z Du, Z Wang, H Lai, M Ding, Z Yang, Y Xu, ... arXiv preprint arXiv:2210.02414, 2022 | 1075* | 2022 |
AgentBench: Evaluating LLMs as agents X Liu, H Yu, H Zhang, Y Xu, X Lei, H Lai, Y Gu, H Ding, K Men, K Yang, ... arXiv preprint arXiv:2308.03688, 2023 | 375* | 2023 |
ChatGLM: A family of large language models from GLM-130B to GLM-4 All Tools GLM Team, A Zeng, B Xu arXiv preprint arXiv:2406.12793, 2024 | 275* | 2024 |
WebGLM: Towards an efficient web-enhanced question answering system with human preferences X Liu, H Lai, H Yu, Y Xu, A Zeng, ... arXiv preprint arXiv:2306.07906, 2023 | 72 | 2023 |
AlignBench: Benchmarking Chinese alignment of large language models X Liu, X Lei, S Wang, Y Huang, Z Feng, B Wen, J Cheng, P Ke, Y Xu, ... arXiv preprint arXiv:2311.18743, 2023 | 44 | 2023 |
ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline Y Xu, X Liu, X Liu, Z Hou, Y Li, X Zhang, Z Wang, A Zeng, Z Du, W Zhao, ... arXiv preprint arXiv:2404.02893, 2024 | 18* | 2024 |
GOAL: A challenging knowledge-grounded video captioning benchmark for real-time soccer commentary generation J Qi, J Yu, T Tu, K Gao, Y Xu, X Guan, X Wang, B Xu, L Hou, J Li, J Tang Proceedings of the 32nd ACM International Conference on Information and …, 2023 | 17 | 2023 |
XDAI: A tuning-free framework for exploiting pre-trained language models in knowledge grounded dialogue generation J Yu, X Zhang, Y Xu, X Lei, X Guan, J Zhang, L Hou, J Li, J Tang Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and …, 2022 | 15 | 2022 |
VisualAgentBench: Towards large multimodal models as visual foundation agents X Liu, T Zhang, Y Gu, IL Iong, Y Xu, X Song, S Zhang, H Lai, X Liu, H Zhao, ... arXiv preprint arXiv:2408.06327, 2024 | 7 | 2024 |
AndroidLab: Training and systematic benchmarking of Android autonomous agents Y Xu, X Liu, X Sun, S Cheng, H Yu, H Lai, S Zhang, D Zhang, J Tang, ... arXiv preprint arXiv:2410.24024, 2024 | 3 | 2024 |
AutoGLM: Autonomous foundation agents for GUIs X Liu, B Qin, D Liang, G Dong, H Lai, H Zhang, H Zhao, IL Iong, J Sun, ... arXiv preprint arXiv:2411.00820, 2024 | 2 | 2024 |
A Cause-Effect Look at Alleviating Hallucination of Knowledge-grounded Dialogue Generation J Yu, X Zhang, Y Xu, X Lei, Z Yao, J Zhang, L Hou, J Li arXiv preprint arXiv:2404.03491, 2024 | 1 | 2024 |