Follow
Zun Wang
Zun Wang
Verified email at cs.unc.edu - Homepage
Title
Cited by
Cited by
Year
Internvideo: General video foundation models via generative and discriminative learning
Y Wang, K Li, Y Li, Y He, B Huang, Z Zhao, H Zhang, J Xu, Y Liu, Z Wang, ...
arXiv preprint arXiv:2212.03191, 2022
2932022
Mvbench: A comprehensive multi-modal video understanding benchmark
K Li, Y Wang, Y He, Y Li, Y Wang, Y Liu, Z Wang, J Xu, G Chen, P Luo, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
1852024
Internvideo2: Scaling video foundation models for multimodal video understanding
Y Wang, K Li, X Li, J Yu, Y He, G Chen, B Pei, R Zheng, J Xu, Z Wang, ...
arXiv e-prints, arXiv: 2403.15377, 2024
722024
Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation
Y Hong*, Z Wang*, Q Wu, S Gould
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
632022
Scaling Data Generation in Vision-and-Language Navigation
Z Wang, J Li, Y Hong, Y Wang, Q Wu, M Bansal, S Gould, H Tan, Y Qiao
ICCV2023, 2023
472023
Internvideo-ego4d: A pack of champion solutions to ego4d challenges
G Chen, S Xing, Z Chen, Y Wang, K Li, Y Li, Y Liu, J Wang, YD Zheng, ...
arXiv preprint arXiv:2211.09529, 2022
422022
Etpnav: Evolving topological planning for vision-language navigation in continuous environments
D An, H Wang, W Wang, Z Wang, Y Huang, K He, L Wang
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024
402024
1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022)
D An*, Z Wang*, Y Li, Y Wang, Y Hong, Y Huang, L Wang, J Shao
arXiv preprint arXiv:2206.11610, 2022
92022
Vision-and-language navigation today and tomorrow: A survey in the era of foundation models
Y Zhang*, Z Ma*, J Li*, Y Qiao*, Z Wang*, J Chai, Q Wu, M Bansal, ...
arXiv preprint arXiv:2407.07035, 2024
62024
Navgpt-2: Unleashing navigational reasoning capability for large vision-language models
G Zhou, Y Hong, Z Wang, XE Wang, Q Wu
European Conference on Computer Vision, 2024
42024
DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation
Z Wang, J Li, H Lin, J Yoon, M Bansal
arXiv preprint arXiv:2411.16657, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–11