Follow
Yi Wang
Yi Wang
Shanghai AI Laboratory
Verified email at cse.cuhk.edu.hk
Title
Cited by
Cited by
Year
Videochat: Chat-centric video understanding
KC Li, Y He, Y Wang, Y Li, W Wang, P Luo, Y Wang, L Wang, Y Qiao
arXiv preprint arXiv:2305.06355, 2023
4782023
Image inpainting via generative multi-column convolutional neural networks
Y Wang, X Tao, X Qi, X Shen, J Jia
Advances in Neural Information Processing Systems, 331-340, 2018
4002018
Videomae v2: Scaling video masked autoencoders with dual masking
L Wang, B Huang, Z Zhao, Z Tong, Y He, Y Wang, Y Wang, Y Qiao
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
3282023
Mat: Mask-aware transformer for large hole image inpainting
W Li, Z Lin, K Zhou, L Qi, Y Wang, J Jia
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
3252022
InternVideo: general video foundation models via generative and discriminative learning
Y Wang, K Li, Y Li, Y He, B Huang, Z Zhao, H Zhang, J Xu, Y Liu, Z Wang, ...
arXiv preprint arXiv:2212.03191, 2022
2872022
Mvbench: A comprehensive multi-modal video understanding benchmark
K Li, Y Wang, Y He, Y Li, Y Wang, Y Liu, Z Wang, J Xu, G Chen, P Luo, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
1742024
Lavie: High-quality video generation with cascaded latent diffusion models
Y Wang, X Chen, X Ma, S Zhou, Z Huang, Y Wang, C Yang, Y He, J Yu, ...
arXiv preprint arXiv:2309.15103, 2023
1702023
Internvid: A large-scale video-text dataset for multimodal understanding and generation
Y Wang, Y He, Y Li, K Li, J Yu, X Ma, X Li, G Chen, X Chen, Y Wang, C He, ...
arXiv preprint arXiv:2307.06942, 2023
1672023
Wide-context semantic image extrapolation
Y Wang, X Tao, X Shen, J Jia
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2019
1332019
Unmasked teacher: Towards training-efficient video foundation models
K Li, Y Wang, Y Li, Y Wang, Y He, L Wang, Y Qiao
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
1302023
Fast visual object counting via example-based density estimation
Y Wang, Y Zou
2016 IEEE international conference on image processing (ICIP), 3653-3657, 2016
1222016
Uniformerv2: Spatiotemporal learning by arming image vits with video uniformer
K Li, Y Wang, Y He, Y Li, Y Wang, L Wang, Y Qiao
arXiv preprint arXiv:2211.09552, 2022
1152022
Videomamba: State space model for efficient video understanding
K Li, X Li, Y Wang, Y He, Y Wang, L Wang, Y Qiao
European Conference on Computer Vision, 237-255, 2025
1122025
Towards implicit text-guided 3d shape generation
Z Liu, Y Wang, X Qi, CW Fu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
1012022
VCNet: a robust approach to blind image inpainting
Y Wang, YC Chen, X Tao, J Jia
European Conference on Computer Vision, 2020
932020
Classifying digestive organs in wireless capsule endoscopy images based on deep convolutional neural network
Y Zou, L Li, Y Wang, J Yu, Y Li, WJ Deng
2015 IEEE International Conference on Digital Signal Processing (DSP), 1274-1278, 2015
872015
Learning open-vocabulary semantic segmentation models from natural language supervision
J Xu, J Hou, Y Zhang, R Feng, Y Wang, Y Qiao, W Xie
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023
842023
Interngpt: Solving vision-centric tasks by interacting with chatgpt beyond language
Z Liu, Y He, W Wang, W Wang, Y Wang, S Chen, Q Zhang, Z Lai, Y Yang, ...
arXiv preprint arXiv:2305.05662, 2023
792023
Open world entity segmentation
L Qi, J Kuen, Y Wang, J Gu, H Zhao, P Torr, Z Lin, J Jia
IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (7), 8743-8756, 2022
792022
Multi-scale aligned distillation for low-resolution detection
L Qi, J Kuen, J Gu, Z Lin, Y Wang, Y Chen, Y Li, J Jia
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
782021
The system can't perform the operation now. Try again later.
Articles 1–20