Follow
Lijuan Wang
Lijuan Wang
Microsoft GenAI
Verified email at microsoft.com - Homepage
Title
Cited by
Cited by
Year
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
JG Xiujun Li, Xi Yin, Chunyuan Li, Pengchuan Zhang, Xiaowei Hu, Lei Zhang ...
European Conference on Computer Vision (ECCV), 2020
1715*2020
Large Scale Incremental Learning
YF Yue Wu, Yinpeng Chen, Lijuan Wang, Yuancheng Ye, Zicheng Liu, Yandong Guo
The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019
1138*2019
VinVL: Making Visual Representations Matter in Vision-Language Models
P Zhang, X Li, X Hu, J Yang, L Zhang, L Wang, Y Choi, J Gao
CVPR2021, 2021
959*2021
Florence: A new foundation model for computer vision
L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ...
arXiv preprint arXiv:2111.11432, 2021
6352021
Grounded language-image pre-training
LH Li, P Zhang, H Zhang, J Yang, C Li, Y Zhong, L Wang, L Yuan, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
6162022
Rethinking Classification and Localization for Object Detection
YF Yue Wu, Yinpeng Chen, Lu Yuan, Zicheng Liu, Lijuan Wang, Hongzhi Li
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
5722020
End-to-End Human Pose and Mesh Reconstruction with Transformers
K Lin, L Wang, Z Liu
CVPR2021, 2020
5472020
End-to-end semi-supervised object detection with soft teacher
M Xu, Z Zhang, H Hu, J Wang, L Wang, F Wei, X Bai, Z Liu
Proceedings of the IEEE/CVF international conference on computer vision …, 2021
3962021
Real-time Animation for an Expressive Avatar
N Xu, L Wang, FKP Soong, X Liang, Q Luo, YQ Xu, X Zou
US Patent App. 12/950,801, 2012
3522012
Refining of segmental boundaries in speech waveforms using contextual-dependent models
Y Zhao, M Chu, JL Zhou, L Wang
US Patent 7,496,512, 2009
3402009
Git: A generative image-to-text transformer for vision and language
J Wang, Z Yang, X Hu, L Li, K Lin, Z Gan, Z Liu, C Liu, L Wang
arXiv preprint arXiv:2205.14100, 2022
3212022
Handwriting-based user interface for correction of speech recognition errors
L Wang, FKP Soong
US Patent App. 12/042,344, 2009
2812009
An empirical study of training end-to-end vision-and-language transformers
ZY Dou, Y Xu, Z Gan, J Wang, S Wang, L Wang, C Zhu, P Zhang, L Yuan, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
2762022
An empirical study of gpt-3 for few-shot knowledge-based vqa
Z Yang, Z Gan, J Wang, X Hu, Y Lu, Z Liu, L Wang
Proceedings of the AAAI Conference on Artificial Intelligence 36 (3), 3081-3089, 2022
2692022
Unnatural prosody detection in speech synthesis
Y Zhao, FKP Soong, M Chu, L Wang
US Patent 8,583,438, 2013
2642013
Mesh graphormer
K Lin, L Wang, Z Liu
Proceedings of the IEEE/CVF international conference on computer vision …, 2021
2502021
Scaling up vision-language pre-training for image captioning
X Hu, Z Gan, J Wang, Z Yang, Z Liu, Y Lu, L Wang
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
2042022
The dawn of lmms: Preliminary explorations with gpt-4v (ision)
Z Yang, L Li, K Lin, J Wang, CC Lin, Z Liu, L Wang
arXiv preprint arXiv:2309.17421 9 (1), 1, 2023
1982023
Segment everything everywhere all at once
X Zou, J Yang, H Zhang, F Li, L Li, J Wang, L Wang, J Gao, YJ Lee
arXiv preprint arXiv:2304.06718, 2023
1952023
Mm-react: Prompting chatgpt for multimodal reasoning and action
Z Yang, L Li, J Wang, K Lin, E Azarnasab, F Ahmed, Z Liu, C Liu, M Zeng, ...
arXiv preprint arXiv:2303.11381, 2023
1862023
The system can't perform the operation now. Try again later.
Articles 1–20