Kevin Lin

Cited by

	All	Since 2019
Citations	5108	4467
h-index	23	23
i10-index	33	31

1800

900

450

1350

201520162017201820192020202120222023202415 76 189 339 380 404 435 659 1731 854

Public access

View all

2 articles

1 article

available

not available

Based on funding mandates

Co-authors

Lijuan WangMicrosoft GenAIVerified email at microsoft.com
Zicheng LiuMicrosoftVerified email at microsoft.com
Linjie (Lindsey) LiSenior Researcher, MicrosoftVerified email at microsoft.com
Chu-Song ChenNational Taiwan UniversityVerified email at csie.ntu.edu.tw
Zhengyuan YangResearcher, MicrosoftVerified email at microsoft.com
Jianfeng WangMicrosoftVerified email at microsoft.com
Chung-Ching LinMicrosoftVerified email at microsoft.com
Huei-Fang YangNational Sun Yat-sen UniversityVerified email at mis.nsysu.edu.tw
Zhe GanResearch Scientist, AppleVerified email at apple.com
Ming-Ting SunProfessor of Electrical Engineering, University of WashingtonVerified email at ee.washington.edu
Ce LiuPartner Research Manager, Microsoft GenAI; IEEE FellowVerified email at microsoft.com
Faisal Ahmed, PhDMicrosoftVerified email at microsoft.com
Yi-Ping HungNational Taiwan UniversityVerified email at csie.ntu.edu.tw
Jenhao HsiaoPrincipal AI Architect, OPPO US Research CenterVerified email at oppo.com
Jiwen Lu (鲁继文)Department of Automation, Tsinghua UniversityVerified email at tsinghua.edu.cn
Jianfeng GaoMicrosoft Research, RedmondVerified email at microsoft.com
Tsu-Jui FuUC Santa BarbaraVerified email at ucsb.edu
William Yang WangMellichamp Chair Professor, University of California, Santa BarbaraVerified email at cs.ucsb.edu
Zhengyou ZhangTencent AI Lab & Tencent Robotics XVerified email at tencent.com
Xiaodong He (何晓冬)AI Lab, JD.com; IEEE/CAAI FellowVerified email at ieee.org

Kevin Lin

Microsoft

Verified email at microsoft.com - Homepage

Computer Vision Vision and Language Multimodal


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Deep learning of binary hash codes for fast image retrieval K Lin, HF Yang, JH Hsiao, CS Chen IEEE Conference on Computer Vision and Pattern Recognition Workshops, 27-35, 2015	739	2015
End-to-end human pose and mesh reconstruction with transformers K Lin, L Wang, Z Liu IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 1954-1963, 2021	580	2021
Learning compact binary descriptors with unsupervised deep neural networks K Lin, J Lu, CS Chen, J Zhou IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 1183-1192, 2016	415	2016
Adversarial ranking for language generation K Lin, D Li, X He, Z Zhang, MT Sun Advances in Neural Information Processing Systems (NeurIPS), 3158-3168, 2017	413	2017
Supervised learning of semantics-preserving hash via deep convolutional neural networks HF Yang, K Lin, CS Chen IEEE Transactions on Pattern Analysis and Machine Intelligence 40 (2), 437-451, 2018	386	2018
GIT: A generative image-to-text transformer for vision and language J Wang, Z Yang, X Hu, L Li, K Lin, Z Gan, Z Liu, C Liu, L Wang Transactions on Machine Learning Research (TMLR), 2022	349	2022
Mesh graphormer K Lin, L Wang, Z Liu IEEE/CVF International Conference on Computer Vision (ICCV), 12939-12948, 2021	265	2021
The dawn of lmms: Preliminary explorations with gpt-4v (ision) Z Yang, L Li, K Lin, J Wang, CC Lin, Z Liu, L Wang arXiv preprint arXiv:2309.17421, 2023	225	2023
MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action Z Yang, L Li, J Wang, K Lin, E Azarnasab, F Ahmed, Z Liu, C Liu, M Zeng, ... arXiv preprint arXiv:2303.11381, 2023	195	2023
SwinBERT: End-to-end transformers with sparse attention for video captioning K Lin, L Li, CC Lin, F Ahmed, Z Gan, Z Liu, Y Lu, L Wang IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 17949 …, 2022	183	2022
VIOLET: End-to-end video-language transformers with masked visual-token modeling TJ Fu, L Li, Z Gan, K Lin, WY Wang, L Wang, Z Liu arXiv preprint arXiv:2111.12681, 2021	178	2021
Mitigating hallucination in large multi-modal models via robust instruction tuning F Liu, K Lin, L Li, J Wang, Y Yacoob, L Wang ICLR 2024, 2024	130*	2024
Mm-vet: Evaluating large multimodal models for integrated capabilities W Yu, Z Yang, L Li, J Wang, K Lin, Z Liu, X Wang, L Wang arXiv preprint arXiv:2308.02490, 2023	123	2023
Abandoned object detection via temporal consistency modeling and back-tracing verification for visual surveillance K Lin, SC Chen, CS Chen, DTD Lin, YP Hung IEEE Transactions on Information Forensic and Security 10 (7), 1359-1370, 2015	110	2015
Vivo: Visual vocabulary pre-training for novel object captioning X Hu, X Yin, K Lin, L Zhang, J Gao, L Wang, Z Liu Proceedings of the AAAI Conference on Artificial Intelligence, 1575-1583, 2021	108*	2021
Rapid clothing retrieval via deep learning of binary codes and hierarchical search K Lin, HF Yang, KH Liu, JH Hsiao, CS Chen ACM International Conference on Multimedia Retrieval (ICMR), 499–502, 2015	87	2015
Cross-domain complementary learning using pose for multi-person part segmentation K Lin, L Wang, K Luo, Y Chen, Z Liu, MT Sun IEEE Transactions on Circuits and Systems for Video Technology 31 (3), 1066 …, 2020	85	2020
Unsupervised deep learning of compact binary descriptors K Lin, J Lu, CS Chen, J Zhou, MT Sun IEEE Transactions on Pattern Analysis and Machine Intelligence 41 (6), 1501-1514, 2019	77	2019
Lavender: Unifying video-language understanding as masked language modeling L Li, Z Gan, K Lin, CC Lin, Z Liu, C Liu, L Wang IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 23119 …, 2023	61	2023
Reco: Region-controlled text-to-image generation Z Yang, J Wang, Z Gan, L Li, K Lin, C Wu, N Duan, Z Liu, C Liu, M Zeng, ... IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 14246 …, 2023	60	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors