Exploring principles-of-art features for image emotion recognition S Zhao, Y Gao, X Jiang, H Yao, TS Chua, X Sun Proceedings of the 22nd ACM international conference on Multimedia, 47-56, 2014 | 394 | 2014 |
Pix2vox: Context-aware 3d reconstruction from single and multi-view images H Xie, H Yao, X Sun, S Zhou, S Zhang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019 | 393 | 2019 |
Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation G Luo, Y Zhou, X Sun, L Cao, C Wu, C Deng, R Ji Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 294 | 2020 |
Dual-level collaborative transformer for image captioning Y Luo, J Ji, X Sun, L Cao, Y Wu, F Huang, CW Lin, R Ji Proceedings of the AAAI Conference on Artificial Intelligence 35 (3), 2286-2293, 2021 | 291 | 2021 |
Two-stream 3-d convnet fusion for action recognition in videos with arbitrary size and length X Wang, L Gao, P Wang, X Sun, X Liu IEEE Transactions on Multimedia 20 (3), 634-644, 2017 | 270 | 2017 |
RSTNet: Captioning With Adaptive Attention on Visual and Non-Visual Words X Zhang, X Sun, Y Luo, J Ji, Y Zhou, Y Wu, F Huang, R Ji Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 229 | 2021 |
X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval Y Ma, G Xu, X Sun, M Yan, J Zhang, R Ji Proceedings of the 30th ACM International Conference on Multimedia, 638-647, 2022 | 209 | 2022 |
Improving image captioning by leveraging intra-and inter-layer global representation in transformer network J Ji, Y Luo, X Sun, F Chen, G Luo, Y Wu, Y Gao, R Ji Proceedings of the AAAI Conference on Artificial Intelligence 35 (2), 1655-1663, 2021 | 174 | 2021 |
Exploiting the complementary strengths of multi-layer CNN features for image retrieval W Yu, K Yang, H Yao, X Sun, P Xu Neurocomputing 237, 235-241, 2017 | 142 | 2017 |
Seqtr: A simple yet universal network for visual grounding C Zhu, Y Zhou, Y Shen, G Luo, X Pan, M Lin, C Chen, L Cao, X Sun, R Ji European Conference on Computer Vision, 598-615, 2022 | 127 | 2022 |
Cascade Grouped Attention Network for Referring Expression Segmentation G Luo, Y Zhou, R Ji, X Sun, J Su, CW Lin, Q Tian Proceedings of the 28th ACM International Conference on Multimedia, 1274-1282, 2020 | 119 | 2020 |
Photo assessment based on computational visual attention model X Sun, H Yao, R Ji, S Liu Proceedings of the 17th ACM international conference on Multimedia, 541-544, 2009 | 116 | 2009 |
Task-dependent visual-codebook compression R Ji, H Yao, W Liu, X Sun, Q Tian IEEE Transactions on Image Processing 21 (4), 2282-2293, 2011 | 112 | 2011 |
SPTF: a scalable probabilistic tensor factorization model for semantic-aware behavior prediction H Yin, H Chen, X Sun, H Wang, Y Wang, QVH Nguyen 2017 IEEE International Conference on Data Mining (ICDM), 585-594, 2017 | 109 | 2017 |
GroupCap: Group-Based Image Captioning With Structured Relevance and Diversity Constraints F Chen, R Ji, X Sun, Y Wu, J Su Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018 | 93 | 2018 |
Trar: Routing the attention spans in transformer for visual question answering Y Zhou, T Ren, C Zhu, X Sun, J Liu, X Ding, M Xu, R Ji Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 89 | 2021 |
Cheap and quick: Efficient vision-language instruction tuning for large language models G Luo, Y Zhou, T Ren, S Chen, X Sun, R Ji Advances in Neural Information Processing Systems 36, 2024 | 77 | 2024 |
What are we looking for: Towards statistical modeling of saccadic eye movements and visual saliency X Sun, H Yao, R Ji 2012 IEEE Conference on Computer Vision and Pattern Recognition, 1552-1559, 2012 | 70 | 2012 |
Towards optimal fine grained retrieval via decorrelated centralized loss with normalize-scale layer X Zheng, R Ji, X Sun, B Zhang, Y Wu, F Huang Proceedings of the AAAI conference on artificial intelligence 33 (01), 9291-9298, 2019 | 69 | 2019 |
Strategy for dynamic 3D depth data matching towards robust action retrieval S Zhao, L Chen, H Yao, Y Zhang, X Sun Neurocomputing 151, 533-543, 2015 | 63 | 2015 |