Follow
Joya Chen
Title
Cited by
Cited by
Year
Assistgpt: A general multi-modal assistant that can plan, execute, inspect, and learn
D Gao, L Ji, L Zhou, KQ Lin, J Chen, Z Fan, MZ Shou
arXiv preprint arXiv:2306.08640, 2023
392023
Univtg: Towards unified video-language temporal grounding
KQ Lin, P Zhang, J Chen, S Pramanick, D Gao, AJ Wang, R Yan, MZ Shou
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
392023
Ego-exo4d: Understanding skilled human activity from first-and third-person perspectives
K Grauman, A Westbury, L Torresani, K Kitani, J Malik, T Afouras, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
312024
Foreground-background imbalance problem in deep object detectors: A review
J Chen, Q Wu, D Liu, T Xu
2020 IEEE Conference on Multimedia Information Processing and Retrieval …, 2020
282020
Assistq: Affordance-centric question-driven task completion for egocentric assistant
B Wong*, J Chen*, Y Wu*, SW Lei, D Mao, D Gao, MZ Shou
Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel …, 2022
212022
Linking the characters: Video-oriented social graph generation via hierarchical-cumulative GCN
S Wu, J Chen, T Xu, L Chen, L Wu, Y Hu, E Chen
Proceedings of the 29th ACM International Conference on Multimedia, 4716-4724, 2021
192021
Affordance grounding from demonstration video to target image
J Chen, D Gao, KQ Lin, MZ Shou
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
152023
Is sampling heuristics necessary in training deep object detectors?
J Chen, D Liu, T Xu, S Zhang, S Wu, B Luo
arXiv preprint arXiv:1909.04868, 2019
102019
Residual objectness for imbalance reduction
J Chen, D Liu, B Luo, X Peng, T Xu, E Chen
Pattern Recognition 130, 108781, 2022
92022
Is heuristic sampling necessary in training deep object detectors?
J Chen, D Liu, T Xu, S Wu, Y Cheng, E Chen
IEEE Transactions on Image Processing 30, 8454-8467, 2021
92021
Overlap sampler for region-based object detection
J Chen, B Luo, Q Wu, J Chen, X Peng
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2020
52020
Dropit: Dropping intermediate tensors for memory-efficient dnn training
J Chen, K Xu, Y Wang, Y Cheng, A Yao
arXiv preprint arXiv:2202.13808, 2022
42022
GazeVQA: A Video Question Answering Dataset for Multiview Eye-Gaze Task-Oriented Collaborations
M Ilaslan, C Song, J Chen, D Gao, W Lei, Q Xu, J Lim, M Shou
Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023
22023
Capturing Implicit Spatial Cues for Monocular 3d Hand Reconstruction
Q Wu*, J Chen*, X Zhou, Z Yao, X Yang
2021 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2021
22021
Communication-efficient federated learning with stagewise training strategy
Y Cheng, S Shen, X Liang, J Liu, J Chen, T Zhang, E Chen
Neural Networks 167, 460-472, 2023
12023
From a Social Cognitive Perspective: Context-aware Visual Social Relationship Recognition
S Wu, C Zhang, J Chen, T Xu, L Wu, Y Hu, E Chen
arXiv preprint arXiv:2406.08358, 2024
2024
VideoLLM-online: Online Video Large Language Model for Streaming Video
J Chen, Z Lv, S Wu, KQ Lin, C Song, D Gao, JW Liu, Z Gao, D Mao, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
2024
Bootstrapping SparseFormers from Vision Foundation Models
Z Gao, Z Tong, KQ Lin, J Chen, MZ Shou
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–18