Fan Ma
Cited by
Cited by
Few-example object detection with model communication
X Dong, L Zheng, F Ma, Y Yang, D Meng
IEEE transactions on pattern analysis and machine intelligence 41 (7), 1641-1654, 2018
Self-paced co-training
F Ma, D Meng, Q Xie, Z Li, X Dong
International Conference on Machine Learning, 2275-2284, 2017
Sf-net: Single-frame supervision for temporal action localization
F Ma, L Zhu, Y Yang, S Zha, G Kundu, M Feiszli, Z Shou
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
Unified transformer tracker for object tracking
F Ma, MZ Shou, L Zhu, H Fan, Y Xu, Y Yang, Z Yan
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
Self-paced multi-view co-training
F Ma, D Meng, X Dong, Y Yang
Journal of Machine Learning Research 21 (57), 1-38, 2020
Context modulated dynamic networks for actor and action video segmentation with language queries
H Wang, C Deng, F Ma, Y Yang
Proceedings of the AAAI Conference on Artificial Intelligence 34 (07), 12152 …, 2020
A dual-network progressive approach to weakly supervised object detection
X Dong, D Meng, F Ma, Y Yang
Proceedings of the 25th ACM international conference on Multimedia, 279-287, 2017
A co-training approach to the classification of local climate zones with multi-source data
Y Xu, F Ma, D Meng, C Ren, Y Leung
2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS …, 2017
Learning with noisy labels via self-reweighting from class centroids
F Ma, Y Wu, X Yu, Y Yang
IEEE transactions on neural networks and learning systems 33 (11), 6275-6285, 2021
Vlab: Enhancing video language pre-training by feature adapting and blending
X He, S Chen, F Ma, Z Huang, X Jin, Z Liu, D Fu, Y Yang, J Liu, J Feng
arXiv preprint arXiv:2305.13167, 2023
Temporal perceiving video-language pre-training
F Ma, X Jin, H Wang, J Huang, L Zhu, J Feng, Y Yang
arXiv preprint arXiv:2301.07463, 2023
Weakly Supervised Moment Localization with Decoupled Consistent Concept Prediction
F Ma, L Zhu, Y Yang
International Journal of Computer Vision, 2022
Vista-LLaMA: Reliable Video Narrator via Equal Distance to Visual Tokens
F Ma, X Jin, H Wang, Y Xian, J Feng, Y Yang
arXiv preprint arXiv:2312.08870, 2023
HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting
Z Zhou, F Ma, H Fan, Y Yang
arXiv preprint arXiv:2402.06149, 2024
MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis
D Zhou, Y Li, F Ma, Z Yang, Y Yang
arXiv preprint arXiv:2402.05408, 2024
Clustering for Protein Representation Learning
R Quan, W Wang, F Ma, H Fan, Y Yang
arXiv preprint arXiv:2404.00254, 2024
Psychometry: An Omnifit Model for Image Reconstruction from Human Brain Activity
R Quan, W Wang, Z Tian, F Ma, Y Yang
arXiv preprint arXiv:2403.20022, 2024
Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval
Y Suo, F Ma, L Zhu, Y Yang
arXiv preprint arXiv:2403.16005, 2024
LSK3DNet: Towards Effective and Efficient 3D Perception with Large Sparse Kernels
T Feng, W Wang, F Ma, Y Yang
arXiv preprint arXiv:2403.15173, 2024
CapHuman: Capture Your Moments in Parallel Universes
C Liang, F Ma, L Zhu, Y Deng, Y Yang
arXiv preprint arXiv:2402.00627, 2024
The system can't perform the operation now. Try again later.
Articles 1–20