Selected Publications

Action Recognition

2025

  1. DynaPURLS: Dynamic Refinement of Part-aware Representations for Skeleton-based Zero-Shot Action Recognition
    Jingmin Zhu, Anqi Zhu, James Bailey, Jun Liu, Hossein Rahmani, Mohammed Bennamoun, Farid Boussaid, and Qiuhong Ke
    arXiv 2025 | arXiv preprint
  2. TSkel-Mamba: Temporal Dynamic Modeling via State Space Model for Human Skeleton-based Action Recognition
    Yanan Liu, Jun Liu, Hao Zhang, Dan Xu, Hossein Rahmani, Mohammed Bennamoun, and Qiuhong Ke
    arXiv 2025 | arXiv preprint
  3. Boosting Skeleton-based Zero-Shot Action Recognition with Training-Free Test-Time Adaptation
    Jingmin Zhu, Anqi Zhu, Hossein Rahmani, Jun Liu, Mohammed Bennamoun, and Qiuhong Ke
    NeurIPS 2025 | Advances in Neural Information Processing Systems (CCF-A, CORE-A*)
  4. Semantic-guided Cross-Modal Prompt Learning for Skeleton-based Zero-shot Action Recognition
    Anqi Zhu, Jingmin Zhu, James Bailey, Mingming Gong, and Qiuhong Ke
    CVPR 2025 | Proceedings of the Computer Vision and Pattern Recognition Conference (CCF-A, CORE-A*)

2024

  1. Part-aware unified representation of language and skeleton for zero-shot action recognition
    Anqi Zhu, Qiuhong Ke, Mingming Gong, and James Bailey
    CVPR 2024 | Conference on Computer Vision and Pattern Recognition (CCF-A, CORE-A*)

2022

  1. Human action recognition from various data modalities: A review
    Zehua Sun, Qiuhong Ke, Hossein Rahmani, Mohammed Bennamoun, Gang Wang, and Jun Liu
    TPAMI 2022 | IEEE Transactions on Pattern Analysis and Machine Intelligence (CCF-A, CORE-A*)

Video Understanding

2025

  1. Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM
    Zinuo Li, Xian Zhang, Yongxin Guo, Mohammed Bennamoun, Farid Boussaid, Girish Dwivedi, Luqi Gong, and Qiuhong Ke
    NeurIPS 2025 | Advances in Neural Information Processing Systems (CCF-A, CORE-A*)

2024

  1. Sports-QA: A large-scale video question answering benchmark for complex and professional sports
    Haopeng Li, Andong Deng, Jun Liu, Hossein Rahmani, Yulan Guo, Bernt Schiele, Chen Chen, and Qiuhong Ke
    arXiv 2024 | arXiv preprint

2023

  1. Progressive video summarization via multimodal self-supervised learning
    Haopeng Li, Qiuhong Ke, Mingming Gong, and Tom Drummond
    WACV 2023 | IEEE/CVF Winter Conference on Applications of Computer Vision (CCF-B, CORE-A)

2022

  1. Video joint modelling based on hierarchical transformer for co-summarization
    Haopeng Li, Qiuhong Ke, Mingming Gong, and Rui Zhang
    TPAMI 2022 | IEEE Transactions on Pattern Analysis and Machine Intelligence (CCF-A, CORE-A*)

Generation

2025

  1. LongDiff: Training-Free Long Video Generation in One Go
    Zhuoling Li, Hossein Rahmani, Qiuhong Ke, and Jun Liu
    CVPR 2025 | Conference on Computer Vision and Pattern Recognition (CCF-A, CORE-A*)
  2. Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Point Cloud Analysis
    Hongyu Sun, Qiuhong Ke, Ming Cheng, Yongcai Wang, Deying Li, Chenhui Gou, and Jianfei Cai
    CVPR 2025 | Conference on Computer Vision and Pattern Recognition (CCF-A, CORE-A*)
  3. ST-GDance: Long-Term and Collision-Free Group Choreography from Music
    Jing Xu, Weiqiang Wang, Cunjian Chen, Jun Liu, and Qiuhong Ke
    BMVC 2025 | British Machine Vision Conference (CCF-C, CORE-A)
    Oral Presentation

2024

  1. Point-PRC: A prompt learning based regulation framework for generalizable point cloud analysis
    Hongyu Sun, Qiuhong Ke, Yongcai Wang, Wang Chen, Kang Yang, Deying Li, and Jianfei Cai
    NeurIPS 2024 | Advances in Neural Information Processing Systems (CCF-A, CORE-A*)

Other Publications