Publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2026

  1. camreasoner.png
    CamReasoner: Reinforcing Camera Movement Understanding via Structured Spatial Reasoning
    Hang Wu, Yujun Cai, Zehao Li, and 4 more authors
    In ICML, 2026
    Under review.
  2. Framemind.png
    FrameMind: Frame-Interleaved Video Reasoning via Reinforcement Learning
    Haonan Ge, Yiwei Wang, Kai-Wei Chang, and 2 more authors
    In ICLR, 2026
    Under review.
  3. ICLR
    SPORTR.png
    SportR: A Benchmark for Multimodal Large Language Model Reasoning in Sports
    Haotian Xia*, Haonan Ge*, Junbo Zou*, and 16 more authors
    In ICLR, 2026
  4. Refineshot.png
    RefineShot: Rethinking Cinematography Understanding with Foundational Skill Evaluation
    Hang Wu, Yujun Cai, Haonan Ge, and 3 more authors
    In ICLR, 2026
    Under review.

2025

  1. EMNLP
    MRFD.png
    MRFD: Multi-Region Fusion Decoding with Self-Consistency for Mitigating Hallucinations in LVLMs
    Haonan Ge, Yiwei Wang, Ming-Hsuan Yang, and 1 more author
    In EMNLP, 2025