Xiaoye Qu

VCE: A zero-cost hallucination mitigation method of LVLMs via visual contrastive editing

Apr 21, 2026 (via arXiv)

ExFusion: Efficient Transformer Training via Multi-Experts Fusion

Mar 30, 2026 (via arXiv)

GEMS: Agent-Native Multimodal Generation with Memory and Skills

Mar 30, 2026 (via arXiv)

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Mar 26, 2026 (via arXiv)

XSkill: Continual Learning from Experience and Skills in Multimodal Agents

Mar 12, 2026 (via arXiv)

Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning

Feb 12, 2026 (via arXiv)

Characterizing, Evaluating, and Optimizing Complex Reasoning

Feb 09, 2026 (via arXiv)

New Skills or Sharper Primitives? A Probabilistic Perspective on the Emergence of Reasoning in RLVR

Feb 09, 2026 (via arXiv)

Learning to Reason Faithfully through Step-Level Faithfulness Maximization

Feb 03, 2026 (via arXiv)

LatentMem: Customizing Latent Memory for Multi-Agent Systems

Feb 03, 2026 (via arXiv)