Picture for Hui Xiong

Hui Xiong

SCPRM: A Schema-aware Cumulative Process Reward Model for Knowledge Graph Question Answering

Add code
May 04, 2026
Viaarxiv icon

Reference-Sampled Boltzmann Projection for KL-Regularized RLVR: Target-Matched Weighted SFT, Finite One-Shot Gaps, and Policy Mirror Descent

Add code
May 04, 2026
Viaarxiv icon

Robust Conditional Conformal Prediction via Branched Normalizing Flow

Add code
May 03, 2026
Viaarxiv icon

Holo360D: A Large-Scale Real-World Dataset with Continuous Trajectories for Advancing Panoramic 3D Reconstruction and Beyond

Add code
Apr 24, 2026
Viaarxiv icon

Discrete Preference Learning for Personalized Multimodal Generation

Add code
Apr 22, 2026
Viaarxiv icon

Scaling Human-AI Coding Collaboration Requires a Governable Consensus Layer

Add code
Apr 20, 2026
Viaarxiv icon

Where to Focus: Query-Modulated Multimodal Keyframe Selection for Long Video Understanding

Add code
Apr 19, 2026
Viaarxiv icon

TableVision: A Large-Scale Benchmark for Spatially Grounded Reasoning over Complex Hierarchical Tables

Add code
Apr 04, 2026
Viaarxiv icon

A Pontryagin Method of Model-based Reinforcement Learning via Hamiltonian Actor-Critic

Add code
Mar 30, 2026
Viaarxiv icon

GoldenStart: Q-Guided Priors and Entropy Control for Distilling Flow Policies

Add code
Mar 15, 2026
Viaarxiv icon