Publications

(2025). Sparsity Forcing: Reinforcing Token Sparsity of MLLMs. ICLR 2026.
(2025). OmniSparse: Training-Aware Fine-Grained Sparse Attention for Long-Video MLLMs. AAAI 2026.
(2025). Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance. NeurIPS 2025.
(2025). Neighboring Autoregressive Modeling for Efficient Visual Generation. ICCV 2025.
(2024). ZipAR: Accelerating Autoregressive Image Generation through Spatial Locality. ICML 2025.
(2024). ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification and KV Cache Compression. ICCV 2025.
(2024). ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification. NeurIPS 2024.
(2024). MiniCache: KV Cache Compression in Depth Dimension for Large Language Models. NeurIPS 2024.
(2023). EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models. ICLR 2024 (spotlight).
(2023). Paragraph-to-Image Generation with Information-Enriched Diffusion Model. arXiv preprint arXiv:2311.14284.