Publications

(2024). ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification. arXiv preprint arXiv:2405.14256.
(2024). EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models. ICLR 2024 (spotlight).
(2023). Ptqd: Accurate post-training quantization for diffusion models. NeurIPS 2023.
(2023). Paragraph-to-image generation with information-enriched diffusion model. arXiv preprint arXiv:2311.14284.
(2023). Datasetdm: Synthesizing data with perception annotations using diffusion models. NeurIPS 2023.
(2023). Data-Free Quantization with Accurate Activation Clipping and Adaptive Batch Normalization. Neural Processing Letters.
(2023). BiViT: Extremely Compressed Binary Vision Transformers. ICCV 2023.
(2023). Binarizing by classification: Is soft function really necessary?. IEEE Transactions on Circuits and Systems for Video Technology.