๐ Publications
๐ 2025
๐ฅ CVPR 2025

Towards Improved Text-Aligned Codebook Learning: Multi-Hierarchical Codebook-Text Alignment with Long Text
Guotao Liang, Baoquan Zhang*, Zhiyuan Wen, Junteng Zhao,Yunming Ye, Xiaochen Qi, Yao He. (Highlights)
- We propose a novel text-augmented codebook learning framework, TA-VQ, which leverages VLMs to generate longer text for each image, improving text-aligned codebook learning.
๐ฒ 2024
- ๐ฅ
AAAI 2025
AsyncDSB: Schedule-Asynchronous Diffusion Schrรถdinger Bridge for Image Inpainting, Zihao Han, Baoquan Zhang*, Lisai Zhang, Shanshan Feng, Kenghong Lin, Guotao Liang, Yunming Ye, Xiaochen Qi.
๐ฅ NeurIPS 2024

LG-VQ: Language-Guided Codebook Learning
Guotao Liang, Baoquan Zhang*, Yaowei Wang, Yunming Ye, Xutao Li , HuaiBin Wang, Luo Chuyao, Kola Ye, Linfeng Luo.
- We propose a novel multi-modal codebook learning method, named LG-VQ, which can enable the codebook to effectively retain fine-grained reconstruction information while aligning with the text.
CVPR 2024

Codebook Transfer with Part-of-Speech for Vector-Quantized Image Modeling
Baoquan Zhang, Wang huaibin, Luo Chuyao, Xutao Li, Guotao Liang, Yunming Ye, Kola Ye, Linfeng Luo.
- We propose a new perspective, i.e., codebook transfer from language models to VQIM, to alleviate the codebook collapse issue.
๐ฐ 2023
IJCNN 2023
HTP: Exploiting Holistic Temporal Patterns for Sequential Recommendation, Rui Chen, Guotao Liang, Chenrui Ma, Qilong Han, Li Li, Xiao Huang.