Guotao Liang

Guotao Liang

HITSZ / PCL

📝 Publications

🐍 2025

🔥 TPAMI 2026

sym

Codebook Transfer with Vision-to-Language Translation for Vector Quantization
Baoquan Zhang, Guotao Liang*, Tianran Chen, Yunming Ye, Zhiyuan Wen, Xiaochen Qi, and Yao He

Project Code

We propose a new perspective, \emph{i.e.}, codebook transfer from language models to VQ, to alleviate the codebook collapse issue. Its advantage is that the abundant transferable relationships from language codebooks can be fully exploited for enhancing VQ codebook learning.

🔥 AAAI 2026

sym

Improved Masked Image Generation with Knowledge-Augmented Token Representations
Guotao Liang, Baoquan Zhang*, Zhiyuan Wen, Zihao Han,Yunming Ye.

Project Code

We propose a novel knowledge-augmented masked image generation framework, named KA-MIG, which leverages prior semantic knowledge graphs to enrich internal token representations and improve generation performance.

🔥 CVPR 2025

sym

Towards Improved Text-Aligned Codebook Learning: Multi-Hierarchical Codebook-Text Alignment with Long Text
Guotao Liang, Baoquan Zhang*, Zhiyuan Wen, Junteng Zhao,Yunming Ye, Xiaochen Qi, Yao He. (Highlights)

We propose a novel text-augmented codebook learning framework, TA-VQ, which leverages VLMs to generate longer text for each image, improving text-aligned codebook learning.

🐲 2024

🔥 AAAI 2025 AsyncDSB: Schedule-Asynchronous Diffusion Schrödinger Bridge for Image Inpainting, Zihao Han, Baoquan Zhang*, Lisai Zhang, Shanshan Feng, Kenghong Lin, Guotao Liang, Yunming Ye, Xiaochen Qi.

🔥 NeurIPS 2024

sym

LG-VQ: Language-Guided Codebook Learning
Guotao Liang, Baoquan Zhang*, Yaowei Wang, Yunming Ye, Xutao Li , HuaiBin Wang, Luo Chuyao, Kola Ye, Linfeng Luo.

We propose a novel multi-modal codebook learning method, named LG-VQ, which can enable the codebook to effectively retain fine-grained reconstruction information while aligning with the text.

CVPR 2024

sym

Codebook Transfer with Part-of-Speech for Vector-Quantized Image Modeling
Baoquan Zhang, Wang huaibin, Luo Chuyao, Xutao Li, Guotao Liang, Yunming Ye, Kola Ye, Linfeng Luo.

We propose a new perspective, i.e., codebook transfer from language models to VQIM, to alleviate the codebook collapse issue.

🐰 2023

IJCNN 2023 HTP: Exploiting Holistic Temporal Patterns for Sequential Recommendation, Rui Chen, Guotao Liang, Chenrui Ma, Qilong Han, Li Li, Xiao Huang.