๐Ÿ“ Publications

๐Ÿ 2025

🔥 CVPR 2025

Towards Improved Text-Aligned Codebook Learning: Multi-Hierarchical Codebook-Text Alignment with Long Text
Guotao Liang, Baoquan Zhang*, Zhiyuan Wen, Junteng Zhao, Yunming Ye, Xiaochen Qi, Yao He. (Highlight)

Project

  • We propose a novel text-augmented codebook learning framework, TA-VQ, which leverages vision-language models (VLMs) to generate longer text for each image to improve text-aligned codebook learning.

๐Ÿฒ 2024

🔥 NeurIPS 2024

LG-VQ: Language-Guided Codebook Learning
Guotao Liang, Baoquan Zhang*, Yaowei Wang, Yunming Ye, Xutao Li, Huaibin Wang, Chuyao Luo, Kola Ye, Linfeng Luo.

Project

  • We propose a novel multi-modal codebook learning method, LG-VQ, which enables the codebook to retain fine-grained reconstruction information while aligning with text.

CVPR 2024

Codebook Transfer with Part-of-Speech for Vector-Quantized Image Modeling
Baoquan Zhang, Huaibin Wang, Chuyao Luo, Xutao Li, Guotao Liang, Yunming Ye, Kola Ye, Linfeng Luo.

Project

  • We propose a new perspective, i.e., codebook transfer from language models to vector-quantized image modeling (VQIM), to alleviate the codebook collapse issue.

๐Ÿฐ 2023