(1)

Chen, L. Open-World Multimodal Understanding and Generation With Efficiently Finetuned Foundation Models. AAAI 2025, 39, 28706-28706.