RPG Seminar – Can Image Models Think? Benchmarking and Empowering Models with Knowledge and Reasoning
Zoom Link https://hku.zoom.us/meetings/95832932074/invitations?signature=SCg4igomh110T7WUpW6op2xeFgb9WgU92xDTYI2sp8s Abstract Text-to-image generation models have achieved impressive visual quality, yet they largely fail when prompts require reasoning or world knowledge to interpret. Generating an image of "a park thirty minutes after heavy rainfall" or "the city hosting the 2021 Summer Olympics" demands inference and domain knowledge, not just pattern matching. This seminar […]
