Reimagining Edge AI and LLM Inference with Compute Memory Architectures

Room CB-603, 6/F, Chow Yei Ching Building, The University of Hong Kong

Abstract Recent advances in artificial intelligence (AI), especially in large language models (LLMs), have dramatically increased model sizes and computational demands, significantly straining computing system capabilities. This issue is particularly […]