RPG Seminar – Vertical Layering of Quantized Neural Networks for Heterogeneous Inference
Abstract: Although considerable progress has been obtained in neural network quantization for efficient inference, existing methods are not scalable to heterogeneous devices as one dedicated model needs to be trained, […]
