YOLIC: An Efficient Method for Object Localization and Classification on Edge Devices
Kai Su, Yoichi Tomioka, Qiangfu Zhao, Yong Liu
TL;DR
YOLIC proposes a bounding-box-free, CoI-based object localization and classification method optimized for edge devices, blending semantic segmentation with lightweight detection. By using predefined Cells of Interest and a multi-label per-CoI head, it delivers real-time performance with competitive accuracy, avoiding bounding-box regression and NMS. The approach is validated across outdoor, indoor, and urban datasets, including Cityscapes, demonstrating robust generalization and substantial speed advantages on Raspberry Pi hardware, with quantization-aware training further enhancing efficiency. The work highlights practical benefits for IoT and autonomous systems, offering configurable CoI layouts, transferable backbones, and publicly available resources to facilitate deployment and reproduction.
Abstract
In the realm of Tiny AI, we introduce ``You Only Look at Interested Cells" (YOLIC), an efficient method for object localization and classification on edge devices. Through seamlessly blending the strengths of semantic segmentation and object detection, YOLIC offers superior computational efficiency and precision. By adopting Cells of Interest for classification instead of individual pixels, YOLIC encapsulates relevant information, reduces computational load, and enables rough object shape inference. Importantly, the need for bounding box regression is obviated, as YOLIC capitalizes on the predetermined cell configuration that provides information about potential object location, size, and shape. To tackle the issue of single-label classification limitations, a multi-label classification approach is applied to each cell for effectively recognizing overlapping or closely situated objects. This paper presents extensive experiments on multiple datasets to demonstrate that YOLIC achieves detection performance comparable to the state-of-the-art YOLO algorithms while surpassing in speed, exceeding 30fps on a Raspberry Pi 4B CPU. All resources related to this study, including datasets, cell designer, image annotation tool, and source code, have been made publicly available on our project website at https://kai3316.github.io/yolic.github.io
