Recovering Origin Destination Flows from Bus CCTV: Early Results from Nairobi and Kigali
Nthenya Kyatha, Jay Taneja
TL;DR
Sub-Saharan Africa buses lack reliable passenger-flow data; this paper presents a CCTV-based baseline OD inference pipeline that reuses existing onboard cameras and telematics. The system combines per-camera detection and tracking (YOLOv12/BotSORT/OSNet), OCR timestamp extraction, ROI-based door counting, and cross-camera association to build OD matrices. Findings show strong performance under light conditions but significant drop under overcrowding and modality shifts, revealing deployment-specific failure modes. Door-state aware counting and simple Re-ID enhancements substantially improve exit accuracy and overall OD quality, underscoring the potential and the remaining challenges for scalable SSA transit analytics.
Abstract
Public transport in sub-Saharan Africa (SSA) often operates in overcrowded conditions where existing automated systems fail to capture reliable passenger flow data. Leveraging onboard CCTV already deployed for security, we present a baseline pipeline that combines YOLOv12 detection, BotSORT tracking, OSNet embeddings, OCR-based timestamping, and telematics-based stop classification to recover bus origin--destination (OD) flows. On annotated CCTV segments from Nairobi and Kigali buses, the system attains high counting accuracy under low-density, well-lit conditions (recall $\approx$95\%, precision $\approx$91\%, F1 $\approx$93\%). It produces OD matrices that closely match manual tallies. Under realistic stressors such as overcrowding, color-to-monochrome shifts, posture variation, and non-standard door use, performance degrades sharply (e.g., $\sim$40\% undercount in peak-hour boarding and a $\sim$17 percentage-point drop in recall for monochrome segments), revealing deployment-specific failure modes and motivating more robust, deployment-focused Re-ID methods for SSA transit.
