Privacy-Preserved Taxi Demand Prediction System Utilizing Distributed Data

Ren Ozeki; Haruki Yonekura; Hamada Rizk; Hirozumi Yamaguchi

TL;DR

CC-Net addresses privacy-sensitive taxi-demand prediction by combining contrastive feature learning with decentralized, neighbor-based collaboration, so that providers never expose raw data. It introduces a hexagonal virtual grid, a Transformer-based feature encoder trained with self-supervised contrastive learning, and a similarity-driven collaboration strategy that handles non-IID data while personalizing each client's predictions. On real-world data from five Japanese taxi service providers collected over 14 months, CC-Net improves prediction accuracy by at least 2.2% over existing techniques and remains robust against membership inference attacks. The result is a practical design for privacy-preserving, scalable taxi-demand forecasting across distributed providers.

Abstract

Accurate taxi-demand prediction is essential for optimizing taxi operations and enhancing urban transportation services. However, using customers' data in these systems raises significant privacy and security concerns. Traditional federated learning addresses some privacy issues by enabling model training without direct data exchange but often struggles with accuracy due to varying data distributions across different regions or service providers. In this paper, we propose CC-Net: a novel approach using collaborative learning enhanced with contrastive learning for taxi-demand prediction. Our method ensures high performance by enabling multiple parties to collaboratively train a demand-prediction model through hierarchical federated learning. In this approach, similar parties are clustered together, and federated learning is applied within each cluster. The similarity is defined without data exchange, ensuring privacy and security. We evaluated our approach using real-world data from five taxi service providers in Japan over fourteen months. The results demonstrate that CC-Net maintains the privacy of customers' data while improving prediction accuracy by at least 2.2% compared to existing techniques.
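
The cluster-then-federate idea in the abstract can be illustrated with a minimal sketch. The abstract does not say how similarity is computed, so the cosine similarity over flattened model parameters used here is an assumption, as are the function names, the threshold value, and the toy parameter vectors. Only parameters are compared, never raw trip data.

```python
import numpy as np

def cosine_similarity_matrix(params):
    """Pairwise cosine similarity between clients' flattened parameter vectors.

    Assumption: similarity is measured on model parameters, not on raw data.
    """
    norms = np.linalg.norm(params, axis=1, keepdims=True)
    unit = params / np.clip(norms, 1e-12, None)
    return unit @ unit.T

def select_similar_clients(sim, client, threshold):
    """Indices of clients (including `client` itself) with similarity >= threshold."""
    return np.flatnonzero(sim[client] >= threshold)

def personalized_average(params, sim, threshold=0.9):
    """Each client averages parameters only over its similar neighbours."""
    updated = np.empty_like(params)
    for c in range(params.shape[0]):
        neighbours = select_similar_clients(sim, c, threshold)
        updated[c] = params[neighbours].mean(axis=0)
    return updated

# Hypothetical toy setup: four clients, two-dimensional "models".
# Clients 0 and 1 resemble each other, as do clients 2 and 3.
params = np.array([
    [1.0, 0.0],
    [0.9, 0.1],
    [0.0, 1.0],
    [0.1, 0.9],
])
sim = cosine_similarity_matrix(params)
updated = personalized_average(params, sim, threshold=0.9)
```

In this toy case, clients 0 and 1 end up sharing one averaged model and clients 2 and 3 another, which is the behaviour that lets dissimilar (non-IID) providers avoid degrading each other's predictions.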

Paper Structure (27 sections, 4 equations, 13 figures, 2 tables, 1 algorithm)

Introduction
Threat Model
Proposed system
Hexagonal Virtual Gridding
Feature Extractor Module
Feature encoder
Feature Extractor with Contrastive Learning
Decentralized Collaborative Learning Mechanism
Similar Client Selection
Distributed Model Update
Client Tailored Classification
Evaluation
Data collection and setup
Data collection
Metrics
...and 12 more sections

Figures (13)

Figure 1: Taxi demand distribution in different regions.
Figure 2: The procedure of membership inference attack for taxi demand prediction.
Figure 3: CC-Net overview in heterogeneous environments.
Figure 4: Taxi demand prediction model in our proposed system.
Figure 5: Contrastive learning illustration. Each input is augmented twice, by cropping and by adding noise. The two views generated from the same input are encoded into similar representations, while representations of different inputs are pushed apart in latent space.
...and 8 more figures
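
The contrastive scheme described in the Figure 5 caption can be sketched as follows. This is an illustration under stated assumptions, not the paper's implementation: the NT-Xent (SimCLR-style) loss is swapped in because the caption does not name the loss, and the function names, crop length, noise level, and temperature are all hypothetical. The embeddings `z1` and `z2` stand in for encoder outputs; the Transformer-based encoder itself is not shown.

```python
import numpy as np

def augment(x, rng, crop_len, noise_std=0.05):
    """Random crop along the time axis, then additive Gaussian noise.

    Assumption: `x` has shape (time_steps, features).
    """
    start = rng.integers(0, x.shape[0] - crop_len + 1)
    crop = x[start:start + crop_len]
    return crop + rng.normal(0.0, noise_std, size=crop.shape)

def nt_xent_loss(z1, z2, temperature=0.5):
    """NT-Xent loss for N positive pairs (z1[i], z2[i]) of embeddings."""
    n = z1.shape[0]
    z = np.concatenate([z1, z2], axis=0)
    z = z / np.linalg.norm(z, axis=1, keepdims=True)
    sim = (z @ z.T) / temperature
    np.fill_diagonal(sim, -np.inf)  # a sample is never its own negative
    # Row i's positive is the other view of the same input.
    positives = np.concatenate([np.arange(n, 2 * n), np.arange(0, n)])
    # Numerically stable log-softmax over each row.
    row_max = sim.max(axis=1, keepdims=True)
    log_prob = sim - row_max - np.log(np.exp(sim - row_max).sum(axis=1, keepdims=True))
    return -log_prob[np.arange(2 * n), positives].mean()

# Two augmented views of one hypothetical demand series (24 steps, 1 feature).
rng = np.random.default_rng(0)
series = np.arange(24.0).reshape(24, 1)
view_a = augment(series, rng, crop_len=12)
view_b = augment(series, rng, crop_len=12)

# Toy embeddings: matched pairs should score a lower loss than mismatched ones.
z = np.eye(3)
loss_aligned = nt_xent_loss(z, z)
loss_mismatched = nt_xent_loss(z, np.roll(z, 1, axis=0))
```

Minimizing this loss pulls the two views of the same input together and pushes views of different inputs apart, which matches the behaviour the caption describes.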
