Table of Contents
Fetching ...

Finding A Taxi with Illegal Driver Substitution Activity via Behavior Modelings

Junbiao Pang, Muhammad Ayub Sabir, Zhuyun Wang, Anjing Hu, Xue Yang, Haitao Yu, Qingming Huang

TL;DR

This work addresses the problem of identifying taxis involved in Illegal Driver Substitution (IDS) by formulating IDS detection as supervised learning over GPS traces and taximeter records. It introduces two driver-behavior models—Sleeping Time and Location (STL) and Pick-Up (PU)—and maps their outputs into a common discriminative feature space using Self-Similarity and Multi-Scale Pooling (MSP-SS). A novel MC-MIL framework fuses STL- and PU-based classifiers to align IDS-related patterns across long time horizons, showing superior performance over MIL-based baselines on the IDS@BJ dataset. The proposed approach yields robust, generalizable detection with practical efficiency, enabling targeted law-enforcement inspections in real-world settings. The work demonstrates meaningful improvements over baselines and provides actionable insights for deploying GPS–taximeter data fusion in transportation safety applications.

Abstract

In our urban life, Illegal Driver Substitution (IDS) activity for a taxi is a grave unlawful activity in the taxi industry, possibly causing severe traffic accidents and painful social repercussions. Currently, the IDS activity is manually supervised by law enforcers, i.e., law enforcers empirically choose a taxi and inspect it. The pressing problem of this scheme is the dilemma between the limited number of law-enforcers and the large volume of taxis. In this paper, motivated by this problem, we propose a computational method that helps law enforcers efficiently find the taxis which tend to have the IDS activity. Firstly, our method converts the identification of the IDS activity to a supervised learning task. Secondly, two kinds of taxi driver behaviors, i.e., the Sleeping Time and Location (STL) behavior and the Pick-Up (PU) behavior are proposed. Thirdly, the multiple scale pooling on self-similarity is proposed to encode the individual behaviors into the universal features for all taxis. Finally, a Multiple Component- Multiple Instance Learning (MC-MIL) method is proposed to handle the deficiency of the behavior features and to align the behavior features simultaneously. Extensive experiments on a real-world data set shows that the proposed behavior features have a good generalization ability across different classifiers, and the proposed MC-MIL method suppresses the baseline methods.

Finding A Taxi with Illegal Driver Substitution Activity via Behavior Modelings

TL;DR

This work addresses the problem of identifying taxis involved in Illegal Driver Substitution (IDS) by formulating IDS detection as supervised learning over GPS traces and taximeter records. It introduces two driver-behavior models—Sleeping Time and Location (STL) and Pick-Up (PU)—and maps their outputs into a common discriminative feature space using Self-Similarity and Multi-Scale Pooling (MSP-SS). A novel MC-MIL framework fuses STL- and PU-based classifiers to align IDS-related patterns across long time horizons, showing superior performance over MIL-based baselines on the IDS@BJ dataset. The proposed approach yields robust, generalizable detection with practical efficiency, enabling targeted law-enforcement inspections in real-world settings. The work demonstrates meaningful improvements over baselines and provides actionable insights for deploying GPS–taximeter data fusion in transportation safety applications.

Abstract

In our urban life, Illegal Driver Substitution (IDS) activity for a taxi is a grave unlawful activity in the taxi industry, possibly causing severe traffic accidents and painful social repercussions. Currently, the IDS activity is manually supervised by law enforcers, i.e., law enforcers empirically choose a taxi and inspect it. The pressing problem of this scheme is the dilemma between the limited number of law-enforcers and the large volume of taxis. In this paper, motivated by this problem, we propose a computational method that helps law enforcers efficiently find the taxis which tend to have the IDS activity. Firstly, our method converts the identification of the IDS activity to a supervised learning task. Secondly, two kinds of taxi driver behaviors, i.e., the Sleeping Time and Location (STL) behavior and the Pick-Up (PU) behavior are proposed. Thirdly, the multiple scale pooling on self-similarity is proposed to encode the individual behaviors into the universal features for all taxis. Finally, a Multiple Component- Multiple Instance Learning (MC-MIL) method is proposed to handle the deficiency of the behavior features and to align the behavior features simultaneously. Extensive experiments on a real-world data set shows that the proposed behavior features have a good generalization ability across different classifiers, and the proposed MC-MIL method suppresses the baseline methods.
Paper Structure (25 sections, 23 equations, 5 figures, 9 tables)

This paper contains 25 sections, 23 equations, 5 figures, 9 tables.

Figures (5)

  • Figure 1: Unreliable driver profile (best viewed in color). (a) The comparison of the ages between taxis with IDS and taxi role models. (b) The comparison of the education level between taxis with IDS and taxi role models; "PS", "MS", "HS", "JC", and "UG" are the abbreviation of Primary School, Middle School, High School, Junior College, and UnderGraduate, respectively. (c) The comparison of the first time when a taxi obtained the vocational license.
  • Figure 2: The system of inspecting a taxi with the IDS activity for law-enforcers (best viewed in color).
  • Figure 3: An illustration of the sleeping locations of taxis drivers.
  • Figure 4: An illustration of MSP pooling for SS-based features.
  • Figure 5: The distributions of the two elements in STL.

Theorems & Definitions (3)

  • Definition 1: Definition of IDS
  • Definition 2: Identification of IDS Activities
  • Definition 3: Definition of STL