Table of Contents
Fetching ...

A Unified Knowledge Graph to Permit Interoperability of Heterogeneous Digital Evidence

Ali Alshumrani, Nathan Clarke, Bogdan Ghita

TL;DR

The paper tackles interoperability challenges in digital forensics posed by heterogeneous evidence across devices and formats. It introduces a Unified Metadata Graph Model (UMGM) that ingests, harmonises, and unifies evidence metadata within a graph database to enable cross-platform analytics. The key contributions include a three-phase UMGM framework, a detailed metadata extraction/mapping/refinement process, and cross-evidence analytics demonstrated via a hypothetical data-leak case. The results suggest improved query efficiency, richer evidence relationships, and streamlined workflows for digital investigations, with future work targeting scalability and integration of machine learning.

Abstract

The modern digital world is highly heterogeneous, encompassing a wide variety of communications, devices, and services. This interconnectedness generates, synchronises, stores, and presents digital information in multidimensional, complex formats, often fragmented across multiple sources. When linked to misuse, this digital information becomes vital digital evidence. Integrating and harmonising these diverse formats into a unified system is crucial for comprehensively understanding evidence and its relationships. However, existing approaches to date have faced challenges limiting investigators' ability to query heterogeneous evidence across large datasets. This paper presents a novel approach in the form of a modern unified data graph. The proposed approach aims to seamlessly integrate, harmonise, and unify evidence data, enabling cross-platform interoperability, efficient data queries, and improved digital investigation performance. To demonstrate its efficacy, a case study is conducted, highlighting the benefits of the proposed approach and showcasing its effectiveness in enabling the interoperability required for advanced analytics in digital investigations.

A Unified Knowledge Graph to Permit Interoperability of Heterogeneous Digital Evidence

TL;DR

The paper tackles interoperability challenges in digital forensics posed by heterogeneous evidence across devices and formats. It introduces a Unified Metadata Graph Model (UMGM) that ingests, harmonises, and unifies evidence metadata within a graph database to enable cross-platform analytics. The key contributions include a three-phase UMGM framework, a detailed metadata extraction/mapping/refinement process, and cross-evidence analytics demonstrated via a hypothetical data-leak case. The results suggest improved query efficiency, richer evidence relationships, and streamlined workflows for digital investigations, with future work targeting scalability and integration of machine learning.

Abstract

The modern digital world is highly heterogeneous, encompassing a wide variety of communications, devices, and services. This interconnectedness generates, synchronises, stores, and presents digital information in multidimensional, complex formats, often fragmented across multiple sources. When linked to misuse, this digital information becomes vital digital evidence. Integrating and harmonising these diverse formats into a unified system is crucial for comprehensively understanding evidence and its relationships. However, existing approaches to date have faced challenges limiting investigators' ability to query heterogeneous evidence across large datasets. This paper presents a novel approach in the form of a modern unified data graph. The proposed approach aims to seamlessly integrate, harmonise, and unify evidence data, enabling cross-platform interoperability, efficient data queries, and improved digital investigation performance. To demonstrate its efficacy, a case study is conducted, highlighting the benefits of the proposed approach and showcasing its effectiveness in enabling the interoperability required for advanced analytics in digital investigations.
Paper Structure (17 sections, 4 figures, 3 tables)

This paper contains 17 sections, 4 figures, 3 tables.

Figures (4)

  • Figure 1: A generic workflow of the unified metadata graph model.
  • Figure 2: An exemplary depiction of the metadata harmonisation process.
  • Figure 3: A brief illustration of the metadata harmonisation case study results.
  • Figure 4: Timeline analysis of the harmonised timestamp events.