Enhanced Review Detection and Recognition: A Platform-Agnostic Approach with Application to Online Commerce

Priyabrata Karmakar; John Hawkins

Enhanced Review Detection and Recognition: A Platform-Agnostic Approach with Application to Online Commerce

Priyabrata Karmakar, John Hawkins

TL;DR

The paper addresses the automation of detecting and extracting online reviews across diverse platforms to combat credibility issues and enable large-scale analysis. It introduces PIORDR, a platform-agnostic pipeline that combines object detection (YOLOv8) to locate review areas with OCR to transcribe text, avoiding brittle HTML scraping. It expands the pipeline with three analytics modules—sentiment inconsistency analysis, multilingual extraction and translation, and fake-review detection via a large language model—demonstrating strong performance on known platforms and reasonable generalization to unseen sites. While results on unfamiliar platforms show some degradation, the approach offers a scalable solution for cross-platform review analysis and veracity filtering, with future work focused on improving generalization through distillation, few-shot, and zero-shot learning.

Abstract

Online commerce relies heavily on user generated reviews to provide unbiased information about products that they have not physically seen. The importance of reviews has attracted multiple exploitative online behaviours and requires methods for monitoring and detecting reviews. We present a machine learning methodology for review detection and extraction, and demonstrate that it generalises for use across websites that were not contained in the training data. This method promises to drive applications for automatic detection and evaluation of reviews, regardless of their source. Furthermore, we showcase the versatility of our method by implementing and discussing three key applications for analysing reviews: Sentiment Inconsistency Analysis, which detects and filters out unreliable reviews based on inconsistencies between ratings and comments; Multi-language support, enabling the extraction and translation of reviews from various languages without relying on HTML scraping; and Fake review detection, achieved by integrating a trained NLP model to identify and distinguish between genuine and fake reviews.

Enhanced Review Detection and Recognition: A Platform-Agnostic Approach with Application to Online Commerce

TL;DR

Abstract

Paper Structure (14 sections, 2 equations, 11 figures, 2 tables)

This paper contains 14 sections, 2 equations, 11 figures, 2 tables.

Introduction
Advantages and Disadvantages of Online Reviews
Advantages
Disadvantages
Importance of Automatic Detection and Recognition of Online Reviews
Methods
Data collection & Experiments
Metrics
Results
Analysis Module
Sentiment Inconsistency Analysis
Multi-language support
Fake review detection
Conclusion

Figures (11)

Figure 1: Proposed Multi-Stage Machine Learning Process for Automatic Review Detection and Analysis
Figure 2: Sample screenshot of online reviews from Amazon.com.au platform
Figure 3: A sample review detection and recognition example
Figure 4: Review detection and recognition from Menulog
Figure 5: Review detection and recognition from Product Review
...and 6 more figures

Enhanced Review Detection and Recognition: A Platform-Agnostic Approach with Application to Online Commerce

TL;DR

Abstract

Enhanced Review Detection and Recognition: A Platform-Agnostic Approach with Application to Online Commerce

Authors

TL;DR

Abstract

Table of Contents

Figures (11)