Source
IEEE Access
DATE OF PUBLICATION
02/03/2025
Authors
Share

SensorDBSCAN: Semi-Supervised Active Learning Powered Method for Anomaly Detection and Diagnosis

Abstract

Fault detection and diagnosis (FDD) is a critical challenge in industrial processes aimed atminimizing risks such as safety hazards, costly downtime, and suboptimal production. Traditional supervisedFDD methods offer great performance while heavily relying on large volumes of labeled data, whereasunsupervised methods do not depend on labeled data, though are inferior in performance compared tosupervised ones. In this paper, we propose SensorDBSCAN, a novel semi-supervised method for anomalydetection and diagnosis. The key innovation lies in achieving good performance with minimal labeleddata - less than 1% of the dataset - by leveraging active and contrastive learning techniques. The proposedapproach combines a transformer-based encoder trained with a triplet-based contrastive learning objectiveand the classical density-based clustering algorithm DBSCAN, enabling strong feature extraction, efficientand interpretable feature space organization and simple clustering algorithm. Unlike existing methods,SensorDBSCAN eliminates the need for manual labeling large amounts of data, cluster analysis, and predefining cluster numbers, providing greater usability in real-world cases. We validate the effectiveness of ourmethod on the Tennessee Eastman Process (TEP) and its advanced simulations (TEP Rieth and TEP Rieker).SensorDBSCAN demonstrates better performance on well-known and realistic datasets, reducing labelingrequirements while maintaining high accuracy of fault detection and diagnostics.The code is available at https://github.com/K0mp0t/sensordbscan/tree/tripletloss.

Join AIRI