Anomaly Detection and Correction of Optimizing Autonomous Systems With Inverse Reinforcement Learning.

Learning Reinforcement, Psychology Algorithms

Journal

IEEE transactions on cybernetics

ISSN: 2168-2275

Titre abrégé: IEEE Trans Cybern

Pays: United States

ID NLM: 101609393

Informations de publication

Date de publication:
Jul 2023

Historique:

medline: 21 10 2022

pubmed: 21 10 2022

entrez: 20 10 2022

Statut: ppublish

Résumé

This article considers autonomous systems whose behaviors seek to optimize an objective function. This goes beyond standard applications of condition-based maintenance, which seeks to detect faults or failures in nonoptimizing systems. Normal agents optimize a known accepted objective function, whereas abnormal or misbehaving agents may optimize a renegade objective that does not conform to the accepted one. We provide a unified framework for anomaly detection and correction in optimizing autonomous systems described by differential equations using inverse reinforcement learning (RL). We first define several types of anomalies and false alarms, including noise anomaly, objective function anomaly, intention (control gain) anomaly, abnormal behaviors, noise-anomaly false alarms, and objective false alarms. We then propose model-free inverse RL algorithms to reconstruct the objective functions and intentions for given system behaviors. The inverse RL procedure for anomaly detection and correction has the training phase, detection phase, and correction phase. First, inverse RL in the training phase infers the objective function and intention of the normal behavior system using offline stored data. Second, in the detection phase, inverse RL infers the objective function and intention for online observed test system behaviors using online observation data. They are then compared with that of the nominal system to identify anomalies. Third, correction is executed for the anomalous system to learn the normal objective and intention. Simulations and experiments on a quadrotor unmanned aerial vehicle (UAV) verify the proposed methods.

Identifiants

DOI: 10.1109/TCYB.2022.3213526 PMID: 36264741

pubmed: 36264741

doi: 10.1109/TCYB.2022.3213526

doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

Pagination

4555-4566

Anomaly Detection and Correction of Optimizing Autonomous Systems With Inverse Reinforcement Learning.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Sous-ensembles de citation

Pagination

Auteurs

Bosen Lian (B)

Yusuf Kartal (Y)

Frank L Lewis (FL)

Dariusz G Mikulski (DG)

Gregory R Hudas (GR)

Yan Wan (Y)

Ali Davoudi (A)

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Multilabel SegSRGAN-A framework for parcellation and morphometry of preterm brain in MRI.

An arithmetic operation P system based on symmetric ternary system.

Unsupervised learning for real-time and continuous gait phase detection.

Classifications MeSH