A quantitative reliability metric for querying large database.

Library search Multivariate curve resolution Opioids Quantitative reliability metric

Journal

Forensic science international
ISSN: 1872-6283
Titre abrégé: Forensic Sci Int
Pays: Ireland
ID NLM: 7902034

Informations de publication

Date de publication:
Feb 2022
Historique:
received: 20 02 2021
revised: 28 10 2021
accepted: 18 12 2021
pubmed: 1 1 2022
medline: 1 1 2022
entrez: 31 12 2021
Statut: ppublish

Résumé

A redesigned quantitative reliability metric based on the F-distribution (QRMf) is reported for evaluating the reliability of library search. The QRMf provides orthogonal information to the comparison metric (e.g., dot product) and yields a probabilistic result. An intralibrary search can be considered as an idealized search because the top hit, i.e., the closest matching object, will match perfectly. If the search of an unknown object yields the same hit list as the intralibrary search, it would indicate good reliability. For each object in the hit list, a QRMf compares the order of an intralibrary and interlibrary search results and calculates a variance of interlibrary similarity metrics between the records of the intralibrary search and records in the corresponding positions of the interlibrary search. This variance that measures the discordance of the intra and interlibrary search can simply be compared to the variance of the similarity metrics within the interlibrary search results. The ratio of these variances follows an F-distribution that can be used to determine if the discordance is statistically significant and generates the probability based on the cumulative distribution function. The QRMf works for both similarity and dissimilarity and can be used for any queried object and comparison metric that is searched against a database. In this work, the QRMf was used along with the dot product similarity to query the mass spectra of novel synthetic opioids measured by gas chromatography-mass spectrometry (GC/MS). An automated pipeline was devised that used a basis set correction to assist peak detection. The basis was constructed by mass spectra obtained from the blank measurement preceding the analytical run to remove interferences from column bleed and septum degradation. After peak detection, the pipeline applied multivariate curve resolution to the chromatographic peak window to remove background components from the mass spectra. The corrected mass spectra were searched against a customized library for identification. The QRMf can be used along with the similarity metric to detect misidentifications and assist in finding the correct identification when it is not the closest match.

Identifiants

pubmed: 34972050
pii: S0379-0738(21)00475-8
doi: 10.1016/j.forsciint.2021.111155
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

111155

Informations de copyright

Copyright © 2021 Elsevier B.V. All rights reserved.

Déclaration de conflit d'intérêts

Declaration of competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Auteurs

Zewei Chen (Z)

Chemistry Laboratories, Department of Chemistry and Biochemistry, Ohio University, Athens, OH 45701, USA.

Peter de Boves Harrington (P)

Chemistry Laboratories, Department of Chemistry and Biochemistry, Ohio University, Athens, OH 45701, USA. Electronic address: peter.harrington@ohio.edu.

Preshious Rearden (P)

Research and Development Department, Houston Forensic Science Center, Houston, TX 77002, USA.

Vivekananda Shetty (V)

Research and Development Department, Houston Forensic Science Center, Houston, TX 77002, USA.

Angelica Noyola (A)

Seized Drugs Section, Houston Forensic Science Center, Houston, TX 77002, USA.

Classifications MeSH