A quantitative reliability metric for querying large database.
Library search
Multivariate curve resolution
Opioids
Quantitative reliability metric
Journal
Forensic science international
ISSN: 1872-6283
Titre abrégé: Forensic Sci Int
Pays: Ireland
ID NLM: 7902034
Informations de publication
Date de publication:
Feb 2022
Feb 2022
Historique:
received:
20
02
2021
revised:
28
10
2021
accepted:
18
12
2021
pubmed:
1
1
2022
medline:
1
1
2022
entrez:
31
12
2021
Statut:
ppublish
Résumé
A redesigned quantitative reliability metric based on the F-distribution (QRMf) is reported for evaluating the reliability of library search. The QRMf provides orthogonal information to the comparison metric (e.g., dot product) and yields a probabilistic result. An intralibrary search can be considered as an idealized search because the top hit, i.e., the closest matching object, will match perfectly. If the search of an unknown object yields the same hit list as the intralibrary search, it would indicate good reliability. For each object in the hit list, a QRMf compares the order of an intralibrary and interlibrary search results and calculates a variance of interlibrary similarity metrics between the records of the intralibrary search and records in the corresponding positions of the interlibrary search. This variance that measures the discordance of the intra and interlibrary search can simply be compared to the variance of the similarity metrics within the interlibrary search results. The ratio of these variances follows an F-distribution that can be used to determine if the discordance is statistically significant and generates the probability based on the cumulative distribution function. The QRMf works for both similarity and dissimilarity and can be used for any queried object and comparison metric that is searched against a database. In this work, the QRMf was used along with the dot product similarity to query the mass spectra of novel synthetic opioids measured by gas chromatography-mass spectrometry (GC/MS). An automated pipeline was devised that used a basis set correction to assist peak detection. The basis was constructed by mass spectra obtained from the blank measurement preceding the analytical run to remove interferences from column bleed and septum degradation. After peak detection, the pipeline applied multivariate curve resolution to the chromatographic peak window to remove background components from the mass spectra. The corrected mass spectra were searched against a customized library for identification. The QRMf can be used along with the similarity metric to detect misidentifications and assist in finding the correct identification when it is not the closest match.
Identifiants
pubmed: 34972050
pii: S0379-0738(21)00475-8
doi: 10.1016/j.forsciint.2021.111155
pii:
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
111155Informations de copyright
Copyright © 2021 Elsevier B.V. All rights reserved.
Déclaration de conflit d'intérêts
Declaration of competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.