NeuroPred-FRL: an interpretable prediction model for identifying neuropeptide using feature representation learning.

Algorithms; Computational Biology / methods; Consensus Sequence; Databases, Genetic; Internet-Based Intervention; Machine Learning; Neuropeptides / chemistry; Position-Specific Scoring Matrices; Reproducibility of Results; Software; Workflow

cross-validation; feature representation learning; machine learning; neuropeptide; two-step feature selection

Journal

Briefings in bioinformatics

ISSN: 1477-4054

Abbreviated title: Brief Bioinform

Country: England

NLM ID: 100912837

Publication information

Publication date:
2021-11-05

History:

received: 2021-02-07

revised: 2021-03-23

accepted: 2021-04-09

pubmed: 2021-05-12

medline: 2022-03-12

entrez: 2021-05-11

Status: ppublish

Abstract

Neuropeptides (NPs) are the most versatile neurotransmitters in the immune systems that regulate various central anxious hormones. An efficient and effective bioinformatics tool for rapid and accurate large-scale identification of NPs is critical in immunoinformatics, which is indispensable for basic research and drug development. Although a few NP prediction tools have been developed, their prediction performance still needs to be improved. In this study, we developed a machine learning-based meta-predictor called NeuroPred-FRL by employing the feature representation learning approach. First, we generated 66 optimal baseline models by combining 11 different encodings with six different classifiers (11 × 6 = 66) and applying a two-step feature selection approach. The NP probability scores predicted by the 66 baseline models were concatenated to form the input feature vector. Second, to enhance the feature representation ability, we applied the two-step feature selection approach to optimize the 66-D probability feature vector and then fed the optimal vector into a random forest classifier to construct the final meta-model (NeuroPred-FRL). Benchmarking experiments based on both cross-validation and independent tests indicate that NeuroPred-FRL achieves superior NP prediction performance compared with other state-of-the-art predictors. We believe that the proposed NeuroPred-FRL can serve as a powerful tool for large-scale identification of NPs, facilitating the characterization of their functional mechanisms and expediting their applications in clinical therapy. Moreover, we interpreted some model mechanisms of NeuroPred-FRL by leveraging the robust SHapley Additive exPlanation algorithm.
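The feature representation step described in the abstract (one baseline model per encoding/classifier pair, with the predicted probabilities concatenated into a single vector) can be illustrated with a minimal Python sketch. This is not the authors' implementation. It assumes two simple encodings and a toy centroid-based scorer in place of the paper's 11 encodings and six classifiers, so it yields a 2 × 2 = 4-dimensional vector rather than a 66-D one. The residue groups, the `CentroidScorer` class, and the peptide sequences are all hypothetical, and the two-step feature selection and the final random forest are omitted.

```python
import math
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# Hypothetical coarse physicochemical groups, used only for this sketch.
GROUPS = ("AVLIMFWP", "GSTCYNQ", "DE", "KRH")


def aac(seq):
    """Amino acid composition: relative frequency of each of the 20 residues."""
    return [seq.count(a) / len(seq) for a in AMINO_ACIDS]


def grouped(seq):
    """Fraction of residues falling in each coarse group."""
    return [sum(seq.count(a) for a in g) / len(seq) for g in GROUPS]


class CentroidScorer:
    """Toy stand-in for a baseline classifier (not one of the paper's six)."""

    def __init__(self, power):
        self.power = power

    @staticmethod
    def _mean(rows):
        return [sum(col) / len(rows) for col in zip(*rows)]

    def fit(self, X, y):
        self.pos = self._mean([x for x, label in zip(X, y) if label == 1])
        self.neg = self._mean([x for x, label in zip(X, y) if label == 0])
        return self

    def predict_proba(self, x):
        """Score in [0, 1]; closer to the positive centroid gives a higher score."""
        d_pos = math.dist(x, self.pos) ** self.power
        d_neg = math.dist(x, self.neg) ** self.power
        total = d_pos + d_neg
        return 0.5 if total == 0 else d_neg / total


ENCODINGS = {"AAC": aac, "GROUPED": grouped}
CLASSIFIERS = {
    "centroid_p1": lambda: CentroidScorer(1),
    "centroid_p2": lambda: CentroidScorer(2),
}


def fit_baselines(seqs, labels):
    """Train one baseline model per (encoding, classifier) pair."""
    models = {}
    for (enc_name, enc), (clf_name, make) in product(
        ENCODINGS.items(), CLASSIFIERS.items()
    ):
        X = [enc(s) for s in seqs]
        models[(enc_name, clf_name)] = (enc, make().fit(X, labels))
    return models


def probability_features(models, seq):
    """Concatenate the baseline scores into one meta-feature vector."""
    return [clf.predict_proba(enc(seq)) for enc, clf in models.values()]


# Made-up toy peptides and labels, for illustration only.
train_seqs = ["KRKRGGSW", "RRKHGSAW", "DEDEALLV", "EEDDVLIA"]
train_labels = [1, 1, 0, 0]

models = fit_baselines(train_seqs, train_labels)
features = probability_features(models, "KKRHGSSW")
print(len(features), [round(p, 3) for p in features])
```

In a realistic setting the probability features for the training set would normally be produced out-of-fold (for example through cross-validation) so that the meta-model is not trained on scores from models that have already seen the same sequences; the sketch skips this for brevity.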

Identifiers

DOI: 10.1093/bib/bbab167

PMID: 33975333

PII: 6272801

Chemical substances

Neuropeptides 0

Publication types

Journal Article Research Support, Non-U.S. Gov't

Languages

eng

Authors

Md Mehedi Hasan (MM)

Md Ashad Alam (MA)

Watshara Shoombuatong (W)

Hong-Wen Deng (HW)

Balachandran Manavalan (B)

Hiroyuki Kurata (H)
