Harnessing the 3D-Beacons Network: A Comprehensive Guide to Accessing and Displaying Protein Structure Data.

Keywords: 3D‐Beacons; FAIR data access; federated data network; macromolecular structures; programmatic access; structural bioinformatics

Journal

Current Protocols
ISSN: 2691-1299
Abbreviated title: Curr Protoc
Country: United States
NLM ID: 101773894

Publication information

Publication date:
May 2024
History:
medline: 9 May 2024
pubmed: 9 May 2024
entrez: 9 May 2024
Status: ppublish

Abstract

Recent advances in protein structure determination, and especially in protein structure prediction, have made vast numbers of macromolecular structures available. However, the lack of standardization among publicly available data resources hinders access to these structures and their integration into scientific workflows. To address this issue, we introduced the 3D-Beacons Network, a unified platform that aims to establish a standardized framework for accessing and displaying protein structure data. In this article, we highlight the importance of standardized approaches for accessing protein structure data and showcase the capabilities of 3D-Beacons. We describe four protocols for finding and accessing macromolecular structures from various specialist data resources via 3D-Beacons. First, we describe three scenarios for programmatically accessing and retrieving data using the 3D-Beacons API. Next, we show how to perform sequence-based searches to find structures from model providers. Then, we demonstrate how to search for structures and fetch them directly into a workflow using Jalview. Finally, we outline how data providers interested in contributing their structures can make them accessible through the 3D-Beacons Network.

© 2024 The Authors. Current Protocols published by Wiley Periodicals LLC.

Basic Protocol 1: Programmatic access to the 3D-Beacons API
Basic Protocol 2: Sequence-based search using the 3D-Beacons API
Basic Protocol 3: Accessing macromolecules from 3D-Beacons with Jalview
Basic Protocol 4: Enhancing data accessibility through 3D-Beacons
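
The programmatic access scenario mentioned in the abstract can be illustrated with a minimal Python sketch. It is not taken from the article. The base URL, the `/uniprot/summary/{accession}.json` path, and the JSON field names (`structures`, `summary`, `model_identifier`, `provider`, `model_category`, `model_url`) are assumptions about the public 3D-Beacons hub API and should be checked against its current documentation. The helper names are hypothetical.

```python
import json
from collections import Counter
from urllib.parse import quote
from urllib.request import urlopen

# Assumed base URL of the public 3D-Beacons hub API; verify before relying on it.
BASE_URL = "https://www.ebi.ac.uk/pdbe/pdbe-kb/3dbeacons/api"


def summary_url(accession: str) -> str:
    """Build the (assumed) summary endpoint URL for a UniProt accession."""
    return f"{BASE_URL}/uniprot/summary/{quote(accession)}.json"


def fetch_summary(accession: str, timeout: float = 30.0) -> dict:
    """Download and decode the JSON summary for one accession (needs network access)."""
    with urlopen(summary_url(accession), timeout=timeout) as response:
        return json.load(response)


def list_models(payload: dict) -> list:
    """Flatten the assumed 'structures' -> 'summary' layout into simple records.

    Missing keys yield None rather than raising, because the exact schema
    is an assumption here.
    """
    models = []
    for entry in payload.get("structures", []):
        summary = entry.get("summary", {})
        models.append({
            "id": summary.get("model_identifier"),
            "provider": summary.get("provider"),
            "category": summary.get("model_category"),
            "url": summary.get("model_url"),
        })
    return models


def count_by_provider(models: list) -> Counter:
    """Count how many models each data provider contributes."""
    return Counter(model["provider"] for model in models)
```

With network access, `list_models(fetch_summary("P38398"))` would return one record per available model for that accession (the accession is only an example), and `count_by_provider` shows how the models are distributed across the federated providers. Parsing is kept separate from downloading so that the parsing step can be tested offline against a saved response.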

Identifiers

pubmed: 38720559
doi: 10.1002/cpz1.1047

Chemical substances

Proteins 0

Publication types

Journal Article; Research Support, Non-U.S. Gov't

Languages

eng

Citation subsets

IM

Pagination

e1047

Grants

Agency: Wellcome Trust
ID: 223739/Z/21/Z
Country: United Kingdom

Copyright information

© 2024 The Authors. Current Protocols published by Wiley Periodicals LLC.

Authors

Paulyna Magaña (P)

Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom.

Sreenath Nair (S)

Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom.

Mihaly Varadi (M)

Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom.

Sameer Velankar (S)

Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom.
