Building Protein Atomic Models from Cryo-EM Density Maps and Residue Co-Evolution.


Journal

Biomolecules
ISSN: 2218-273X
Titre abrégé: Biomolecules
Pays: Switzerland
ID NLM: 101596414

Informations de publication

Date de publication:
13 09 2022
Historique:
received: 29 07 2022
revised: 01 09 2022
accepted: 09 09 2022
entrez: 23 9 2022
pubmed: 24 9 2022
medline: 28 9 2022
Statut: epublish

Résumé

Electron cryo-microscopy (cryo-EM) has emerged as a powerful method by which to obtain three-dimensional (3D) structures of macromolecular complexes at atomic or near-atomic resolution. However, de novo building of atomic models from near-atomic resolution (3-5 Å) cryo-EM density maps is a challenging task, in particular because poorly resolved side-chain densities hamper sequence assignment by automatic procedures at a lower resolution. Furthermore, segmentation of EM density maps into individual subunits remains a difficult problem when the structure of the subunits is not known, or when significant conformational rearrangement occurs between the isolated and associated form of the subunits. To tackle these issues, we have developed a graph-based method to thread most of the C-α trace of the protein backbone into the EM density map. The EM density is described as a weighted graph such that the resulting minimum spanning tree encompasses the high-density regions of the map. A pruning algorithm cleans the tree and finds the most probable positions of the C-α atoms, by using side-chain density when available, as a collection of C-α trace fragments. By complementing experimental EM maps with contact predictions from sequence co-evolutionary information, we demonstrate that this approach can correctly segment EM maps into individual subunits and assign amino acid sequences to backbone traces to generate atomic models.

Identifiants

pubmed: 36139128
pii: biom12091290
doi: 10.3390/biom12091290
pmc: PMC9496541
pii:
doi:

Substances chimiques

Macromolecular Substances 0
Proteins 0

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Références

Nat Commun. 2018 Apr 24;9(1):1618
pubmed: 29691408
EMBO J. 1987 Apr;6(4):1107-14
pubmed: 3297672
Nature. 2015 Sep 10;525(7568):212-217
pubmed: 26280335
Biopolymers. 2012 Sep;97(9):742-60
pubmed: 22696409
Proc Natl Acad Sci U S A. 2021 Jan 12;118(2):
pubmed: 33361332
Nat Methods. 2015 Apr;12(4):335-8
pubmed: 25707029
Nat Commun. 2022 Jul 13;13(1):4066
pubmed: 35831370
PLoS Comput Biol. 2017 Jan 5;13(1):e1005324
pubmed: 28056090
Acta Crystallogr D Biol Crystallogr. 2010 Apr;66(Pt 4):486-501
pubmed: 20383002
Nat Microbiol. 2018 Dec;3(12):1404-1416
pubmed: 30323254
Proc Natl Acad Sci U S A. 2014 Aug 12;111(32):11709-14
pubmed: 25071206
Elife. 2015 Oct 06;4:e10180
pubmed: 26439008
Methods. 2016 May 1;100:42-9
pubmed: 26988127
Methods Enzymol. 2016;579:255-76
pubmed: 27572730
Nucleic Acids Res. 2017 Jul 3;45(W1):W307-W314
pubmed: 28472458
Nature. 2020 Nov;587(7832):152-156
pubmed: 33087931
Elife. 2013 Mar 05;2:e00218
pubmed: 23483797
Nat Methods. 2014 Jan;11(1):63-5
pubmed: 24213166
Proc Natl Acad Sci U S A. 2013 Sep 24;110(39):15674-9
pubmed: 24009338
Science. 2017 Jan 20;355(6322):294-298
pubmed: 28104891
Nat Methods. 2018 Nov;15(11):905-908
pubmed: 30377346
Nat Protoc. 2008;3(7):1171-9
pubmed: 18600222
Nat Commun. 2018 Dec 19;9(1):5385
pubmed: 30568167
Methods Mol Biol. 2013;955:229-41
pubmed: 23132064
Proteins. 2018 Mar;86 Suppl 1:51-66
pubmed: 29071738
Nat Methods. 2022 Feb;19(2):195-204
pubmed: 35132244
Acta Crystallogr D Biol Crystallogr. 2006 Sep;62(Pt 9):1002-11
pubmed: 16929101
J Struct Biol. 2010 Jun;170(3):427-38
pubmed: 20338243
Elife. 2014 Feb 25;3:e01963
pubmed: 24569482
Curr Protoc Bioinformatics. 2006 Oct;Chapter 5:Unit-5.6
pubmed: 18428767
Science. 2014 Mar 28;343(6178):1443-4
pubmed: 24675944
J Struct Biol. 2007 Jan;157(1):281-7
pubmed: 16963278

Auteurs

Guillaume Bouvier (G)

Structural Bioinformatics Unit, Institut Pasteur, Université Paris Cité, CNRS UMR 3528, 75015 Paris, France.

Benjamin Bardiaux (B)

Structural Bioinformatics Unit, Institut Pasteur, Université Paris Cité, CNRS UMR 3528, 75015 Paris, France.

Riccardo Pellarin (R)

Structural Bioinformatics Unit, Institut Pasteur, Université Paris Cité, CNRS UMR 3528, 75015 Paris, France.

Chiara Rapisarda (C)

Microbiologie Fondamentale et Pathogènicité, University of Bordeaux, CNRS UMR 5234, 33076 Bordeaux, France.
Institut Européen de Chimie et Biologie, University of Bordeaux, 33600 Pessac, France.

Michael Nilges (M)

Structural Bioinformatics Unit, Institut Pasteur, Université Paris Cité, CNRS UMR 3528, 75015 Paris, France.

Articles similaires

Photosynthesis Ribulose-Bisphosphate Carboxylase Carbon Dioxide Molecular Dynamics Simulation Cyanobacteria
Databases, Protein Protein Domains Protein Folding Proteins Deep Learning
alpha-Synuclein Humans Animals Mice Lewy Body Disease
Cryoelectron Microscopy Algorithms Image Processing, Computer-Assisted Consensus Software

Classifications MeSH