Hostname: page-component-77f85d65b8-45ctf Total loading time: 0 Render date: 2026-03-29T00:21:59.002Z Has data issue: false hasContentIssue false

Myofilament-associated proteins with intrinsic disorder (MAPIDs) and their resolution by computational modeling

Published online by Cambridge University Press:  11 January 2023

Bin Sun
Affiliation:
Research Center for Pharmacoinformatics (The State-Province Key Laboratories of Biomedicine-Pharmaceutics of China), Department of Medicinal Chemistry and Natural Medicine Chemistry, College of Pharmacy, Harbin Medical University, Harbin 150081, China
Peter M. Kekenes-Huskey*
Affiliation:
Department of Cell and Molecular Physiology, Loyola University Chicago, IL 60153, USA
*
Author for correspondence: Peter M. Kekenes-Huskey, E-mail: pkekeneshuskey@luc.edu
Rights & Permissions [Opens in a new window]

Abstract

The cardiac sarcomere is a cellular structure in the heart that enables muscle cells to contract. Dozens of proteins belong to the cardiac sarcomere, which work in tandem to generate force and adapt to demands on cardiac output. Intriguingly, the majority of these proteins have significant intrinsic disorder that contributes to their functions, yet the biophysics of these intrinsically disordered regions (IDRs) have been characterized in limited detail. In this review, we first enumerate these myofilament-associated proteins with intrinsic disorder (MAPIDs) and recent biophysical studies to characterize their IDRs. We secondly summarize the biophysics governing IDR properties and the state-of-the-art in computational tools toward MAPID identification and characterization of their conformation ensembles. We conclude with an overview of future computational approaches toward broadening the understanding of intrinsic disorder in the cardiac sarcomere.

Information

Type
Review
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2023. Published by Cambridge University Press
Figure 0

Fig. 1. (a) Schematic illustration of the sarcomere (drawn with BioRender). (b) In this review, we focus on the cardiac proteins proposed in Kooij et al. (2014) with some additional noteworthy examples. For proteins with multiple isoforms, only isoforms with spectra counts (SC) >10 were selected. Proteins with IDR(s) are indicated by red *. Double ** indicates that the IDR(s) has been experimentally confirmed for the gene, while a single * indicates that the confirmation was based on a related isoform or via bioinformatic predictions (see Table 1 for details).

Figure 1

Fig. 2. Core proteins of the thin and thick filament, based on the schematic from Harris et al. (2011). The thin filament structure PDB 6KN8 was constructed from a cryo-EM study (Yamada et al., 2020). PDB 5TBY was generated from homology modeling. The MyBPC3 structure was predicted by AlphaFold and was downloaded from the UniprotKB database. A 2 nm resolution model of tarantula thick filament was built by fitting atomistic component structures to EM density map (PDB 3DTP (Alamo et al., 2008)).

Figure 2

Fig. 3. (a) The IDP-phase diagram developed by the Pappu lab (Holehouse et al., 2017), which groups proteins by characteristic disorder including molten, extended, or compact (Uversky, 2020) classes have also been proposed based on their charge patterns (R1–R5) (Holehouse et al., 2017): R1 corresponds to weak polyampholytes and resembles pre-molten globules. R3 signifies strong polyampholytes with a comparable amount of positively and negatively charged residues and is described as hairpins/coils/chimeras. R2 is the boundary between R1 and R3 where coils and pre-molten globules coexist. R4 and R5 are strong polyampholytes like R3, but with dominant negative and positive residues, respectively. IDPs in R4 and R5 are swollen coils. (be) PONDR-VLXT predicted disordered regions in cardiac myofilament proteins. These proteins are categorized into thin/thick filament(s), Z-disk, and miscellaneous. The IDP region is colored red and interlaced with folded regions. The blue line depicts the first and last amino acid and the number is increasing counterclockwise. The numbers in the parentheses present the percentage of predicted IDR residues, ‘pathogenic or likely pathogenic’ mutations, and phosphorylation sites located in the predicted IDP regions, respectively. The structural state estimation of predicted >5 residue IDR regions before (black dots) and after phosphorylation (magenta dots, if PTM site exists in the IDP region) in the IDP-phase diagram were also shown.

Figure 3

Fig. 4. Solution NMR structures of the common modular domains that are used as building blocks of Z-disk proteins. These structures show highly dynamic termini and evidence of intrinsic disorder in the Z-disk. The PDB IDs are given in parentheses.

Figure 4

Fig. 5. (a) Major secondary structural elements present in folded proteins like actin (PDB 6KN8 (Yamada et al., 2020)). (b) NMR structures of a MAPID, the MyBPC3 construct consisting of the M- and C2 domains (Michie et al., 2016), are shown as an example to illustrate the IDR ensemble. (c) Scaling of radius of gyration (Rg) versus chain length for proteins at different states (Lazar et al., 2021). (d) The IDP phase-diagram for classifying IDPs into five structural states (R1–R5) based on the charge patterns (Holehouse et al., 2017).

Figure 5

Fig. 6. Schematic of transient and covalent influences on IDR structure. Ions can screen intramolecular electrostatics or directly coordinate with charged residues. pH affects the protonation state of ionizable residues (e.g. histidine). Phosphorylation introduces negative charges into the sequence. For crowding, many factors such as size and surface charges of crowders, and volume fraction affect IDR structure (Eq. (7)).

Figure 6

Table 1. Brief summary of reported experimental and computational studies on MAPIDs

Figure 7

Fig. 7. Computational methods commonly used to model IDR structural properties. The IDR propensity of a structure can be predicted from its amino acid sequence by tens of established tools (Liu et al., 2019), and structural states of IDP can be either explicitly modeled (Ozenne et al., 2012) or characterized by an implicit phase diagram (Holehouse et al., 2017). Polymer models including the simplistic freely jointed monomer model and advanced models that account for intramonomer interactions can be used to characterize IDR ensembles (Milstein and Meiners, 2013; Schuler et al., 2016). Particle-based simulations are commonly used to predict IDR conformer structures and associated kinetics (Wang, 2021). All atom simulations consider every individual atom in the system to determine detailed descriptions of the potential energy surface (PES) but are computationally intensive, while coarse-grained simulations lump atoms together to increase sampling efficiency at a modest loss of accuracy. Lastly, statistical models frequently use partition functions to obtain thermodynamic descriptions of IDRs (Hadzi et al., 2021).

Figure 8

Fig. 8. (a) Dynamics of an IDP ensemble represented by a Markov state model (MSM). (b) Experimental methods for structure determination and their temporal resolutions. Timescale information of NMR, X-ray, cryo-EM, and SAXS are taken from Ban (2020). FRET time resolution spans ns to seconds (Okamoto and Sako, 2017). Size information: EM is appropriate for proteins >100 kD (Yeates et al., 2020). NMR is most suitable for <30 kD proteins (Xu et al., 2006). X-ray can solve structures up to 4000 kD (PDB website statistics). FRET is used for medium-sized proteins but has been reported for up to a 540 kD protein (Sielaff et al., 2022).

Figure 9

Fig. 9. (a) Binding of IDP to its partner often goes through a coupled-folding-and-binding process (Sugase et al., 2007), in which both the intramolecular conversion kinetics (section ‘Computational methods for predicting intramolecular dynamics of MAPIDs’) of the IDP and its intermolecular association kinetics (section ‘Computational methods for predicting the MAPID co-assembly’) are important. (b) Theoretical frameworks for describing IDP intermolecular kinetics. The Smoluchowski equation is an approximation for a diffusion-limited association rate (Kim et al., 2018). The Van Valen et al. model (Van Valen et al., 2009) (biochemistry on a leash) combines IDP-enhanced effective concentrations and competitive binding to describe IDP/target binding (Eqs. (10) and (56)). The fly-casting model (Shoemakeret al., 2000) explains the kinetic advantage of IDP/target assembly through its fast searching for binding partners. The Brownian dynamics (browndye (Huber and McCammon, 2010)) and the SEEKR (Votapka et al., 2017) programs both use the Northrup Allison McCammon algorithm (Northrup et al., 1984) provide simulation-based estimates of association kinetics and thermodynamics quantities.

Supplementary material: PDF

Sun and Kekenes-Huskey supplementary material

Sun and Kekenes-Huskey supplementary material

Download Sun and Kekenes-Huskey supplementary material(PDF)
PDF 442.9 KB