Hostname: page-component-77f85d65b8-8v9h9 Total loading time: 0 Render date: 2026-03-30T03:50:11.585Z Has data issue: false hasContentIssue false

A structural and functional bioinformatics study of QTY-designed retinylidene proteins

Published online by Cambridge University Press:  14 July 2025

Siqi Pan*
Affiliation:
Independent Researcher
*
Corresponding author: Siqi Pan; Email: siqipan2008@outlook.com
Rights & Permissions [Opens in a new window]

Abstract

Retinylidene proteins are retinal-binding light-sensitive proteins found in organisms ranging from microbes to human. Microbial opsins have been utilized in optogenetics, while animal opsins are essential for vision and light-dependent metabolic functions. However, retinylidene proteins have hydrophobic transmembrane (TM) domains, which makes them challenging to study. In this structural and functional bioinformatics study, I use the QTY (glutamine, threonine, tyrosine) code to design water-soluble QTY analogues of retinylidene proteins, including nine human and three microbial opsins. I provide superpositions of the AlphaFold3-predicted hydrophobic native proteins and their water-soluble QTY analogues, and experimentally determined structures when available. I also provide a comparison of surface hydrophobicity of the variants. Despite significant changes to the protein sequence (35.53–50.24% in the TM domain), protein characteristics and structures are well preserved. Furthermore, I run molecular dynamics (MD) simulations of native and QTY-designed OPN2 (rhodopsin) and analyze their response to the isomerization of 11-cis-retinal to all-trans-retinal. The results show that the QTY analogue has similar functional behavior to the native protein. The findings of this study indicate that the QTY code can be used as a robust tool to design water-soluble retinylidene proteins. These have potential applications in protein studies, therapeutic treatments, and bioengineering.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Table 1. Protein characteristics of 12 retinylidene proteins and their QTY analogs

Figure 1

Figure 1. The protein sequence alignments of 12 retinylidene proteins and their QTY analogs. Blue tables: human opsins; purple tables: microbial opsins. The symbols $ \mid $ and $ \ast $ indicate that amino acids are identical or different, respectively. Amino acids L, I/V, and F in TM (transmembrane) alpha helices (shown in blue above the sequences) are replaced with Q, T, and Y, respectively. The variation in the TM domain ranges from 35.53% to 50.24%, while the overall variation rate ranges from 15.48% to 30.04%. The characteristics of the proteins are shown above the sequences. Despite large variation rates, the pI only experiences a slight change between 0.00 and 0.16 and the MW increases slightly by a value between 0.04 and 0.60 kDa. The enlarged individual sequence alignments are available in supplementary information. The alignments are (a) OPN1MW versus OPN1MW$ {}^{\mathrm{QTY}} $, (b) OPN1LW versus OPN1LW$ {}^{\mathrm{QTY}} $, (c) OPN1SW versus OPN1SW$ {}^{\mathrm{QTY}} $, (d) OPN2 versus OPN2$ {}^{\mathrm{QTY}} $, (e) OPN3 versus OPN3$ {}^{\mathrm{QTY}} $, (f) OPN4 versus OPN4$ {}^{\mathrm{QTY}} $, (g) OPN5 versus OPN5$ {}^{\mathrm{QTY}} $, (h) RGR versus RGR$ {}^{\mathrm{QTY}} $, (i) RRH versus RRH$ {}^{\mathrm{QTY}} $, (j) BACR versus BACR$ {}^{\mathrm{QTY}} $, (k) BACH versus BACH$ {}^{\mathrm{QTY}} $, and (l) ChR2 versus ChR2$ {}^{\mathrm{QTY}} $.

Figure 2

Figure 2. Superposition of AlphaFold3-predicted native human retinylidene proteins, their QTY analogs, and experimentally determined structures. For clarity, unstructured N- and C-terminal ends are deleted. For (a) to (i), despite significant changes in the protein sequence, the structures superpose very well. The root-mean-square distance (RMSD) values are quite small, from 0.307 to 0.611 Å, with only one exception (OPN2$ {}^{\mathrm{QTY}} $ vs. OPN2$ {}^{\mathrm{EXP}} $, RMSD = 0.999 Å). Green: AlphaFold3-predicted native structure; cyan: AlphaFold3-predicted QTY analog structure; magenta: experimentally determined structure. The superpositions are (a) OPN1MW$ {}^{\mathrm{AF}3} $ versus OPN1MW$ {}^{\mathrm{QTY}} $, (b) OPN1LW$ {}^{\mathrm{AF}3} $ versus OPN1LW$ {}^{\mathrm{QTY}} $, (c) OPN1SW$ {}^{\mathrm{AF}3} $ versus OPN1SW$ {}^{\mathrm{QTY}} $, (d) OPN2$ {}^{\mathrm{AF}3} $ versus OPN2$ {}^{\mathrm{QTY}} $ versus OPN2$ {}^{\mathrm{EXP}} $, (e) OPN3$ {}^{\mathrm{AF}3} $ versus OPN3$ {}^{\mathrm{QTY}} $, (f) OPN4$ {}^{\mathrm{AF}3} $ versus OPN4$ {}^{\mathrm{QTY}} $, (g) OPN5$ {}^{\mathrm{AF}3} $ versus OPN5$ {}^{\mathrm{QTY}} $, (h) RGR$ {}^{\mathrm{AF}3} $ versus RGR$ {}^{\mathrm{QTY}} $, and (i) RRH$ {}^{\mathrm{AF}3} $ versus RRH$ {}^{\mathrm{QTY}} $. For (j) and (k), there is a large degree of similarity between the RMSD between a pair of native proteins and that between the corresponding pair of QTY analogs. Green: OPN1MW; red: OPN1LW; blue: OPN1SW; purple: OPN2; cyan: OPN3; gray: OPN4; olive: OPN5; orange: RGR; pink: RRH. The superpositions are (j) OPN1MW$ {}^{\mathrm{AF}3} $ versus OPN1LW$ {}^{\mathrm{AF}3} $ versus OPN1SW$ {}^{\mathrm{AF}3} $ versus OPN2$ {}^{\mathrm{AF}3} $ versus OPN3$ {}^{\mathrm{AF}3} $ versus OPN4$ {}^{\mathrm{AF}3} $ versus OPN5$ {}^{\mathrm{AF}3} $ versus RGR$ {}^{\mathrm{AF}3} $ versus RRH$ {}^{\mathrm{AF}3} $ and (k) OPN1MW$ {}^{\mathrm{QTY}} $ versus OPN1LW$ {}^{\mathrm{QTY}} $ versus OPN1SW$ {}^{\mathrm{QTY}} $ versus OPN2$ {}^{\mathrm{QTY}} $ versus OPN3$ {}^{\mathrm{QTY}} $ versus OPN4$ {}^{\mathrm{QTY}} $ versus OPN5$ {}^{\mathrm{QTY}} $ versus RGR$ {}^{\mathrm{QTY}} $ versus RRH$ {}^{\mathrm{QTY}} $.

Figure 3

Figure 3. Superposition of AlphaFold3-predicted native microbial retinylidene proteins, their QTY analogs, and experimentally determined structures. Despite significant changes in the protein sequence, the structures superpose very well. The root-mean-square distance (RMSD) values are quite small, with the highest being 0.685 Å. For clarity, unstructured N- and C-terminal ends are deleted. Green: AlphaFold3-predicted native structure; cyan: AlphaFold3-predicted QTY analog structure; magenta: experimentally determined structure. The superpositions are (a) BACR$ {}^{\mathrm{AF}3} $ versus BACR$ {}^{\mathrm{QTY}} $ versus BACR$ {}^{\mathrm{EXP}} $ monomer, (b) BACR$ {}^{\mathrm{AF}3} $ versus BACR$ {}^{\mathrm{QTY}} $ versus BACR$ {}^{\mathrm{EXP}} $ trimer, (c) BACH$ {}^{\mathrm{AF}3} $ versus BACH$ {}^{\mathrm{QTY}} $ versus BACH$ {}^{\mathrm{EXP}} $ monomer, (d) BACH$ {}^{\mathrm{AF}3} $ versus BACH$ {}^{\mathrm{QTY}} $ versus BACH$ {}^{\mathrm{EXP}} $ trimer, (e) ChR2$ {}^{\mathrm{AF}3} $ versus ChR2$ {}^{\mathrm{QTY}} $ versus ChR2$ {}^{\mathrm{EXP}} $ monomer, and (f) ChR2$ {}^{\mathrm{AF}3} $ versus ChR2$ {}^{\mathrm{QTY}} $ versus ChR2$ {}^{\mathrm{EXP}} $ dimer.

Figure 4

Figure 4. Hydrophobic surface of 12 retinylidene proteins and their water-soluble QTY analogs. Hydrophobic patches are shown in yellow, while hydrophilic patches are shown in cyan. The native proteins have many hydrophobic patches due to the presence of hydrophobic amino acids, including L, I, V, and F. After QTY substitution, hydrophilic Q, T, and Y have respectively replaced hydrophobic L, I/V, and F, and the hydrophobic patches in the surface of transmembrane helices have become more hydrophilic. In addition, the surface shape of the native and QTY analogs are very similar. For clarity, unstructured N- and C-terminal ends are deleted. The comparisons are (a) OPN1MW versus OPN1MW$ {}^{\mathrm{QTY}} $, (b) OPN1LW versus OPN1LW$ {}^{\mathrm{QTY}} $, (c) OPN1SW versus OPN1SW$ {}^{\mathrm{QTY}} $, (d) OPN2 versus OPN2$ {}^{\mathrm{QTY}} $, (e) OPN3 versus OPN3$ {}^{\mathrm{QTY}} $, (f) OPN4 versus OPN4$ {}^{\mathrm{QTY}} $, (g) OPN5 versus OPN5$ {}^{\mathrm{QTY}} $, (h) RGR versus RGR$ {}^{\mathrm{QTY}} $, (i) RRH versus RRH$ {}^{\mathrm{QTY}} $, (j) BACR versus BACR$ {}^{\mathrm{QTY}} $ monomer, (k) BACH versus BACH$ {}^{\mathrm{QTY}} $ monomer, (l) ChR2 versus ChR2$ {}^{\mathrm{QTY}} $ monomer, (m) BACR versus BACR$ {}^{\mathrm{QTY}} $ trimer, (n) BACH versus BACH$ {}^{\mathrm{QTY}} $ trimer, and (o) ChR2 versus ChR2$ {}^{\mathrm{QTY}} $ dimer.

Figure 5

Figure 5. The conformational changes of native OPN2 and its QTY analog before and after 11-cis to all-trans isomerization of the chromophore, retinal. (a, b) 1 ns running averages of the root-mean-square distances (RMSD) of the protein–retinal complex, transmembrane helix 6 (TM6), the retinal-binding pocket, and retinal. By convention, the isomerization is set at time 0 ns, which is indicated by a brown, vertical dashed line. (c) Superpositions between cis-state OPN2, trans-state OPN2, cis-state OPN2$ {}^{\mathrm{QTY}} $, and trans-state OPN2$ {}^{\mathrm{QTY}} $. Both OPN2 and OPN2$ {}^{\mathrm{QTY}} $ exhibit conformational changes, with RMSDs greater than 2 Å. Blue: cis-state protein, orange: 11-cis-retinal; yellow: trans-state protein; greenish cyan: all-trans retinal.

Figure 6

Figure 6. Changes in the retinal-binding pocket and protein–ligand interaction in native OPN2 and its QTY analog before and after 11-cis to all-trans isomerization of the chromophore. (a, b) Close-ups of the binding pocket in cis-state. Protein–ligand interactions with lengths shorter or equal to 3.5 Å are shown in the figure. Blue: protein residues, orange: 11-cis-retinal; green dashed lines: ion bridge; yellow dashed lines: van der Waals and/or hydrophobic interactions. (c, d) Close-ups of the binding pocket in trans-state. Protein–ligand interactions with lengths shorter or equal to 3.5 Å are shown in the figure. Yellow: protein residues, greenish cyan: all-trans-retinal; green dashed lines: ion bridge; yellow dashed lines: van der Waals and/or hydrophobic interactions. (e, f) The interaction energies (IEs) between the protein, the binding pocket, individual residues, and retinal. IE is the sum of the short-range Coulombic interaction energy and short-range Lennard–Jones energy. Note that IE is a product of MD simulation and is not necessarily a ‘real’ physical quantity. For clarity, IE is rescaled using the signed pseudo logarithm ($ y=\operatorname{sign}(x)\cdot \ln \left(|x|+1\right) $). By convention, the isomerization is set at time 0 ns, which is indicated by a brown, vertical dashed line. The large changes in IE in OPN2 around $ t=50 $ ns are due to the entrance of water molecules into the binding pocket, near 113E. The QTY analog has more negative IE at residues 208 and 212 due to F to Y substitutions. Besides from these, the similarity between the IE of OPN2 and OPN2$ {}^{\mathrm{QTY}} $ is a strong suggestion of functional conservation. (g, h) The number of hydrogen bonds formed between residues in the binding pocket, and the number of water molecules within the binding pocket. By convention, the isomerization is set at time 0 ns, which is indicated by a brown, vertical dashed line.

Supplementary material: File

Pan supplementary material

Pan supplementary material
Download Pan supplementary material(File)
File 14.4 MB

Author comment: A structural and functional bioinformatics study of QTY-designed retinylidene proteins — R0/PR1

Comments

Dear Editor,

This study is part of a series of studies on protein design with the QTY code by Prof. Shuguang Zhang and his team. I extended QTY-design to a new family of proteins, the retinylidene proteins.

I completed this manuscript individually while maintaining close communication with Prof. Zhang. He has recommended this manuscript to Prof. Bengt Nordén, who has encouraged me to submit it to QRB Discovery.

Thank you very much for considering my work. I look forward to your response.

Sincerely,

Siqi Pan

Email: siqipan2008@outlook.com

Review: A structural and functional bioinformatics study of QTY-designed retinylidene proteins — R0/PR2

Conflict of interest statement

N/A

Comments

This manuscript presents a bioinformatic study of the structure and properties of retinylidene proteins and their QTY-designed variants using AlphaFold3. Both the natural OPN2 and its QTY-designed variant were analyzed using AlphaFold3 and molecular dynamics simulations to investigate their responses to the isomerization of 11-cis-retinal to all-trans-retinal. The study is well-conducted and suitable for publication in QRB Discovery following revision.

1. The manuscript could provide more background and discussion on the downstream signaling pathways of OPN2. What is the potential significance for the treatment of related diseases and the underlying mechanisms of signal transduction?

2. The current MD analysis mainly focuses on structural changes, but it is recommended to include more targeted analysis of functionally relevant regions. Additionally, please clarify how the molecular dynamics simulation setup relates to real physiological processes.

3. In Figure 5, the RMSD of the protein and retinal in the QTY analog appears to be still fluctuating. If longer simulations are not feasible, it is recommended to include statistical analyses over different time windows to demonstrate whether the structure is being stabilized.

4. Please clarify the role and significance of the reverse-QTY design mentioned at the end of the introduction in the context of this study.

5. The first citation of Karagol, A. on page 8 contains garbled characters. This journal name of this citation is also not consistent with others. The citation in the third line of page 9 is incomplete. Please check all reference formats throughout the manuscript.

6. The uses of past and present tenses are not consistent. The manuscript would benefit from editing by a native speaker.

Review: A structural and functional bioinformatics study of QTY-designed retinylidene proteins — R0/PR3

Conflict of interest statement

Reviewer declares none.

Comments

This study applies a water solubilization design method called QTY code to monoamine transporters, computationally characterizing some properties of the designed proteins and investigating the relationship between natural mutations in these monoamine transporters and the QTY code. These findings are intriguing and hold some significance for the study on monoamine transporters and the QTY code; however, there are some issues in the manuscript should be addressed by the authors.

Recommendation: A structural and functional bioinformatics study of QTY-designed retinylidene proteins — R0/PR4

Comments

No accompanying comment.

Decision: A structural and functional bioinformatics study of QTY-designed retinylidene proteins — R0/PR5

Comments

No accompanying comment.

Author comment: A structural and functional bioinformatics study of QTY-designed retinylidene proteins — R1/PR6

Comments

No accompanying comment.

Review: A structural and functional bioinformatics study of QTY-designed retinylidene proteins — R1/PR7

Conflict of interest statement

N/A.

Comments

The author resolved my questions and I think the manuscript is good for acceptance.

Recommendation: A structural and functional bioinformatics study of QTY-designed retinylidene proteins — R1/PR8

Comments

No accompanying comment.

Decision: A structural and functional bioinformatics study of QTY-designed retinylidene proteins — R1/PR9

Comments

No accompanying comment.