EcoSal Plus
Domain 4:
Synthesis and Processing of Macromolecules
Escherichia coli Small Proteome
- Authors: Matthew R. Hemm1, Jeremy Weaver2,3, and Gisela Storz4
- Editors: Susan T. Lovett5, Deborah Hinton6
-
VIEW AFFILIATIONS HIDE AFFILIATIONSAffiliations: 1: Department of Biological Sciences, Towson University, Towson, MD; 2: Division of Molecular and Cellular Biology, Eunice Kennedy Shriver National Institute of Child Health and Human Development, Bethesda, MD; 3: Present address: Thermo Fisher Scientific, Rockford, IL; 4: Division of Molecular and Cellular Biology, Eunice Kennedy Shriver National Institute of Child Health and Human Development, Bethesda, MD; 5: Brandeis University, Waltham, MA; 6: Laboratory of Cell and Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, Bethesda, MD
-
Received 08 December 2019 Accepted 27 February 2020 Published 08 May 2020
- Address correspondence to Matthew Hemm, [email protected]

-
Abstract:
Escherichia coli was one of the first species to have its genome sequenced and remains one of the best-characterized model organisms. Thus, it is perhaps surprising that recent studies have shown that a substantial number of genes have been overlooked. Genes encoding more than 140 small proteins, defined as those containing 50 or fewer amino acids, have been identified in E. coli in the past 10 years, and there is substantial evidence indicating that many more remain to be discovered. This review covers the methods that have been successful in identifying small proteins and the short open reading frames that encode them. The small proteins that have been functionally characterized to date in this model organism are also discussed. It is hoped that the review, along with the associated databases of known as well as predicted but undetected small proteins, will aid in and provide a roadmap for the continued identification and characterization of these proteins in E. coli as well as other bacteria.
-
Citation: Hemm M, Weaver J, Storz G. 2020. Escherichia coli Small Proteome, EcoSal Plus 2020; doi:10.1128/ecosalplus.ESP-0031-2019




Escherichia coli Small Proteome, Page 1 of 2
< Previous page | Next page > /docserver/preview/fulltext/ecosalplus/9/1/ESP-0031-2019-1.gif /docserver/preview/fulltext/ecosalplus/9/1/ESP-0031-2019-2.gif

References

Article metrics loading...
Abstract:
Escherichia coli was one of the first species to have its genome sequenced and remains one of the best-characterized model organisms. Thus, it is perhaps surprising that recent studies have shown that a substantial number of genes have been overlooked. Genes encoding more than 140 small proteins, defined as those containing 50 or fewer amino acids, have been identified in E. coli in the past 10 years, and there is substantial evidence indicating that many more remain to be discovered. This review covers the methods that have been successful in identifying small proteins and the short open reading frames that encode them. The small proteins that have been functionally characterized to date in this model organism are also discussed. It is hoped that the review, along with the associated databases of known as well as predicted but undetected small proteins, will aid in and provide a roadmap for the continued identification and characterization of these proteins in E. coli as well as other bacteria.

Full text loading...

Figures

Click to view
Figure 1
Small protein gene identification in the E. coli genome over the past 6 years. (A) Histogram of currently annotated protein-coding genes in E. coli compared to those identified in 2013. (B) Histogram of currently annotated small protein genes in E. coli compared to those known in 2013. For both (A) and (B), the light gray bars represent small protein genes annotated in 2013, and the dark gray bars represent genes annotated in 2019. Data on annotated genes in 2013 are from E. coli K12 MG1655 genome annotation U00096.2. Data on annotated genes in 2019 were compiled from a combination of annotations from EcoCyc ( 92 ) and recent papers identifying new small proteins.

Click to view
Figure 2
Ribosome binding sites for representative small protein-coding genes. The sequence logo for E. coli ribosome binding sites is reproduced from reference 93 . Sequences of 12 small protein genes of unknown function are listed below. Red type corresponds to the predicted start codon, while the blue type indicates stretches of four or more G and A residues. Gibbs free energies (ΔG° in kcal/mol) for the interaction between the sequence shown and the 16S RNA were calculated using IntaRNA (http://rna.informatik.uni-freiburg.de/IntaRNA/Input.jsp) ( 94 ). No value is given for the three sequences for which no significant interaction was detected. Rpm (reads per million mapped) values for ribosome profiling carried out in the presence of the inhibitor Onc112 are taken from reference 6 .

Click to view
Figure 3
Structures of representative small proteins. Structures of AcrZ, KdpF, CydX, and CydH (red) in association with the AcrB multidrug efflux pump (PDB 4C48 [ 84 ]), Kdp potassium transporter (PDB 5MRW [ 69 ]), and cytochrome bd-I oxidase (PDB 6RKO [ 80 ]), respectively. The approximate position of the membrane is indicated by shading.
Tables

Click to view
Table 1
Overview of known small protein functions
Supplemental Material
-
Supplemental Table S1
-
MyBook is a cheap paperback edition of the original book and will be sold at uniform, low price.
-
XLSX
11.69 Kb
-
XLSX
-
-
Supplemental Table S2
-
MyBook is a cheap paperback edition of the original book and will be sold at uniform, low price.
-
XLSX
52.68 Kb
-
XLSX
-
-
Supplemental Table S3
-
MyBook is a cheap paperback edition of the original book and will be sold at uniform, low price.
-
XLSX
24.38 Kb
-
XLSX
-
-
Supplemental Table Legends
-
MyBook is a cheap paperback edition of the original book and will be sold at uniform, low price.
-
PDF
105.35 Kb
-
PDF
-