bims-micpro Biomed News
on Discovery and characterization of microproteins
Issue of 2023–07–23
four papers selected by
Thomas Farid Martínez, University of California, Irvine



  1. Comput Biol Med. 2023 Jul 08. pii: S0010-4825(23)00708-4. [Epub ahead of print]164 107243
      Long non-coding RNAs (LncRNAs) are non-protein coding transcripts more than 200 nucleotides in length. Deep sequencing technologies have unveiled lncRNAs can harbor translatable short open reading frames (sORFs). Yet the regulatory mechanisms governing lncRNA translation events remain poorly understood. Here, we exhaustively detected the sequence, functional element, and structure features relevant to lncRNA translation in human. Extensive identification and analysis reveal that translatable lncRNAs contain richer protein-coding related sequence features, cap-dependent and cap-independent translation initiation mechanisms, and more stable secondary structures, as compared to untranslatable lncRNAs. These findings strongly support lncRNAs serve as a repository for the production of new small peptides. Based on the feature fusion affecting translation and the extreme gradient boosting (XGBoost) algorithm, we developed the first computational tool that dedicated for predicting translatable lncRNAs, named TransLncPred. Benchmark experimental results show that our method outperforms several state-of-the-art RNA coding potential prediction tools on the same training and testing datasets. The 100-time 10-fold cross-validation tests also demonstrate that regulatory element-derived features, especially N7-methylguanosine (m7G) and internal ribosome entry site (IRES), contribute to the improvement in predictive performance.
    Keywords:  Long non-coding RNA; Regulatory element; Secondary structure; Small peptides; sORFs
    DOI:  https://doi.org/10.1016/j.compbiomed.2023.107243
  2. Biotechnol Biofuels Bioprod. 2023 Jul 18. 16(1): 114
       BACKGROUND: Methanotrophs have emerged as promising hosts for the biological conversion of methane into value-added chemicals, including various organic acids. Understanding the mechanisms of acid tolerance is essential for improving organic acid production. WatR, a LysR-type transcriptional regulator, was initially identified as involved in lactate tolerance in a methanotrophic bacterium Methylomonas sp. DH-1. In this study, we investigated the role of WatR as a regulator of cellular defense against weak organic acids and identified novel target genes of WatR.
    RESULTS: By conducting an investigation into the genome-wide binding targets of WatR and its role in transcriptional regulation, we identified genes encoding an RND-type efflux pump (WatABO pump) and previously unannotated small open reading frames (smORFs), watS1 to watS5, as WatR target genes activated in response to acetate. The watS1 to watS5 genes encode polypeptides of approximately 50 amino acids, and WatS1 to WatS4 are highly homologous with one predicted transmembrane domain. Deletion of the WatABO pump genes resulted in decreased tolerance against formate, acetate, lactate, and propionate, suggesting its role as an efflux pump for a wide range of weak organic acids. WatR repressed the basal expression of watS genes but activated watS and WatABO pump genes in response to acetate stress. Overexpression of watS1 increased tolerance to acetate but not to other acids, only in the presence of the WatABO pump. Therefore, WatS1 may increase WatABO pump specificity toward acetate, switching the general weak acid efflux pump to an acetate-specific efflux pump for efficient cellular defense against acetate stress.
    CONCLUSIONS: Our study has elucidated the role of WatR as a key transcription factor in the cellular defense against weak organic acids, particularly acetate, in Methylomonas sp. DH-1. We identified the genes encoding WatABO efflux pump and small polypeptides (WatS1 to WatS5), as the target genes regulated by WatR for this specific function. These findings offer valuable insights into the mechanisms underlying weak acid tolerance in methanotrophic bacteria, thereby contributing to the development of bioprocesses aimed at converting methane into value-added chemicals.
    Keywords:  Acetate tolerance; LysR-type regulator; Methanotroph; RND-type efflux pump; Small open reading frame (smORF)
    DOI:  https://doi.org/10.1186/s13068-023-02364-6
  3. J Biomol Struct Dyn. 2023 Jul 18. 1-13
      The pervasive repertoire of plant molecules with the potential to serve as a substitute for conventional antibiotics has led to obtaining better insights into plant-derived antimicrobial peptides (AMPs). The massive distribution of Small Open Reading Frames (smORFs) throughout eukaryotic genomes with proven extensive biological functions reflects their practicality as antimicrobials. Here, we have developed a pipeline named smAMPsTK to unveil the underlying hidden smORFs encoding AMPs for plant species. By applying this pipeline, we have elicited AMPs of various functional activity of lengths ranging from 5 to 100 aa by employing publicly available transcriptome data of five different angiosperms. Later, we studied the coding potential of AMPs-smORFs, the inclusion of diverse translation initiation start codons, and amino acid frequency. Codon usage study signifies no such codon usage biases for smORFs encoding AMPs. Majorly three start codons are prominent in generating AMPs. The evolutionary and conservational study proclaimed the widespread distribution of AMPs encoding genes throughout the plant kingdom. Domain analysis revealed that nearly all AMPs have chitin-binding ability, establishing their role as antifungal agents. The current study includes a developed methodology to characterize smORFs encoding AMPs, and their implications as antimicrobial, antibacterial, antifungal, or antiviral provided by SVM score and prediction status calculated by machine learning-based prediction models. The pipeline, complete package, and the results derived for five angiosperms are freely available at https://github.com/skbinfo/smAMPsTK.Communicated by Ramaswamy H. Sarma.
    Keywords:  Small open reading frames; antimicrobial peptides; pipeline; plant-derived; smORFs; transcriptome
    DOI:  https://doi.org/10.1080/07391102.2023.2235605
  4. Front Microbiol. 2023 ;14 1191166
      Hundreds of different species of small RNAs can populate a bacterial cell. This small transcriptome contains important information for the adaptation of cellular physiology to environmental changes. Underlying cellular networks involving small RNAs are RNA-RNA and RNA-protein interactions, which are often intertwined. In addition, small RNAs can function as mRNAs. In general, small RNAs are referred to as noncoding because very few are known to contain translated open reading frames. In this article, we intend to highlight that the number of small RNAs that fall within the set of translated RNAs is bound to increase. In addition, we aim to emphasize that the dynamics of the small transcriptome involve different functional codes, not just the genetic code. Therefore, since the role of small RNAs is always code-driven, we believe that there is little reason to continue calling them small noncoding RNAs.
    Keywords:  CsrA/RsmA; Hfq; RNA-binding proteins; bacteria; dual-function sRNAs; riboregulation; small RNAs; small protein
    DOI:  https://doi.org/10.3389/fmicb.2023.1191166