DSpace/Dipòsit Manakin

Codon frequency is modulated by proteic selection, resulting in a coding profile in Archaea and Yeast

Registre simple

dc.contributor Universitat de Vic - Universitat Central de Catalunya. Facultat de Ciències i Tecnologia
dc.contributor Universitat de Vic - Universitat Central de Catalunya. Màster Universitari en Anàlisi de Dades Òmiques
dc.contributor.author Roginski, Paul Luc Maxime
dc.date.accessioned 2021-12-22T08:54:51Z
dc.date.available 2021-12-22T08:54:51Z
dc.date.created 2021-08
dc.date.issued 2021-08
dc.identifier.uri http://hdl.handle.net/10854/6877
dc.description Curs 2020-2021 es
dc.description.abstract Codons as fragments of the genetic code articulate both nucleotidic and proteic constraints. If codon usage bias is now admitted to be mainly influenced by GC content, codon frequencies in general may display a more subtle compromise between base composition and selection at proteic level. In order to investigate the existing non-GC content factors of codon frequencies, we compared coding sequences (CDS) of 280 Archaea plus S. cerevisiae genomes to their randomized version (same base-composition and same length). Through dedicated counts we identified several CDS vs random patterns in Archaea some of which reflecting probable or evident proteic constraint : in particular, the systematic enrichment of CDS in negatively charged amino acids, and the strong constraint existing on codons having a T in second position, which, on the basis of hydrophobic cluster analysis attests a folding constraint. The sum of these patterns constitutes a coding profile that enables to accurately classify about 99% of individual archaea sequences between CDS and randomized CDS. In S. cerevisiae, whose coding profile shares similarities with Archeae of close GC content, phylostratigraphic methods allowed to investigate the coding profile of CDS based on their relative age. This analysis reveals that contrary to other genes, the youngest genes (only found in S. cerevisiae) as a whole do not have a strong coding profile. This can be explained by their relative shortness in comparison with other genes. But even when taking length into account, a clear enrichment of misclassified sequences appears in the youngest S. cerevisiae genes. This enrichment may reflect an insufficient proteic optimization operated by selection. es
dc.format application/pdf es
dc.format.extent 18 p. es
dc.language.iso eng es
dc.rights Tots els drets reservats es
dc.rights.uri https://creativecommons.org/licenses/by-nc-nd/4.0/deed.ca es
dc.subject.other Nucleòtids es
dc.subject.other Aminoàcids es
dc.subject.other Saccharomyces cerevisiae es
dc.subject.other Proteïnes -- Investigació es
dc.subject.other Regió codificant es
dc.title Codon frequency is modulated by proteic selection, resulting in a coding profile in Archaea and Yeast es
dc.type info:eu-repo/semantics/masterThesis es
dc.rights.accessRights info:eu-repo/semantics/openAccess es

Text complet d'aquest document

Registre simple

Tots els drets reservats Tots els drets reservats

Buscar al RIUVic


Cerca avançada

Llistar per

Estadístiques