Periodic Reporting for period 4 - PhenomeNal (Inheritance, expressivity and epistasis hidden behindthe phenotypic landscape of natural populations)
Periodo di rendicontazione: 2023-03-01 al 2024-08-31
Dissecting the genotype-phenotype relationship in humans, while extremely important, is very difficult not only due to genetic complexity, pleiotropy, and gene-environment interactions but also by large and complex genomes. As a consequence, the completion of an exhaustive catalogue of genetic variants is still far from being reached. The basic understanding of phenotypic diversity and the genetic architecture underlying it is facilitated by the use of microbial models, in particular the budding yeast S. cerevisiae with its physically small (12 Mb), genetically large (4,500 cM), and highly annotated genome. Interestingly, S. cerevisiae has decidedly greater genetic diversity than humans. Phenotypic diversity among yeast strains is significant, and variation is apparent among natural isolates at different levels. In addition, it is possible to study, test, and precisely measure a large number of phenotypes on the same set of strains. Due to its genome size and currently available tools, the genetic basis of phenotypic diversity is more readily identified and thoroughly characterized in S. cerevisiae than in higher eukaryotes.
A better understanding of the phenotype-genotype relationship requires a deeper knowledge of the effect of genetic variants across a large number of individuals. This is stating the obvious but the question is how can we reach this. Deeper mapping studies by linkage or association, as mentioned previously, will definitely bring some insight into this problem but will be insufficient. Alternative and new strategies are now essential and necessary.
In the framework of this project, we decided to marry classical but high-throughput genetic methods with new approaches based on population genomics to connect the phenotypic and allelic landscapes by taking advantage of the powerful budding yeast model system.
Based on the genomic and phenotypic data from the collection of 1,011 S. cerevisiae isolates, we selected a total of 55 natural isolates and we constructed a diallel by intercrossing these isolates. These strains were chosen to span much of the genetic diversity in the native range of the species. In total, we generated 3,025 hybrids, representing 2,970 heterozygous hybrids with a unique parental combination and 55 homozygous hybrids. All these hybrids were screened for high-resolution quantification of mitotic growth ability across 53 conditions including different carbon sources and chemical compounds impacting various physiological and cellular responses. In total, more than 160,00 hybrid/trait combinations were determined. Using this diallel panel, we performed GWAS and quantified the contribution of low-frequency variants across the selected growth conditions and found that among all the genetic variants detected by GWAS, 16.3% of them have a low-frequency in the initial population and explain a significant part of the phenotypic variance (21% on average). This particular diallel design also presented an intrinsic power to evaluate the additive vs. non-additive genetic components contributing to the phenotypic variation. We assessed the effect of intra-locus dominance on the non-additive genetic component and showed that dominance at the single locus level contributed to the phenotypic variation observed. Altogether, these results have major implications for our understanding of the genetic architecture of traits as they highlighted the extensive role of low-frequency variants on the phenotypic variation.
This year, we took advantage of the diallel panel to assess the overall genetic complexity of traits and the prevalence of phenotypic expressity at a population-scale. In this context, we first selected a subset of the large diallel hybrid panel in order to have 190 unique hybrids coming from 20 natural isolates representative of the S. cerevisiae genetic diversity. For each of these hybrids, a large progeny of 160 individuals (corresponding to 40 full tetrads) was obtained, leading to a total of 30,400 offspring individuals. Their mitotic growth has been assessed on 40 growth conditions inducing various cellular stress. As the phenotypic distribution of the offspring of a given cross allows to infer the inheritance patterns to a trait, we assessed the inheritance patterns for 3,841 cross/trait combinations and revealed that while complex inheritance were the most common, 11% of the cross/traits combinations had their phenotypic variation controlled by a single gene with a large effect and 4% displayed digenic interactions. We identified 26 major effect loci on various traits and parental genetic backgrounds. Measurement of the extent of expressivity was performed by investigating the variation of inheritance patterns throughout all the crosses having one parent who carries one of these loci. We found that trait complexity was highly dynamic and tightly linked to the genetic background. Indeed, 22 out of the 26 major effect loci were subjected to various level of expressivity with one to nine crosses showing departure from monogenic inheritance.
Altogether, the obtained results allowed us to have a first new insight into the genetic architecture of complex traits.