Comparative Genomics

Primary tabs

The main research interest of our group is to understand the complex relationships between genome sequences and phenotypes and how these two features evolve within and across species. We generally use large-scale phylogenetics and molecular evolution approaches that allow looking at the evolution of genomes from the perspective of all of their genes, and we apply these analyses to a variety of biological questions related the evolution and function of biological systems, ranging from individual molecules to entire biological communities.


1. Phylogenomics and genome evolution

In the genomic era it has been possible to move from the evolutionary analysis of single protein families (phylogenetics) to that of complete genomes and proteomes (phylogenomics). To achieve this transition, new tools have been developed that allow the large-scale reconstruction of thousands of phylogenetic trees in an automatic way. Interpreting such type of complex data poses many difficulties and does require the development of novel algorithms, tools, forms of representing the data and even new semantics and concepts. We combine the development of original algorithms to treat phylogenomic data with its application to gain knowledge on problems of biological relevance. We have recently started an ERC-funded project entitled RETVOLUTION: “processes and patterns of reticulate evolution in eukaryotes”, which will combine innovative computational and experimental approaches to infer patterns of reticulate evolution across the eukaryotic tree, and relate this to current biological knowledge. A particular focus is placed on elucidating the role of hybridization in the origin of whole genome duplications, and in facilitating the spread of horizontally transferred genes.

2. Comparative genomics and population genomics of fungal pathogens
Fungal infections constitute an ever-growing and significant medical problem. Diseases caused by such pathogens range from simple toe nail infections, to life-threatening systemic mycoses in patients with impaired immune systems. The molecular mechanisms driving invasion of mammalian hosts by fungal pathogens poses many scientifically challenging problems, which are as yet little understood. The ability to infect humans has emerged in several lineages throughout the fungal tree of life. Therefore, the problem of elucidating the mechanism for pathogenesis of fungi, as proposed here, can be approached with an evolutionary perspective by detecting specific adaptations in pathogenic lineages. During the last years we have clarified the evolutionary paths to virulence of major fungal pathogens such as Candida glabrata and Candida parapsilosis.
3. Microbiome-host interactions in health and disease
As a natural progression of our interest in the role of fungi in health and disease, the group started working on the analysis of the microbiome. Our main focus within this field is quantifying and understanding the role of the fungal component, which is usually neglected by mainstream analyses. In the last three years we have engaged in several massive projects, which include the oral microbiome project “Saca La Lengua” (, with over 1500 samples, the lung microbiome in patients with chronic respiratory syndrome, the lung microbiome in the intensive care unit patient (with artificial ventilators), and the intestinal microbiome in colon cancer. These projects involve analysis of bacterial metagenomes and of the fungal component. For the analysis of the fungal microbiome we have developed novel approaches that overcome existing limitations of the traditional ITS-based sequencing.
4. Evolutionary genomics of long, non-coding RNAs
Recent genomics analyses have facilitated the discovery of a novel major class of stable transcripts, now called long non-coding RNAs (lncRNAs). A growing number of analyses have implicated lncRNAs in the regulation of gene expression, dosage compensation and imprinting, and there is increasing evidence suggesting the involvement of lncRNAs in various diseases such as cancer. Despite recent advances, however, the role of the large majority of lncRNAs remains unknown and there is current debate on what fraction of lncRNAs may just represent transcriptional noise. Our group has recently embarked into a project, funded by the European Research Council that aims to combine state-of-the-art computational and sequencing techniques in order to elucidate what evolutionary mechanisms are shaping this enigmatic component of eukaryotic genomes.
5. Evolution of eukaryotes
Every eukaryotic organism shows a high level of sub-cellular compartmentalization that is significantly more intricate than the most complex prokaryotic cell. How such degree of complexity came to be is still not fully understood. In this context, endo-symbiotic events with bacterial organisms have been proposed to be the source of a number of organelles including mitochondria, chloroplasts and peroxisomes. Only recently, it has been possible to contrast these hypotheses with the growing availability of completely sequenced genomes and organellar proteomic data. In the last years, we have used large-scale evolutionary analyses to investigate the origin and evolution two most widespread organelles for which an endosymbiotic origin has been proposed: mitochondria and peroxisomes.
A lot of more information about us in our group website: