The possibilities of using biological knowledge for filtering pairs of SNPs in GWAS studies: an exploratory study on public protein-interaction and pathway data.
Authors | |
---|---|
Year of publication | 2014 |
Type | Article in Proceedings |
Conference | Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms |
MU Faculty or unit | |
Citation | |
Web | http://www.scitepress.org/DigitalLibrary/Link.aspx?doi=10.5220/0004915002590264 |
Doi | http://dx.doi.org/10.5220/0004915002590264 |
Field | Informatics |
Keywords | GWAS; SNPs; biological knowledge; databases; genotyping; filtering |
Description | Genome-wide association studies have become a standard way of discovering novel causative alleles by loooking for statisticaly significant associations in patient genotyping data. The present challenge for these methods is to discover associations involving multiple interacting loci, a common phenomenon in diseases often related to epistasis. The main problem is the exponential increase in necessary computational power for every additional interacting locus considered in association tests. Several approaches have been proposed to manage this problem, including limiting analysis to interacting pairs and filtering SNPs according to external biological knowledge. Here we explore the possibilities of using public protein interaction data and pathway maps to filter out only pairs of SNPs that are likely to interact, perhaps because of epistatic mechanisms working at the protein level. After filtering all possible pairs of SNPs by their presence in a common protein-protein interaction or proteins sharing a metabolic or signalling pathway, we calculate the possible reduction in computational requirements under different scenarios. We discuss these exploratory results in the context of the so-called "lost heredity" and the usefulness of similar approach for similar scenarios. |
Related projects: |