The possibilities of using biological knowledge for filtering pairs of SNPs in GWAS studies: an exploratory study on public protein-interaction and pathway data.

Lexa,  Matej; Štefanič,  Stanislav

The possibilities of using biological knowledge for filtering pairs of SNPs in GWAS studies: an exploratory study on public protein-interaction and pathway data.

Warning

This publication doesn't include Institute of Computer Science. It includes Faculty of Informatics. Official publication website can be found on muni.cz.

Authors	LEXA Matej ŠTEFANIČ Stanislav
Year of publication	2014
Type	Article in Proceedings
Conference	Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms
MU Faculty or unit	Faculty of Informatics
Citation
web	http://www.scitepress.org/DigitalLibrary/Link.aspx?doi=10.5220/0004915002590264
Doi	http://dx.doi.org/10.5220/0004915002590264
Field	Informatics
Keywords	GWAS; SNPs; biological knowledge; databases; genotyping; filtering
Description	Genome-wide association studies have become a standard way of discovering novel causative alleles by loooking for statisticaly significant associations in patient genotyping data. The present challenge for these methods is to discover associations involving multiple interacting loci, a common phenomenon in diseases often related to epistasis. The main problem is the exponential increase in necessary computational power for every additional interacting locus considered in association tests. Several approaches have been proposed to manage this problem, including limiting analysis to interacting pairs and filtering SNPs according to external biological knowledge. Here we explore the possibilities of using public protein interaction data and pathway maps to filter out only pairs of SNPs that are likely to interact, perhaps because of epistatic mechanisms working at the protein level. After filtering all possible pairs of SNPs by their presence in a common protein-protein interaction or proteins sharing a metabolic or signalling pathway, we calculate the possible reduction in computational requirements under different scenarios. We discuss these exploratory results in the context of the so-called "lost heredity" and the usefulness of similar approach for similar scenarios.
Related projects:	THALAssaemia MOdular Stratification System for personalized therapy of beta-thalassemia