With the mapping of the human genome by the Human Genome Project and the ongoing mapping of genome of several organisms, the next step of genomics will involve finding an association between a trait and the genetic constituent of an organism. The susceptibility of a human to some particular diseases can also be attributed to a certain variations of the gene or a sequence of the DNA. The information of the sequence will help in finding new remedies like gene therapy methods and new drugs. Several researches proving an association between a trait of a person and the haplotype of the person have been carried out. In our research, we have tried to find an association between a phenotype and haplotype of a person from the population data. We have genotypes of a population expressing a certain trait and also, we have genotypes of another population not expressing the same trait or phenotype. Here, we have implemented an algorithm, which tries to predict a sequence that can be held responsible for that particular trait. Such a sequence should occur in maximum number of genotypes of population displaying that phenotype. To support our result, we also need to confirm that the sequence is not present in any of the genotype of the population, not expressing that phenotype. Also, we have formulated Maximum Collection Set Cover problem which is an interesting set theory problem. This problem can be mapped to the discriminatory haplotyping problem and is open to be proved NP-Hard.
School:University of Cincinnati
School Location:USA - Ohio
Source Type:Master's Thesis
Keywords:haplotype genotype phenotype
Date of Publication:01/01/2004