## Abstract

The problem of haplotype inference under the Mendelian law of inheritance on pedigree genotype data is studied. The minimum recombination principle states that genetic recombinations are rare and haplotypes with fewer recombinations are more likely to exist. Given genotype data on a pedigree, the problem of Minimum Recombination Haplotype Inference (MRHI) is to find a set of haplotype configurations consistent with the genotype data having the minimum number of recombinations. In this paper, we focus on a variation of the MRHI problem that gives more realistic solutions, namely the k-MRHI problem, which has the additional constraint that the number of recombinations in each parent-offspring pair is at most k. Although the k-MRHI problem is NP-hard even for k = 1, the k-MRHI problem with k > 1 can be solved efficiently by dynamic programming in O(nm_{0}^{3k+1}2^{m0}) time by adopting an approach similar to the one used by Doi, Li and Jiang [4] on pedigrees with n nodes and at most mo heterozygous loci in each node. In particular, the 1-MRHI problem can be solved in O(nm_{0}^{4}2^{m0}) time. We propose an O(n^{2}m_{0}) algorithm to find a node as the root of the pedigree tree so as to further reduce the time complexity to O(momin(t _{R})), where t_{R} is the number of feasible haplotype configuration combinations in all trios in the pedigree tree when R is the root. If the pedigree has few generations, the 1-MRHI problem can be solved in O(min{nm_{0}^{4}2^{m0}, nm_{0} ^{l+1}2^{μR+l}}) time, where μ_{R} is the number of heterozygous loci in the root, and 1 is the maximum path length from the root to the leaves in the pedigree tree. Experiments on both real and simulated data confirm the out-performance of our algorithm when compared with other popular algorithms. In most real cases, our algorithm gives the same haplotyping results but runs much faster. In some real cases, other algorithms give an answer which has the least number of recombinations, while our algorithm gives a more credible solution and runs faster.

Original language | English |
---|---|

Title of host publication | Transactions on Computational Systems Biology II |

Publisher | Springer Verlag |

Pages | 100-112 |

Number of pages | 13 |

ISBN (Print) | 3540294015, 9783540294016 |

DOIs | |

Publication status | Published - 2005 |

Externally published | Yes |

Event | International Workshop on Bioinformatics Research and Applications, IWBRA 2005 - Atlanta, GA, United States Duration: 22 May 2005 → 24 May 2005 |

### Publication series

Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|

Volume | 3680 LNBI |

ISSN (Print) | 0302-9743 |

ISSN (Electronic) | 1611-3349 |

### Conference

Conference | International Workshop on Bioinformatics Research and Applications, IWBRA 2005 |
---|---|

Country/Territory | United States |

City | Atlanta, GA |

Period | 22/05/05 → 24/05/05 |