Back to Browse Journals » Advances and Applications in Bioinformatics and Chemistry » Volume 2

On calculating the probability of a set of orthologous sequences

Authors Junfeng Liu, Liang Chen, Hongyu Zhao, Dirk F Moore, Yong Lin, Weichung Joe Shih

Published 26 February 2009 Volume 2009:2 Pages 37—48


Review by Single-blind

Peer reviewer comments 3

Junfeng Liu1,2, Liang Chen3, Hongyu Zhao4, Dirk F Moore1,2, Yong Lin1,2, Weichung Joe Shih1,2

1Biometrics Division, The Cancer, Institute of New Jersey, New Brunswick, NJ, USA; 2Department of Biostatistics, School of Public Health, University of Medicine and Dentistry of New Jersey, Piscataway, NJ, USA; 3Department of Biological Sciences, University of Southern California, Los Angeles, CA, USA; 4Department of Epidemiology and Public Health, Yale University School of Medicine, New Haven, CT, USA

Abstract: Probabilistic DNA sequence models have been intensively applied to genome research. Within the evolutionary biology framework, this article investigates the feasibility for rigorously estimating the probability of a set of orthologous DNA sequences which evolve from a common progenitor. We propose Monte Carlo integration algorithms to sample the unknown ancestral and/or root sequences a posteriori conditional on a reference sequence and apply pairwise Needleman–Wunsch alignment between the sampled and nonreference species sequences to estimate the probability. We test our algorithms on both simulated and real sequences and compare calculated probabilities from Monte Carlo integration to those induced by single multiple alignment.

Keywords: evolution, Jukes–Cantor model, Monte Carlo integration, Needleman–Wunsch alignment, orthologous

Creative Commons License This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at and incorporate the Creative Commons Attribution - Non Commercial (unported, v3.0) License. By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms.

Download Article [PDF] 


Readers of this article also read:

Identification and analysis of potential targets in Streptococcus sanguinis using computer aided protein data analysis

Chowdhury MRH, Bhuiyan MIK, Saha A, Mosleh IM, Mondol S, Ahmed CMS

Advances and Applications in Bioinformatics and Chemistry 2014, 7:45-54

Published Date: 25 November 2014

The comparison of protein-entrapped liposomes and lipoparticles: preparation, characterization, and efficacy of cellular uptake

Chang WK, Tai YJ, Chiang CH, Hu CS, Hong PD, Yeh MK

International Journal of Nanomedicine 2011, 6:2403-2417

Published Date: 20 October 2011

FDR-FET: an optimizing gene set enrichment analysis method

Rui-Ru Ji, Karl-Heinz Ott, Roumyana Yordanova, et al

Advances and Applications in Bioinformatics and Chemistry 2011, 4:37-42

Published Date: 15 March 2011

Classification of heterodimer interfaces using docking models and construction of scoring functions for the complex structure prediction

Yuko Tsuchiya, Eiji Kanamori, Haruki Nakamura, et al

Advances and Applications in Bioinformatics and Chemistry 2009, 2:79-100

Published Date: 22 September 2009

Identification of longevity genes with systems biology approaches

Yuanyou Tan, John M Bush, Weijiu Liu, Fusheng Tang

Advances and Applications in Bioinformatics and Chemistry 2009, 2:49-56

Published Date: 27 February 2009

Evolution of a domain conserved in microtubule-associated proteins of eukaryotes

Alex S Rajangam, Hongqian Yang, Tuula T Teeri, Lars Arvestad

Advances and Applications in Bioinformatics and Chemistry 2008, 1:51-69

Published Date: 23 September 2008