TY - JOUR
T1 - Barnacle
T2 - An assembly algorithm for clone-based sequences of whole genomes
AU - Choi, Vicky
AU - Farach-Colton, Martin
PY - 2003/11/27
Y1 - 2003/11/27
N2 - We propose an assembly algorithm Barnacle for sequences generated by the clone-based approach. We illustrate our approach by assembling the human genome. Our novel method abandons the original physical-mapping-first framework. As we show, Barnacle more effectively resolves conflicts due to repeated sequences which is the main difficulty of the sequence assembly problem. In addition, we are able to detect inconsistencies in the underlying data. We present and compare our results on the December 2001 freeze of the public working draft of the human genome with NCBI's assembly (Build 28). The assembly of December 2001 freeze of the public working draft generated by Barnacle and the source code of Barnacle are available at (http://www.cs.rutgers.edu/∼vchoi).
AB - We propose an assembly algorithm Barnacle for sequences generated by the clone-based approach. We illustrate our approach by assembling the human genome. Our novel method abandons the original physical-mapping-first framework. As we show, Barnacle more effectively resolves conflicts due to repeated sequences which is the main difficulty of the sequence assembly problem. In addition, we are able to detect inconsistencies in the underlying data. We present and compare our results on the December 2001 freeze of the public working draft of the human genome with NCBI's assembly (Build 28). The assembly of December 2001 freeze of the public working draft generated by Barnacle and the source code of Barnacle are available at (http://www.cs.rutgers.edu/∼vchoi).
KW - Clone-based sequencing
KW - Sequence assembly algorithm
UR - http://www.scopus.com/inward/record.url?scp=0242266906&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0242266906&partnerID=8YFLogxK
U2 - 10.1016/S0378-1119(03)00825-4
DO - 10.1016/S0378-1119(03)00825-4
M3 - Article
C2 - 14597400
AN - SCOPUS:0242266906
SN - 0378-1119
VL - 320
SP - 165
EP - 176
JO - Gene
JF - Gene
IS - 1-2
ER -