Crimson: A Data Management System to Support Evaluating Phylogenetic Tree Reconstruction Algorithms

Loading...
Thumbnail Image

Related Collections

Degree type

Discipline

Subject

databases
phylogenetics

Funder

Grant number

License

Copyright date

Distributor

Related resources

Author

Zheng, Yifeng
Fisher, Stephen
Cohen, Shirley
Guo, Sheng
Kim, Junhyong

Contributor

Abstract

Evolutionary and systems biology increasingly rely on the construction of large phylogenetic trees which represent the relationships between species of interest. As the number and size of such trees increases, so does the need for efficient data storage and query capabilities. Although much attention has been focused on XML as a tree data model, phylogenetic trees differ from document-oriented applications in their size and depth, and their need for structure based queries rather than path-based queries. This paper focuses on Crimson, a tree storage system for phylogenetic trees used to evaluate phylogenetic tree reconstruction algorithms within the context of the NSF CIPRes project. A goal of the modeling component of the CIPRes project is to construct a huge simulation tree representing a "gold standard" of evolutionary history against which phylogenetic tree reconstruction algorithms can be tested. In this demonstration, we highlight our storage and indexing strategies and show how Crimson is used for benchmarking phylogenetic tree reconstruction algorithms. We also show how our design can be used to support more general queries over phylogenetic trees.

Advisor

Date of presentation

2006-09-01

Conference name

Departmental Papers (CIS)

Conference dates

2023-05-17T00:20:33.000

Conference location

Date Range for Data Collection (Start Date)

Date Range for Data Collection (End Date)

Digital Object Identifier

Series name and number

Volume number

Issue number

Publisher

Publisher DOI

Journal Issues

Comments

Postprint version. Published in 32nd International Conference on Very Large Data Bases, ACM 2006, September 12-15, 2006, pages 1231-1234. Publisher URL: http://www.vldb.org/dblp/db/conf/vldb/vldb2006.html

Recommended citation

Collection