Download - Assessing the limits of phylogenomics: can too much data be a bad thing?

Discover

Podcast Features
Your all-in-one podcasting solution.

Podcast Studio
Easy-to-use audio recorder app.
Livestream
High-performing audio live, without limits.

Podcast App
The best podcast player & podcast app.
Podbean AI
AI-Enhanced Audio Quality and Content Generation.

Ads Marketplace
Join Ads Marketplace to earn money
through sponsorship on your podcast.

PodAds
Manage your ads with dynamic ad insertion capability.
Patron & Paid Content
The seamless way for fans to support you directly
from your podcast.
Apple Podcasts Subscriptions Integration
Effortlessly publish and manage exclusive episodes for your
Apple Podcasts subscribers directly from Podbean.

All Arts Business Comedy Education
Fiction Government Health & Fitness History Kids & Family
Leisure Music News Religion & Spirituality Science
Society & Culture Sports Technology True Crime TV & Film
Live

How to Start a Podcast
How to Start a Live Podcast
How to Monetize a podcast
How to Promote Your Podcast
How to Use Group Recording

Log in
Start your podcast for free

Podcasting
Monetization
Enterprise
Pricing
Discover

Phylogenetics

Technology

Assessing the limits of phylogenomics: can too much data be a bad thing?

2011-06-27

Download Right click and do "save link as"

In contrast to the situation not even 20 years ago, molecular sequence data is now plentiful (if still patchily distributed) and phylogenomic studies of hundreds of taxa on a broad taxonomic scale are becoming increasingly common. Whereas the accuracy of phylogenetic analysis was limited until recently by a shortage of data (and then for both taxa and characters), the results of large and comprehensive phylogenomic studies where data are not limiting are also not without their problems. Analyses including large numbers of taxa run up against the superexponential increase in the number of possible solutions, requiring any or all of more time, faster computers in conjunction with parallel processing, and cleverer heuristics to find a hopefully near optimal solution. Perhaps less appreciated, however, is that the increasing taxonomic scope of our analyses demands the use of large amounts of molecular sequence data with significant rate heterogeneity across the data set (whether between or within partitions) to achieve full resolution throughout the tree. In this talk, I examine how the performance of phylogenetic analysis is affected when analyzing large number of taxa or a large multigene data set incorporating the degree of rate heterogeneity that is to be found, if not needed, in typical phylogenomic data sets.