The contribution of mitochondrial metagenomics to large-scale data mining and phylogenetic analysis of Coleoptera

Linard, Benjamin and Crampton-Platt, Alex and Moriniere, Jerome and Timmermans, Martijn J. T. N. and Andújar, Carmelo and Arribas, Paula and Miller, Kirsten E. and Lipecki, Julia and Favreau, Emeline and Hunter, Amie and Gómez-Rodríguez, Carola and Barton, Christopher and Nie, Ruie and Gillett, Conrad P. D. T. and Breeschoten, Thijmen and Bocak, Ladislav and Vogler, Alfried P. (2018) The contribution of mitochondrial metagenomics to large-scale data mining and phylogenetic analysis of Coleoptera. Molecular Phylogenetics and Evolution, 128, pp. 1-11. ISSN 1055-7903

Abstract

A phylogenetic tree at the species level is still far off for highly diverse insect orders, including the Coleoptera, but the taxonomic breadth of public sequence databases is growing. In addition, new types of data may contribute to increasing taxon coverage, such as metagenomic shotgun sequencing for assembly of mitogenomes from bulk specimen samples. The current study explores the application of these techniques for large-scale efforts to build the tree of Coleoptera. We used shotgun data from 17 different ecological and taxonomic datasets (5 unpublished) to assemble a total of 1942 mitogenome contigs of > 3000 bp. These sequences were combined into a single dataset together with all mitochondrial data available in GenBank, in addition to nuclear markers widely used in molecular phylogenetics. The resulting matrix of nearly 16,000 species with two or more loci produced trees (RAxML) showing overall congruence with the Linnaean taxonomy at hierarchical levels from suborders to genera. We tested the role of full-length mitogenomes in stabilizing the tree from GenBank data, as mitogenomes might link terminals with non-overlapping gene representation. However, the mitogenome data were only partly useful in this respect, presumably because of the purely automated approach to assembly and gene delimitation; future improvements may be possible by using multiple assemblers and manual curation. In conclusion, the combination of data mining and metagenomic sequencing of bulk samples provided the largest phylogenetic tree of Coleoptera to date, which represents a summary of existing phylogenetic knowledge and a defensible tree of great utility, in particular for studies at the intra-familial level, despite some shortcomings for resolving basal nodes.

Item Type: Article
Research Areas: A. > School of Science and Technology > Natural Sciences
Item ID: 24736
Depositing User: Martijn Timmermans
Date Deposited: 31 Jul 2018 16:23
Last Modified: 16 Nov 2018 12:36
URI: http://eprints.mdx.ac.uk/id/eprint/24736
