Abstract:
Environmental DNA (eDNA) technology is an emerging method for ecological and biodiversity monitoring and evaluation, and is one of the frontier technologies. A complete and accurate reference sequence database is the basis of the application of eDNA technology to the investigation of aquatic biodiversity. At present, there are still some problems in eDNA reference sequences of different aquatic organisms, such as the use of different marker genes for different groups of aquatic organisms, the scattered distribution and inaccurate classification of reference sequences, and the lacking of reference sequences for aquatic organisms in China. To address these issues, the Aquatic eDNA database (AeDNA, http://aedna.ihb.ac.cn/) was constructed in this study. AeDNA integrates two types of reference sequences: DNA barcode and genome database. The DNA barcode database contains more than 600000 records of different marker genes, including 18S, 28S, ITS, CO1, 12S and rbcL, etc., which covers about 20000, 10000, 10000, 10000 and 10000 species of fishes, plants, Zoobenthos, zooplankton and phytoplankton, respectively. The genome database contains 6199 organelle genomes of mitochondria and chloroplast, as well as the genomes of fish and protist which generated by the 10000 Fish Genomes Project and the Protist 10000 Genomes Project. The habitats cover various water environments such as rivers, lakes, seas, glaciers and hot springs. In particular, AeDNA contains more than 60000 reference sequences of aquatic organisms in various water environments of China. In summary, AeDNA is a comprehensive eDNA reference sequence database with large amount of data, high coverage of taxa, high accuracy and more data for aquatic organisms in China. It will be a fundamental resource for aquatic ecology and biodiversity monitoring.