Supporting data for "Hybrid de novo genome assembly of the Chinese herbal plant danshen (Salvia miltiorrhiza Bunge)".
Dataset type: Genomic
Data released on December 04, 2015
Danshen (Salvia miltiorrhiza Bunge), a member of the Lamiaceae family also known as Chinese red sage, is valued in traditional Chinese medicine, mainly for treating cardiovascular and cerebrovascular diseases. Because of this pharmacological potential, ongoing research aims to identify novel bioactive compounds and their biosynthetic pathways in danshen. So far, only EST and RNA-seq data for this herbal plant are available to the public. We therefore propose that the construction of a reference danshen genome will help elucidate the biosynthetic steps for important secondary metabolites, thereby advancing the investigation of novel drugs from the plant.
Here we present the assembly of the highly heterozygous danshen genome, generated from 395× raw read coverage using Illumina technologies and about 10× raw read coverage using single-molecule sequencing technology. The draft genome is approximately 641 Mb, with a contig N50 of 82.8 kb and a scaffold N50 of 1.2 Mb.
Further analyses predicted 27,986 protein-coding genes and 1,147 unique gene families in the danshen genome.
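For readers unfamiliar with the N50 statistic quoted for the contigs and scaffolds, the following is a minimal sketch of the commonly used definition: the length of the sequence at which the running total, taken from longest to shortest, first reaches half of the total assembly length. The function name `n50` and the toy lengths are illustrative assumptions and are not taken from the danshen assembly or from the authors' pipeline.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Assumes the common definition: sort lengths from longest to shortest
    and return the length at which the cumulative sum first reaches at
    least half of the total.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Compare 2 * running with total to avoid fractional halves.
        if 2 * running >= total:
            return length


# Hypothetical toy contig lengths (arbitrary units), not real data.
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_contigs))
```

Applied to real data, the input would be the list of contig or scaffold lengths read from the assembly FASTA file.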
Additional details
Read the peer-reviewed publication(s):
Accessions (data generated as part of this study):
BioProject:
PRJNA287594
Sample ID | Taxonomic ID | Common Name | Genbank Name | Scientific Name | Sample Attributes |
---|---|---|---|---|---|
Salvia miltiorrhiza | 226208 | Salvia miltiorrhiza | Salvia miltiorrhiza | Age:1 Tissue:leaf Alternative accession-SRA Sample:SRS967127 ... |