Supporting data for "Genome assembly and transcriptome resource for river buffalo, Bubalus bubalis (2n=50)"

Dataset type: Genomic, Transcriptomic
Data released on August 23, 2017

Water buffalo is a globally important species for agriculture and local economies. A de novo assembled, well-annotated reference sequence for the water buffalo is an important prerequisite for studying the biology of this species, and is necessary to manage genetic diversity and to use modern breeding and genomic selection techniques. However, no such genome assembly has been previously reported. There are two species of domestic water buffalo, the river (2n=50) and the swamp (2n=48) buffalo. Here we describe a draft-quality reference sequence for the river buffalo created from Illumina GA and Roche 454 short-read sequences using the MaSuRCA assembler. The assembled sequence is 2.83 Gb, consisting of 366,983 scaffolds with a scaffold N50 of 1.41 Mb and a contig N50 of 21,398 bp. Annotation of the genome was supported by transcriptome data from 30 tissues, and identified 21,711 predicted protein-coding genes. Searches for complete mammalian BUSCO gene groups found 98.6% of curated single-copy orthologs present among predicted genes, which suggests a high level of completeness of the genome. The annotated sequence is available from NCBI at accession GCA_000471725.1.
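The scaffold and contig N50 values quoted above are summary statistics over sequence lengths: the N50 is the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembled length. The following is a minimal sketch of that calculation, not the pipeline used for this dataset. The function names `fasta_lengths` and `n50` and the toy lengths are hypothetical and chosen only for illustration.

```python
def fasta_lengths(lines):
    """Return the length of each record in an iterable of FASTA text lines."""
    lengths = []
    current = None  # length of the record being read; None before the first header
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            # A header line closes the previous record, if there was one.
            if current is not None:
                lengths.append(current)
            current = 0
        elif line and current is not None:
            current += len(line)
    if current is not None:
        lengths.append(current)
    return lengths


def n50(lengths):
    """Return the N50 of a non-empty collection of sequence lengths."""
    if not lengths:
        raise ValueError("N50 is undefined for an empty set of sequences")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down until half the total is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


if __name__ == "__main__":
    toy_lengths = [2, 3, 4, 5, 6]  # hypothetical lengths, total 20
    print(n50(toy_lengths))        # 5, because 6 + 5 = 11 covers half of 20
```

For a real assembly, the lengths would come from passing an open FASTA file handle to `fasta_lengths`, once for the scaffold sequences and once for the contigs, assuming the file is uncompressed text.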

Additional details

Read the peer-reviewed publication(s):

(PubMed: 29048578)

Accessions (data generated as part of this study):

BioProject: PRJEB4351
BioProject: PRJNA207334
Assembly: GCA_000471725.1





Sample ID      Taxonomic ID  Common Name             Genbank Name   Scientific Name  Sample Attributes
abomasum       89462         domestic water buffalo  water buffalo  Bubalus bubalis  Description: RNA extracted from snap frozen tissue ...; Analyte type: RNA; Geographic location (country and/or sea,region): Lo...; ...
Blood          89462         domestic water buffalo  water buffalo  Bubalus bubalis  Description: DNA extracted from blood sample of a f...; Analyte type: DNA; Geographic location (country and/or sea,region): Lo...; ...
bone_marrow    89462         domestic water buffalo  water buffalo  Bubalus bubalis  Description: RNA extracted from snap frozen tissue ...; Analyte type: RNA; Geographic location (country and/or sea,region): Lo...; ...
cerebellum     89462         domestic water buffalo  water buffalo  Bubalus bubalis  Description: RNA extracted from snap frozen tissue ...; Analyte type: RNA; Geographic location (country and/or sea,region): Lo...; ...
embryo_pool    89462         domestic water buffalo  water buffalo  Bubalus bubalis  Description: RNA extracted from snap frozen pool of...; Analyte type: RNA; Geographic location (country and/or sea,region): Lo...; ...
embryo_single  89462         domestic water buffalo  water buffalo  Bubalus bubalis  Description: RNA extracted from fresh blastocyst st...; Analyte type: RNA; Geographic location (country and/or sea,region): Lo...; ...
endometrium    89462         domestic water buffalo  water buffalo  Bubalus bubalis  Description: RNA extracted from snap frozen tissue ...; Analyte type: RNA; Geographic location (country and/or sea,region): Lo...; ...
heart          89462         domestic water buffalo  water buffalo  Bubalus bubalis  Description: RNA extracted from snap frozen tissue ...; Analyte type: RNA; Geographic location (country and/or sea,region): Lo...; ...
hypophysis     89462         domestic water buffalo  water buffalo  Bubalus bubalis  Description: RNA extracted from snap frozen tissue ...; Analyte type: RNA; Geographic location (country and/or sea,region): Lo...; ...
kidney         89462         domestic water buffalo  water buffalo  Bubalus bubalis  Description: RNA extracted from snap frozen tissue ...; Analyte type: RNA; Geographic location (country and/or sea,region): Lo...; ...
Displaying 1-10 of 31 Sample(s).




Data Type         File Format  Size        Release Date
Coding sequence   FASTA        12.6 MB     2017-08-10
Mixed archive     TAR          13.82 MB    2017-08-10
Image             JPG          1004.83 KB  2017-08-10
Coding sequence   FASTA        85.91 MB    2017-08-18
Annotation        TEXT         2.82 MB     2017-08-18
Genome sequence   FASTA        2.72 GB     2017-08-18
Annotation        GFF          352.89 MB   2017-08-18
Protein sequence  FASTA        27.35 MB    2017-08-18
Annotation        UNKNOWN      181.29 MB   2017-08-18
Other             TEXT         0.52 KB     2017-08-18
Displaying 1-10 of 13 File(s).

Funding body                    Awardee    Award ID            Comments
U.S. Department of Agriculture  TPL Smith  5438-31000-073-00D
National Institutes of Health   KD Pruitt  Intramural Research Program

Date              Action
August 23, 2017   Dataset publish
October 2, 2017   Manuscript Link added: 10.1093/gigascience/gix088
November 9, 2022  Manuscript Link updated: 10.1093/gigascience/gix088
February 9, 2023  Link updated: Assembly: GCA_000471725.1