cabbage white
NCBI taxon id: | 64459 |
---|---|
Order: | Lepidoptera |
Family: | Pieridae |
NCBI lineage: | Eukaryota;Metazoa;Ecdysozoa;Arthropoda;Hexapoda;Insecta;Pterygota;Neoptera;Endopterygota;Lepidoptera;Glossata;Ditrysia;Papilionoidea;Pieridae;Pierinae;Pieris; |
GoaT genome size (M): | 254 (ancestor) |
GoaT asm span (M): | 245 (direct) |
GoaT chr no.: | 50 (direct) |
GoaT ploidy: | 2 (ancestor) |
ToLID prefix: | ilPieRapa |
Below is information about specimens collected for this species retrieved from the Golden Record manifest.
public_name | specimen_id | gal | sex | organism_part | biosample | biospecimen |
---|---|---|---|---|---|---|
ilPieRapa1 | SAN0000822 | SANGER INSTITUTE | FEMALE | WHOLE_ORGANISM | SAMEA7523264 | SAMEA7523164 |
ilPieRapa2 | SAN20001350 | SANGER INSTITUTE | MALE | HEAD | SAMEA111436085 | SAMEA111436077 |
ilPieRapa2 | SAN20001350 | SANGER INSTITUTE | MALE | THORAX | SAMEA111436083 | SAMEA111436077 |
ilPieRapa2 | SAN20001350 | SANGER INSTITUTE | MALE | SPERM_SEMINAL_FLUID | SAMEA111436104 | SAMEA111436077 |
ilPieRapa3 | SAN20001347 | SANGER INSTITUTE | MALE | HEAD | SAMEA111436084 | SAMEA111436078 |
Below are estimates of genome size, repeat size, heterozygosity based on k-mer specta analysis with GenomeScope.
Below are stats for each PacBio seqeuncing run collected for this species.
pipeline | specimen | date | run id | movie | well | tag | yield | N50 | sample accession | run accession | barcode |
---|---|---|---|---|---|---|---|---|---|---|---|
PacBio - HiFi | ilPieRapa1 | 2019-12-23 | TRACTION-RUN-4 | m64016_191223_193312 | A01 | 1018 | 14,514,763,067 | 14,192 | SAMEA7523264 | ERR6606787 |
Illumina run stats.
pipeline | specimen | date | run id | read pairs | yield | sample accession | run accession | run status | barcode |
---|---|---|---|---|---|---|---|---|---|
Hi-C - Arima v1 | ilPieRapa1 | 2020-03-25 | 33892_7#1 | 476,181,972 | 71,903,477,772 | SAMEA7523264 | ERR6002748 | qc complete | Pieris rapae (1.00) |
Chromium genome | ilPieRapa1 | 2020-02-04 | 32955_5#7 | 96,942,244 | 14,638,278,844 | SAMEA7523264 | ERR6002746 | qc complete | Pieris rapae (1.00) |
Chromium genome | ilPieRapa1 | 2020-02-04 | 32955_5#6 | 117,528,962 | 17,746,873,262 | SAMEA7523264 | ERR6002745 | qc complete | Pieris rapae (1.00) |
Chromium genome | ilPieRapa1 | 2020-02-04 | 32955_5#8 | 145,700,076 | 22,000,711,476 | SAMEA7523264 | ERR6002747 | qc complete | Pieris rapae (1.00) |
Chromium genome | ilPieRapa1 | 2020-02-04 | 32955_5#5 | 86,049,572 | 12,993,485,372 | SAMEA7523264 | ERR6002744 | qc complete | Pieris rapae (1.00) |
RNA PolyA | ilPieRapa1 | 2021-05-25 | 37935_7#16 | 32,565,220 | 4,917,348,220 | SAMEA7523264 | ERR6286709 | qc complete | Pieris rapae (1.00) |
Below are results from a screen of the PacBio data using Mash screen against RefSeq assemblies. Only results with identity over 90% are displayed.
identity | info |
---|---|
0.931037 | NC_018417.1 Candidatus Carsonella ruddii HT isolate Thao2000, complete genome |
0.907083 | [2017 seqs] NW_004087753.1 Ichthyophthirius multifiliis unplaced genomic scaffold scaff_1120509249386, whole genome shotgun sequence [...] |
0.904322 | [2 seqs] NC_008513.1 Buchnera aphidicola BCc, complete genome [...] |
0.903605 | NC_002520.1 Amsacta moorei entomopoxvirus 'L', complete genome |
0.900615 | [28 seqs] NZ_LTBM01000001.1 Candidatus Phytoplasma oryzae isolate Mbita1 Pmin.contig.0_1, whole genome shotgun sequence [...] |
Species composition by small subunit (SSU) presence in the assembly with MarkerScan.
specimen | contig | SSU length | attributed taxonomy by SSU |
---|---|---|---|
No matching records found |
MarkerScan cobiont assembly by read separation based on observed families (see above). These reads are both aligned to the assembly and independently re-assembled. The quality of these assemblies is assessed by their completeness according to BUSCO, their span and the number of reads they encompass. For more information here.
specimen | family | classified reads | original assembly | re-assembly | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
count | (%) | BUSCO | BUSCO | contigs | contig length | number of reads | BUSCO | contigs | contig length | number of reads | ||
No matching records found |
Canonical tetranucleotide counts for each contig or scaffold reduced to two dimensions with UMAP to allow visualisation.
Features (colours represent quantile bins):
In-progress assembly QC.
specimen | asm | date | contig N50 | contigs | scaffold N50 | scaffolds | length | BUSCO | merqury |
---|---|---|---|---|---|---|---|---|---|
canu | ccs.purge | 2020-01-15 | 10,943,817 | 62 | 10,943,817 | 53 | 256,312,092 | C:98.7%[S:98.4%,D:0.3%],F:0.3%,M:1.0%,n:1658 | - |
falcon | ccs.2-asm-falcon | 2020-01-03 | 9,422,358 | 62 | 9,422,358 | 55 | 256,212,453 | C:98.6%[S:98.1%,D:0.5%],F:0.4%,M:1.0%,n:1658 | - |
In-progress organelle results from MitoHiFi or Oatk.
specimen | asm | organelle | date | length | genes | frameshifts | is circular | reference |
---|---|---|---|---|---|---|---|---|
No matching records found |