NCBI taxon id: | 60958 NCBI; ENA; GoaT |
---|---|
Order: | Rhynchobdellida |
Family: | Piscicolidae |
NCBI lineage: | Eukaryota;Metazoa;Spiralia;Lophotrochozoa;Annelida;Clitellata;Hirudinea;Hirudinida;Oceanobdelliformes;Piscicolidae;Piscicola; |
GoaT genome size (M): | 425 (ancestor) |
GoaT asm span (M): | 171 (direct) |
GoaT chr no.: | 16 (ancestor) |
GoaT haploid no.: | 8 (ancestor) |
GoaT ploidy: | 2 (ancestor) |
ToLID prefix: | wrPisGeom |
Below is information about specimens collected for this species retrieved from the Sample Tracking System (STS).
tolid | specimen_id | gal | sex | organism_part | biosample | biospecimen |
---|---|---|---|---|---|---|
wrPisGeom1 | NHMUK014361516 | NATURAL HISTORY MUSEUM | NOT_COLLECTED | WHOLE_ORGANISM | SAMEA7521291 | SAMEA7521191 |
Below are estimates of genome size, repeat size, heterozygosity based on k-mer specta analysis with GenomeScope2.
source | specimen | k-mer | k-cov | haploid size | repeat (%) | heterozygosity (%) | model fit (%) | model error (%) | histogram |
---|---|---|---|---|---|---|---|---|---|
10x | wrPisGeom1 | 31 | 33.76 | 343,222,616 | 86.86 | 6.71 | 81.25 | 1.27 | ![]() ![]() histogram.txt |
pacbio | wrPisGeom1 | 31 | 38.1 | 178,149,545 | 37.27 | 1.00 | 94.62 | 0.29 | ![]() ![]() histogram.txt |
hic-arima2 | wrPisGeom1 | 31 | 269.4 | 155,415,387 | 33.30 | 1.67 | 96.07 | 0.63 | ![]() ![]() histogram.txt |
Below are stats for each PacBio seqeuncing run collected for this species.
pipeline | specimen | date | run id | movie | well | tag | yield | N50 | sample accession | run accession | barcode |
---|---|---|---|---|---|---|---|---|---|---|---|
PacBio - HiFi | wrPisGeom1 | 2020-09-12 | 77367 | m64125_200912_160102 | C01 | 1003 | 14,885,902,350 | 12,635 | SAMEA7521291 | ERR9744411 |
Below are stats for each ONT seqeuncing run collected for this species.
pipeline | specimen | date | run id | flowcell | type | yield | N50 | sample accession | report |
---|---|---|---|---|---|---|---|---|---|
No matching records found |
Below are stats for each Illumina run collected for this species. Click on a row to see associated plots from samtools stats.
pipeline | specimen | date | run id | read pairs | yield | sample accession | run accession | run status | barcode |
---|---|---|---|---|---|---|---|---|---|
Chromium genome | wrPisGeom1 | 2020-11-16 | 35281_7#6 | 78,807,900 | 11,899,992,900 | SAMEA7521291 | qc complete | Piscicola geometra (1.00) | |
Chromium genome | wrPisGeom1 | 2020-11-16 | 35281_7#7 | 72,118,850 | 10,889,946,350 | SAMEA7521291 | qc complete | Piscicola geometra (1.00) | |
Chromium genome | wrPisGeom1 | 2020-11-16 | 35281_7#5 | 91,248,214 | 13,778,480,314 | SAMEA7521291 | qc complete | Piscicola geometra (1.00) | |
Chromium genome | wrPisGeom1 | 2020-11-16 | 35281_7#8 | 72,051,746 | 10,879,813,646 | SAMEA7521291 | qc complete | Piscicola geometra (1.00) | |
Hi-C - Arima v2 | wrPisGeom1 | 2020-10-21 | 35216_5#1 | 841,530,484 | 127,071,103,084 | SAMEA7521291 | ERR9767808 | qc complete | Piscicola geometra (1.00) |
Below are results from a screen of the PacBio data using Mash screen against RefSeq assemblies. Only results with identity over 90% are displayed.
identity | info |
---|---|
0.95473 | [4 seqs] NZ_JH815580.1 Aeromonas veronii AER39 supercont1.1, whole genome shotgun sequence [...] |
0.932793 | NC_018417.1 Candidatus Carsonella ruddii HT isolate Thao2000, complete genome |
Species composition by small subunit (SSU) presence in the assembly with MarkerScan.
specimen | contig | SSU length | attributed taxonomy by SSU |
---|---|---|---|
wrPisGeom1 | atg003941l | 1815 |
|
wrPisGeom1 | ptg000030c | 1501 |
|
wrPisGeom1 | ptg000129l | 1527 |
|
wrPisGeom1 | ptg000236l | 2191 |
|
wrPisGeom1 | ptg000245l | 2199 |
|
wrPisGeom1 | ptg000291c | 1498 |
|
wrPisGeom1 | ptg000736l | 2190 |
|
wrPisGeom1 | ptg000845l | 2196 |
|
MarkerScan cobiont assembly by read separation based on observed families (see above). These reads are both aligned to the assembly and independently re-assembled. The quality of these assemblies is assessed by their completeness according to BUSCO, their span and the number of reads they encompass. For more information here.
specimen | family | classified reads | original assembly | re-assembly | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
count | (%) | BUSCO | BUSCO | contigs | contig length | number of reads | BUSCO | contigs | contig length | number of reads | ||
No matching records found |
Canonical tetranucleotide counts for each contig or scaffold reduced to two dimensions with UMAP to allow visualisation.
Features (colours represent quantile bins):
BTK datasets:
In-progress assembly QC.
specimen | asm | date | contig N50 | contigs | scaffold N50 | scaffolds | length | BUSCO | merqury |
---|---|---|---|---|---|---|---|---|---|
wrPisGeom1 | hicanu | 2020-10-15 | 466,990 | 3,664 | 432,837,072 | - | |||
wrPisGeom1 | hicanu.purging | 2020-10-15 | 865,462 | 1,043 | 202,244,151 | - | |||
wrPisGeom1 | hicanu.scaff.salsa | 2020-10-15 | 785,508 | 1,051 | 6,928,121 | 418 | 202,552,959 | - | |
wrPisGeom1 | hicanu.purging_25 | 2020-10-15 | 865,462 | 1,043 | 202,362,678 | - | |||
wrPisGeom1 | hicanu.repurging | 2020-10-15 | 1,053,160 | 438 | 180,290,300 | - | |||
wrPisGeom1 | hifiasm.purging_e | 2020-10-26 | 1,832,736 | 841 | 1,832,736 | 823 | 200,845,667 | - | |
wrPisGeom1 | hifiasm.purging | 2020-10-26 | 1,832,736 | 826 | 203,602,412 | C:82.8%[S:78.7%,D:4.1%],F:5.8%,M:11.4%,n:978 | - | ||
wrPisGeom1 | hifiasm.scaff.salsa | 2020-10-26 | 1,753,725 | 834 | 7,388,889 | 361 | 203,826,104 | - | |
wrPisGeom1 | hifiasm.purging_25 | 2020-10-26 | 1,801,613 | 832 | 204,571,360 | - | |||
wrPisGeom1 | hifiasm | 2020-10-26 | 1,352,144 | 1,338 | 236,657,719 | C:83.2%[S:75.3%,D:7.9%],F:5.3%,M:11.5%,n:978 | - | ||
wrPisGeom1 | hifiasm.repurging | 2020-10-26 | 1,832,736 | 825 | 202,811,967 | - | |||
wrPisGeom1 | hifiasm.purging | 2021-07-22 | 1,525,907 | 790 | 205,920,083 | C:77.4%[S:71.4%,D:6.0%],F:8.4%,M:14.2%,n:954 | Q37.2-C56.8(10X); Q51.9-C98.8(HiFi) | ||
wrPisGeom1 | hifiasm | 2021-07-22 | 1,139,858 | 1,326 | 243,946,648 | C:77.4%[S:65.0%,D:12.4%],F:8.5%,M:14.1%,n:954 | Q36.7-C57.4(10X); Q51.8-C99.1(HiFi) | ||
wrPisGeom1 | hifiasm.scaff_.salsa | 2022-04-12 | 1,123,688 | 500 | 7,589,004 | 216 | 211,769,108 | - | |
wrPisGeom1 | hifiasm.scaff_.yahs | 2022-04-12 | 1,161,498 | 506 | 8,272,861 | 334 | 211,661,508 | - | |
wrPisGeom1 | hifiasm.purging | 2022-04-12 | 1,219,847 | 489 | 211,547,640 | C:77.7%[S:71.4%,D:6.3%],F:8.3%,M:14.0%,n:954 | Q36.6-C61.9(10X); Q51.2-C98.6(HiFi) | ||
wrPisGeom1 | hifiasm | 2022-04-12 | 750,567 | 1,898 | 411,797,916 | C:79.0%[S:20.6%,D:58.4%],F:7.9%,M:13.1%,n:954 | Q35.9-C63.3(10X); Q51.2-C99.1(HiFi) | ||
wrPisGeom1 | hifiasm.purging_ | 2022-04-12 | 1,219,847 | 489 | 211,627,108 | - | |||
wrPisGeom1 | hifiasm.scaffolding.salsa | 2022-04-12 | 1,123,688 | 500 | 6,853,822 | 223 | 211,686,140 | C:78.0%[S:71.7%,D:6.3%],F:8.1%,M:13.9%,n:954 | Q36.6-C61.9(10X); Q51.2-C98.6(HiFi) |
wrPisGeom1 | hifiasm.scaffolding.yahs | 2022-04-12 | 1,161,498 | 504 | 8,272,861 | 331 | 211,582,240 | C:78.0%[S:71.7%,D:6.3%],F:8.1%,M:13.9%,n:954 | Q36.6-C61.9(10X); Q51.2-C98.6(HiFi) |
wrPisGeom1 | hicanu | 2021-07-22 | 478,806 | 3,574 | 431,037,438 | C:78.0%[S:10.8%,D:67.2%],F:8.1%,M:13.9%,n:954 | Q38.0-C57.8(10X); Q59.4-C99.4(HiFi) | ||
wrPisGeom1 | hicanu.purging | 2021-07-22 | 809,309 | 1,062 | 203,843,664 | C:77.3%[S:71.4%,D:5.9%],F:8.1%,M:14.6%,n:954 | Q37.9-C57.0(10X); Q59.0-C99.2(HiFi) | ||
wrPisGeom1 | hicanu | 2022-04-12 | 448,444 | 4,750 | 496,605,999 | C:78.9%[S:7.8%,D:71.1%],F:8.1%,M:13.0%,n:954 | Q34.8-C42.8(10X); Q56.9-C100.0(HiFi) | ||
wrPisGeom1 | hicanu.purging | 2022-04-12 | 999,568 | 794 | 216,142,654 | C:77.9%[S:70.9%,D:7.0%],F:8.3%,M:13.8%,n:954 | Q36.6-C62.3(10X); Q56.6-C99.0(HiFi) |
In-progress organelle results from MitoHiFi or Oatk.
specimen | asm | organelle | date | length | genes | frameshifts | is circular | reference |
---|---|---|---|---|---|---|---|---|
wrPisGeom1 | mitohifi.hifiasm | mito | 2022-04-12 | 25,026 | 37 | None | False | BK059172.1; 14,788 bp; 37 genes |
wrPisGeom1 | mitohifi.hicanu | mito | 2022-04-12 | 25,172 | 37 | None | True | BK059172.1; 14,788 bp; 37 genes |
wrPisGeom1 | mitohifi.reads | mito | 2022-04-12 | 23,938 | 37 | None | False | BK059172.1; 14,788 bp; 37 genes |