NCBI taxon id: | 41880 NCBI; ENA; GoaT |
---|---|
Order: | Pseudoscourfieldiales |
Family: | Pycnococcaceae |
NCBI lineage: | Eukaryota;Viridiplantae;Chlorophyta;Pycnococcaceae;Pycnococcus; |
GoaT genome size (M): | 73 (ancestor) |
GoaT asm span (M): | 32 (direct) |
GoaT chr no.: | 20 (ancestor) |
GoaT haploid no.: | 7 (ancestor) |
GoaT ploidy: | 38 (ancestor) |
ToLID prefix: | ucPycProv |
Below is information about specimens collected for this species retrieved from the Sample Tracking System (STS).
tolid | specimen_id | gal | sex | organism_part | biosample | biospecimen |
---|---|---|---|---|---|---|
ucPycProv1 | SAN0000882 | SANGER INSTITUTE | NOT_COLLECTED | WHOLE_ORGANISM | SAMEA7524550 | SAMEA7524508 |
ucPycProv1 | SAN0000882 | SANGER INSTITUTE | NOT_COLLECTED | WHOLE_ORGANISM | SAMEA7524537 | SAMEA7524508 |
ucPycProv1 | SAN0000882 | SANGER INSTITUTE | NOT_COLLECTED | WHOLE_ORGANISM | SAMEA7524539 | SAMEA7524508 |
ucPycProv1 | SAN0000882 | SANGER INSTITUTE | NOT_COLLECTED | WHOLE_ORGANISM | SAMEA7524549 | SAMEA7524508 |
ucPycProv1 | SAN0000882 | SANGER INSTITUTE | NOT_COLLECTED | WHOLE_ORGANISM | SAMEA7524545 | SAMEA7524508 |
Below are estimates of genome size, repeat size, heterozygosity based on k-mer specta analysis with GenomeScope2.
source | specimen | k-mer | k-cov | haploid size | repeat (%) | heterozygosity (%) | model fit (%) | model error (%) | histogram |
---|---|---|---|---|---|---|---|---|---|
pacbio | ucPycProv1 | 31 | 218.9 | 41,966,312 | 5.14 | 2.13 | 65.69 | 0.26 | ![]() ![]() histogram.txt |
hic-arima2 | ucPycProv1 | 31 | 19.08 | 1,887,775,688 | 127.59 | 6.17 | 81.46 | 0.32 | ![]() ![]() histogram.txt |
Below are stats for each PacBio seqeuncing run collected for this species.
pipeline | specimen | date | run id | movie | well | tag | yield | N50 | sample accession | run accession | barcode |
---|---|---|---|---|---|---|---|---|---|---|---|
PacBio - HiFi | ucPycProv1 | 2020-10-22 | 77883 | m64016_201022_152708 | A01 | 1011 | 741,730,094 | 12,748 | SAMEA7524508 | ERR8705848 | |
PacBio - HiFi | ucPycProv1 | 2020-11-28 | 78610 | m64094_201128_210619 | C01 | 1011 | 19,217,028,692 | 13,209 | SAMEA7524508 | ERR8705849 |
Below are stats for each ONT seqeuncing run collected for this species.
pipeline | specimen | date | run id | flowcell | type | yield | N50 | sample accession | report |
---|---|---|---|---|---|---|---|---|---|
No matching records found |
Below are stats for each Illumina run collected for this species. Click on a row to see associated plots from samtools stats.
pipeline | specimen | date | run id | read pairs | yield | sample accession | run accession | run status | barcode |
---|---|---|---|---|---|---|---|---|---|
Hi-C - Arima v2 | ucPycProv1 | 2021-07-30 | 39559_2#2 | 655,934,820 | 99,046,157,820 | SAMEA7524508 | ERR8571632 | qc complete | Pycnococcus provasolii (1.00) |
Below are results from a screen of the PacBio data using Mash screen against RefSeq assemblies. Only results with identity over 90% are displayed.
identity | info |
---|---|
0.977643 | NZ_CP003984.1 Planktomarina temperata RCA23, complete genome |
0.969782 | [33 seqs] NZ_FOSM01000032.1 Celeribacter marinus strain DSM 100036, whole genome shotgun sequence [...] |
0.95314 | [26 seqs] NZ_GG657907.1 Methylophaga thiooxydans DMS010 genomic scaffold scf_1104571000292, whole genome shotgun sequence [...] |
0.925333 | [44 seqs] NZ_LZFQ01000001.1 Donghicola sp. JL3646 contig1, whole genome shotgun sequence [...] |
0.921831 | NC_015554.1 Alteromonas sp. SN2, complete genome |
0.912148 | [6 seqs] NZ_KI421507.1 Phaeobacter arcticus DSM 23566 genomic scaffold Phaar_neu_scaffold_2516677464.1, whole genome shotgun sequence [...] |
Species composition by small subunit (SSU) presence in the assembly with MarkerScan.
specimen | contig | SSU length | attributed taxonomy by SSU |
---|---|---|---|
No matching records found |
MarkerScan cobiont assembly by read separation based on observed families (see above). These reads are both aligned to the assembly and independently re-assembled. The quality of these assemblies is assessed by their completeness according to BUSCO, their span and the number of reads they encompass. For more information here.
specimen | family | classified reads | original assembly | re-assembly | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
count | (%) | BUSCO | BUSCO | contigs | contig length | number of reads | BUSCO | contigs | contig length | number of reads | ||
No matching records found |
Canonical tetranucleotide counts for each contig or scaffold reduced to two dimensions with UMAP to allow visualisation.
Features (colours represent quantile bins):
BTK datasets:
In-progress assembly QC.
specimen | asm | date | contig N50 | contigs | scaffold N50 | scaffolds | length | BUSCO | merqury |
---|---|---|---|---|---|---|---|---|---|
ucPycProv1 | hifiasm | 2021-05-03 | 810,764 | 731 | 92,702,523 | C:77.3%[S:69.8%,D:7.5%],F:9.8%,M:12.9%,n:255 | Q51.7-C98.8(HiFi) | ||
ucPycProv1 | hicanu | 2021-05-03 | 691,284 | 1,625 | 142,426,038 | C:78.1%[S:37.3%,D:40.8%],F:9.8%,M:12.1%,n:255 | Q65.2-C98.9(HiFi) |
In-progress organelle results from MitoHiFi or Oatk.
specimen | asm | organelle | date | length | genes | frameshifts | is circular | reference |
---|---|---|---|---|---|---|---|---|
No matching records found |