NCBI taxon id: | 1918144 NCBI; ENA; GoaT |
---|---|
Order: | Diptera |
Family: | Tachinidae |
NCBI lineage: | Eukaryota;Metazoa;Ecdysozoa;Arthropoda;Hexapoda;Insecta;Pterygota;Neoptera;Endopterygota;Diptera;Brachycera;Muscomorpha;Oestroidea;Tachinidae;Phasiinae;Phasiini;Phasia; |
GoaT genome size (M): | 691 (ancestor) |
GoaT asm span (M): | 876 (direct) |
GoaT chr no.: | 12 (ancestor) |
GoaT haploid no.: | 6 (ancestor) |
GoaT ploidy: | 3 (ancestor) |
ToLID prefix: | idPhaObes |
Below is information about specimens collected for this species retrieved from the Sample Tracking System (STS).
tolid | specimen_id | gal | sex | organism_part | biosample | biospecimen |
---|---|---|---|---|---|---|
idPhaObes1 | Ox000770 | UNIVERSITY OF OXFORD | NOT_COLLECTED | WHOLE_ORGANISM | SAMEA7746583 | SAMEA7746483 |
idPhaObes2 | Ox001761 | UNIVERSITY OF OXFORD | MALE | WHOLE_ORGANISM | SAMEA10979409 | SAMEA10979021 |
idPhaObes3 | NHMUK013805878 | NATURAL HISTORY MUSEUM | NOT_PROVIDED | ABDOMEN | SAMEA111458359 | SAMEA111458040 |
idPhaObes3 | NHMUK013805878 | NATURAL HISTORY MUSEUM | NOT_PROVIDED | HEAD | SAMEA111458217 | SAMEA111458040 |
idPhaObes3 | NHMUK013805878 | NATURAL HISTORY MUSEUM | NOT_PROVIDED | THORAX | SAMEA111458126 | SAMEA111458040 |
Below are estimates of genome size, repeat size, heterozygosity based on k-mer specta analysis with GenomeScope2.
source | specimen | k-mer | k-cov | haploid size | repeat (%) | heterozygosity (%) | model fit (%) | model error (%) | histogram |
---|---|---|---|---|---|---|---|---|---|
10x | idPhaObes1 | 31 | 23.2 | 875,720,148 | 46.99 | 2.06 | 98.45 | 0.49 | ![]() ![]() histogram.txt |
pacbio | idPhaObes1 | 31 | 15.88 | 749,948,248 | 38.68 | 2.03 | 97.89 | 0.22 | ![]() ![]() histogram.txt |
hic-arima2 | idPhaObes2 | 31 | 50.63 | 901,498,804 | 163.64 | 44.27 | 97.20 | 0.35 | ![]() ![]() histogram.txt |
Below are stats for each PacBio seqeuncing run collected for this species.
pipeline | specimen | date | run id | movie | well | tag | yield | N50 | sample accession | run accession | barcode |
---|---|---|---|---|---|---|---|---|---|---|---|
PacBio - HiFi | idPhaObes1 | 2021-04-03 | 81044 | m64230e_210403_120226 | C01 | 1021 | 4,622,954,643 | 16,205 | SAMEA7746583 | ERR10355973 | |
PacBio - HiFi (ULI) | idPhaObes1 | 2022-09-13 | TRACTION-RUN-249 | m64174e_220913_041213 | D01 | 1022 | 20,992,122,809 | 8,785 | SAMEA7746583 | ERR10355972 |
Below are stats for each ONT seqeuncing run collected for this species.
pipeline | specimen | date | run id | flowcell | type | yield | N50 | sample accession | report |
---|---|---|---|---|---|---|---|---|---|
No matching records found |
Below are stats for each Illumina run collected for this species. Click on a row to see associated plots from samtools stats.
pipeline | specimen | date | run id | read pairs | yield | sample accession | run accession | run status | barcode |
---|---|---|---|---|---|---|---|---|---|
Chromium genome | idPhaObes1 | 2021-06-01 | 37642_3#35 | 108,919,592 | 16,446,858,392 | SAMEA7746583 | ERR10297873 | qc complete | Phasia obesa (1.00) |
Chromium genome | idPhaObes1 | 2021-06-01 | 37642_3#33 | 152,821,004 | 23,075,971,604 | SAMEA7746583 | ERR10297871 | qc complete | Phasia obesa (1.00) |
Chromium genome | idPhaObes1 | 2021-06-01 | 37642_3#34 | 94,993,372 | 14,343,999,172 | SAMEA7746583 | ERR10297872 | qc complete | Phasia obesa (1.00) |
Chromium genome | idPhaObes1 | 2021-06-01 | 37642_3#36 | 74,574,882 | 11,260,807,182 | SAMEA7746583 | ERR10297874 | qc complete | Phasia obesa (1.00) |
Hi-C - Arima v2 | idPhaObes2 | 2022-09-21 | 45846_1#4 | 840,144,286 | 126,861,787,186 | SAMEA10979409 | ERR10297875 | qc complete | Phasia obesa (1.00) |
RNA PolyA | idPhaObes3 | 2024-03-15 | 48593_1#5 | 63,798,136 | 9,633,518,536 | SAMEA111458126 | qc complete |
Below are results from a screen of the PacBio data using Mash screen against RefSeq assemblies. Only results with identity over 90% are displayed.
identity | info |
---|---|
0.989958 | [120 seqs] NZ_JYPC01000001.1 Wolbachia endosymbiont of Operophtera brumata strain Ob_Wba WbOb01_Sc001, whole genome shotgun sequence [...] |
0.986519 | NC_021089.1 Wolbachia endosymbiont of Drosophila simulans wHa, complete genome |
0.943219 | NC_018417.1 Candidatus Carsonella ruddii HT isolate Thao2000, complete genome |
0.913621 | [2 seqs] NC_008513.1 Buchnera aphidicola BCc, complete genome [...] |
0.91124 | [3 seqs] NZ_LN890285.1 Buchnera aphidicola (Tuberolachnus salignus) strain BTs genome assembly, chromosome: 1 [...] |
0.909369 | [2017 seqs] NW_004087753.1 Ichthyophthirius multifiliis unplaced genomic scaffold scaff_1120509249386, whole genome shotgun sequence [...] |
0.909049 | [28 seqs] NZ_LTBM01000001.1 Candidatus Phytoplasma oryzae isolate Mbita1 Pmin.contig.0_1, whole genome shotgun sequence [...] |
0.908727 | NZ_AP013293.1 Candidatus Sulcia muelleri PSPU DNA, complete genome |
0.903242 | [130 seqs] NZ_MUJL01000001.1 Wolbachia pipientis wUni gwu.contig.0_1, whole genome shotgun sequence [...] |
0.902876 | [833 seqs] NC_031481.1 Plasmodium gaboni strain SY75 chromosome 1, whole genome shotgun sequence [...] |
Species composition by small subunit (SSU) presence in the assembly with MarkerScan.
specimen | contig | SSU length | attributed taxonomy by SSU |
---|---|---|---|
idPhaObes1 | atg003530l | 1937 |
|
idPhaObes1 | atg004940l | 2000 |
|
idPhaObes1 | atg005631l | 1976 |
|
idPhaObes1 | ptg000129l | 1499 |
|
idPhaObes1 | ptg000257l | 1982 |
|
idPhaObes1 | ptg001174l | 1500 |
|
idPhaObes1 | ptg001709l | 792 |
|
idPhaObes1 | ptg002115l | 963 |
|
idPhaObes1 | ptg004726l | 1987 |
|
MarkerScan cobiont assembly by read separation based on observed families (see above). These reads are both aligned to the assembly and independently re-assembled. The quality of these assemblies is assessed by their completeness according to BUSCO, their span and the number of reads they encompass. For more information here.
specimen | family | classified reads | original assembly | re-assembly | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
count | (%) | BUSCO | BUSCO | contigs | contig length | number of reads | BUSCO | contigs | contig length | number of reads | ||
idPhaObes1 | Anaplasmataceae | 10,633 | 0.4 | C:100.0%[S:0.0%,D:100.0%],F:0.0%,M:0.0%,n:364 | C:99.7%[S:1.1%,D:98.6%],F:0.0%,M:0.3%,n:364 | 4 | 2.59Mb | 10,030 | C:99.7%[S:0.5%,D:99.2%],F:0.0%,M:0.3%,n:364 | 2 | 2.61Mb |
Canonical tetranucleotide counts for each contig or scaffold reduced to two dimensions with UMAP to allow visualisation.
Features (colours represent quantile bins):
BTK datasets:
In-progress assembly QC.
specimen | asm | date | contig N50 | contigs | scaffold N50 | scaffolds | length | BUSCO | merqury |
---|---|---|---|---|---|---|---|---|---|
idPhaObes1 | hifiasm | 2022-09-21 | 424,248 | 5,012 | 1,037,074,116 | C:98.9%[S:84.0%,D:14.9%],F:0.1%,M:1.0%,n:1367 | Q46.0-C95.5(10X); Q55.4-C98.9(HiFi) | ||
idPhaObes1 | hifiasm.purging | 2022-09-21 | 496,949 | 3,288 | 884,314,648 | C:98.8%[S:97.1%,D:1.7%],F:0.1%,M:1.1%,n:1367 | Q46.3-C94.8(10X); Q55.4-C98.3(HiFi) | ||
idPhaObes1 | hifiasm.scaffolding.salsa | 2022-09-21 | 491,703 | 3,303 | 10,025,522 | 1,246 | 885,343,148 | C:99.0%[S:97.5%,D:1.5%],F:0.1%,M:0.9%,n:1367 | Q46.3-C94.8(10X); Q55.4-C98.3(HiFi) |
idPhaObes1 | hifiasm.scaffolding.yahs | 2022-09-21 | 472,847 | 3,401 | 146,845,135 | 794 | 884,836,048 | C:99.0%[S:97.5%,D:1.5%],F:0.1%,M:0.9%,n:1367 | Q46.3-C94.8(10X); Q55.4-C98.3(HiFi) |
idPhaObes1 | hicanu.purging | 2022-09-21 | 277,195 | 5,923 | 901,325,703 | C:98.6%[S:95.2%,D:3.4%],F:0.5%,M:0.9%,n:1367 | Q47.0-C95.4(10X); Q64.2-C98.9(HiFi) | ||
idPhaObes1 | hicanu | 2022-09-21 | 212,287 | 15,744 | 1,773,699,292 | C:99.3%[S:6.9%,D:92.4%],F:0.3%,M:0.4%,n:1367 | Q45.8-C99.8(10X); Q64.8-C100.0(HiFi) |
In-progress organelle results from MitoHiFi or Oatk.
specimen | asm | organelle | date | length | genes | frameshifts | is circular | seqs | reference |
---|---|---|---|---|---|---|---|---|---|
idPhaObes1 | mitohifi.hifiasm | mito | 2022-09-21 | 17,459 | 37 | None | True | 1 | MK644821.1; 15,792 bp; 37 genes |
idPhaObes1 | mitohifi.hicanu | mito | 2022-09-21 | 13,531 | 34 | COX3 | False | 1 | MK644821.1; 15,792 bp; 37 genes |
idPhaObes1 | mitohifi.reads | mito | 2022-09-21 | 17,459 | 37 | None | False | 1 | MK644821.1; 15,792 bp; 37 genes |