Questions tagged [assembly]
Process of creating the original sequence from the read sequences that it generated during a sequencing experiment. Can refer to genome assembly, in which case the original sequence is a genome, or transcripts assembly, in which case the original sequences are RNA transcripts.
152
questions
0
votes
1
answer
130
views
BUSCO results apparently inconsistent
I have two assemblies obtained from two different softwares for the same run data. In one of them, I get a BUSCO score of 69,4%. In the other one, I get 68,5%. The first assembly covers a 99,7% ...
0
votes
1
answer
591
views
How to de novo hybrid assemble with Pacbio CCS and Illumina PE reads
I would like to perform de novo genome assembly on a diploid microalgal strain.
I have two datasets:
PacBio CCS/HiFi reads, low coverage.
Illumina PE 2x150 (standard shotgun)
Does anybody have any ...
1
vote
1
answer
70
views
Assembling all transcripts for an individual gene? (using single sequence to seed the assembly)
Let's say I have a candidate gene and I believe that in an individual sample, the genome sequence differs from the reference which then interferes with alignment.
Is there a way for me to do a "...
0
votes
2
answers
202
views
How do you set the coverage in PacBio's Sequel II?
I am reading the Whole Genome Sequencing for de novo Assembly Best Practices
Use the Sequel II or IIe System and SMRT® Cell 8M to sequence to desired coverage depth for complexity
of genome
10- to 15-...
5
votes
1
answer
89
views
Can someone help me estimating the runtime of the pipeline applied by the vertebrate genome project?
The vertebrate genome project (VGP)
has a lot of interesting publications such as this one.
The rough pipeline is outlined below:
Here the pipeline in more detail:
While the paper describes all the ...
0
votes
2
answers
332
views
How to know if the DNA sequence has been assembled and why is it important to know how it was assembled?
I have downloaded my FASTA format files, that have the DNA sequences of the coding region of the genes and the DNA sequence of the complete genome, from NCBI. How can I recognize if these sequences ...
1
vote
1
answer
98
views
How to quantifiy of specific genes from shotgun metagenome?
I have googled a "lot", couldn't find any specific answer to the question. So, I am here seeking for your guidance. My question is similar to this. I have several metagenome (n=30). But for ...
1
vote
1
answer
515
views
Trinity assembly from many samples
When you combine samples for de-novo transcriptome assembly with Trinity, do you suggest limiting the number of reads for each sample? I had read one of Matthew MacManes' papers awhile back suggesting ...
0
votes
2
answers
156
views
Genome annotation
I have helped a lab sequence a mitochondrial genome. I used then MITOS to annotate the genome (warning them that I did it just for curiosity as I am not experienced). They submitted the sequence and ...
1
vote
1
answer
565
views
where to find sample contig data
Where or How can I find contigs? I am trying to learn Bioinformatics and I want to make a reference-based gene search. I also want to align contigs with a given reference sequence. From NCBI other ...
-1
votes
2
answers
146
views
Denovo, Stacks: Getting an “ambiguous redirect” error
I tried running the following command
...
0
votes
3
answers
77
views
Genome QC + Assembly Pipeline semantics
I’m trying to create a pipeline for genome assembly. How best can I “redirect/pipe” from existing fasta files (or files in general) to other steps of the pipeline?
I was thinking of going from the SRA ...
1
vote
1
answer
118
views
Quast duplication ratio and mismatches percent
When analyzing Quast results it seems that it doesn't calculate mismatches and indels in a useful way if the "Duplication ratio" is over 1.
For example, that's what I get for an assembly ...
3
votes
2
answers
89
views
Gold standard benchmark
This page is claimed to contain a gold standard benchmark for viral genome assembly.
https://github.com/cbg-ethz/5-virus-mix
The claim is here:
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5411778/
...
0
votes
0
answers
248
views
How to filter a genome assembly consistsing of a large number of contigs?
I did some de novo genome assemblies with Illumina PE data using SPAdes, whereas most of them consisting of a large number of contigs(>1000). I have several questions below.
Do we need to filter ...