Skip to main content

Questions tagged [assembly]

Process of creating the original sequence from the read sequences that it generated during a sequencing experiment. Can refer to genome assembly, in which case the original sequence is a genome, or transcripts assembly, in which case the original sequences are RNA transcripts.

0 votes
1 answer
130 views

BUSCO results apparently inconsistent

I have two assemblies obtained from two different softwares for the same run data. In one of them, I get a BUSCO score of 69,4%. In the other one, I get 68,5%. The first assembly covers a 99,7% ...
juanjo75es's user avatar
0 votes
1 answer
591 views

How to de novo hybrid assemble with Pacbio CCS and Illumina PE reads

I would like to perform de novo genome assembly on a diploid microalgal strain. I have two datasets: PacBio CCS/HiFi reads, low coverage. Illumina PE 2x150 (standard shotgun) Does anybody have any ...
bishopia's user avatar
1 vote
1 answer
70 views

Assembling all transcripts for an individual gene? (using single sequence to seed the assembly)

Let's say I have a candidate gene and I believe that in an individual sample, the genome sequence differs from the reference which then interferes with alignment. Is there a way for me to do a "...
story's user avatar
  • 1,603
0 votes
2 answers
202 views

How do you set the coverage in PacBio's Sequel II?

I am reading the Whole Genome Sequencing for de novo Assembly Best Practices Use the Sequel II or IIe System and SMRT® Cell 8M to sequence to desired coverage depth for complexity of genome 10- to 15-...
ilam engl's user avatar
  • 280
5 votes
1 answer
89 views

Can someone help me estimating the runtime of the pipeline applied by the vertebrate genome project?

The vertebrate genome project (VGP) has a lot of interesting publications such as this one. The rough pipeline is outlined below: Here the pipeline in more detail: While the paper describes all the ...
ilam engl's user avatar
  • 280
0 votes
2 answers
332 views

How to know if the DNA sequence has been assembled and why is it important to know how it was assembled?

I have downloaded my FASTA format files, that have the DNA sequences of the coding region of the genes and the DNA sequence of the complete genome, from NCBI. How can I recognize if these sequences ...
HelpNeederStudent's user avatar
1 vote
1 answer
98 views

How to quantifiy of specific genes from shotgun metagenome?

I have googled a "lot", couldn't find any specific answer to the question. So, I am here seeking for your guidance. My question is similar to this. I have several metagenome (n=30). But for ...
Nok Imchen's user avatar
1 vote
1 answer
515 views

Trinity assembly from many samples

When you combine samples for de-novo transcriptome assembly with Trinity, do you suggest limiting the number of reads for each sample? I had read one of Matthew MacManes' papers awhile back suggesting ...
CephBirk's user avatar
  • 151
0 votes
2 answers
156 views

Genome annotation

I have helped a lab sequence a mitochondrial genome. I used then MITOS to annotate the genome (warning them that I did it just for curiosity as I am not experienced). They submitted the sequence and ...
juanjo75es's user avatar
1 vote
1 answer
565 views

where to find sample contig data

Where or How can I find contigs? I am trying to learn Bioinformatics and I want to make a reference-based gene search. I also want to align contigs with a given reference sequence. From NCBI other ...
abelfit's user avatar
  • 73
-1 votes
2 answers
146 views

Denovo, Stacks: Getting an “ambiguous redirect” error

I tried running the following command ...
BioBash's user avatar
  • 19
0 votes
3 answers
77 views

Genome QC + Assembly Pipeline semantics

I’m trying to create a pipeline for genome assembly. How best can I “redirect/pipe” from existing fasta files (or files in general) to other steps of the pipeline? I was thinking of going from the SRA ...
user10415's user avatar
1 vote
1 answer
118 views

Quast duplication ratio and mismatches percent

When analyzing Quast results it seems that it doesn't calculate mismatches and indels in a useful way if the "Duplication ratio" is over 1. For example, that's what I get for an assembly ...
juanjo75es's user avatar
3 votes
2 answers
89 views

Gold standard benchmark

This page is claimed to contain a gold standard benchmark for viral genome assembly. https://github.com/cbg-ethz/5-virus-mix The claim is here: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5411778/ ...
juanjo75es's user avatar
0 votes
0 answers
248 views

How to filter a genome assembly consistsing of a large number of contigs?

I did some de novo genome assemblies with Illumina PE data using SPAdes, whereas most of them consisting of a large number of contigs(>1000). I have several questions below. Do we need to filter ...
YP CHEN's user avatar
  • 81

15 30 50 per page
1 2 3
4
5
11