Questions tagged [assembly]

Ask Question

Process of creating the original sequence from the read sequences that it generated during a sequencing experiment. Can refer to genome assembly, in which case the original sequence is a genome, or transcripts assembly, in which case the original sequences are RNA transcripts.

152 questions

35 votes

2 answers

3k views

Why do some assemblers require an odd-length kmer for the construction of de Bruijn graphs?

Why do some assemblers like SOAPdenovo2 or Velvet require an odd-length k-mer size for the construction of de Bruijn graph, while some other assemblers like ABySS are fine with even-length k-mers?

Kamil S Jaron

5,632

asked May 19, 2017 at 18:34

19 votes

3 answers

699 views

How to deal with heterozygosity during polishing of genome assembly based on long reads?

All the long-read sequencing platforms are based on single-molecule sequencing which causes higher per-base error rates. For this reason a polishing step was added to genome assembly pipelines - ...

Kamil S Jaron

5,632

asked May 21, 2017 at 11:49

18 votes

1 answer

652 views

How can I improve a long-read assembly with a repetitive genome?

I'm currently trying to assembly a genome from a rodent parasite, Nippostrongylus brasiliensis. This genome does have an existing reference genome, but it is highly fragmented. Here are some ...

gringer♦

14.7k

asked May 25, 2017 at 11:46

14 votes

3 answers

530 views

How to make a distinction between the "classical" de Bruijn graph and the one described in NGS papers?

In Computer Science a De Bruijn graph has (1) m^n vertices representing all possible sequences of length n over ...

Leo Martins

asked May 19, 2017 at 10:32

13 votes

5 answers

358 views

Improve a reference genome with sequencing data

I have a DNA sample which I know doesn't quite match my reference genome - my culture comes from a subpopulation which has undergone significant mutation since the reference was created. The example I ...

Scott Gigante

2,173

asked May 18, 2017 at 3:11

10 votes

2 answers

281 views

Extract sequence context of high-degree nodes in assembly graphs

I often use metaSPAdes to assemble short reads from human microbiomes. My simplified understanding of short-read de Bruijn graph assemblers is that they fail where ambiguous paths cannot be resolved. ...

acvill

asked Feb 2, 2022 at 17:16

10 votes

3 answers

723 views

Pooling data in metagenome assembly

I have 12 human gut microbiome WGS Nextseq reads (151 bp paired end). What will be an effective strategy to assemble a metagenome? Let us say I have already filtered the fastq for quality, adapter ...

deepseas

asked Jun 29, 2017 at 16:24

9 votes

2 answers

2k views

Is there a standard definition for "assembly polishing"?

Is there a standard definition for "assembly polishing" in the field? Is there a standard definition for what polishing algorithms do? My understanding of "polishing" is strongly influenced by ...

EB2127

1,423

asked May 15, 2018 at 19:19

8 votes

2 answers

2k views

estimate genome size: kmer-based approach from PacBio reads

Can anyone suggest a software/method for kmer analysis using PacBio reads (RSII)? Something similar to Jellyfish, that I saw in a nice tutorial - but must be suitable for long, noisy reads. ...

aechchiki

2,706

asked Jul 23, 2018 at 7:19

7 votes

1 answer

295 views

How to calculate overall reference coverage with MUMmer?

Is the MUMmer suite capable of calculating reference sequence coverage statistics for all query sequences collectively? It would be possible to achieve by parsing the output of ...

bedeabc

asked Aug 3, 2017 at 9:29

7 votes

1 answer

380 views

wtdbg2: practical implications of k-mer fsize and psize choice

I am using wtdbg2 2.3 to assemble a human genome (sequenced on PromethION from a cell line). I filtered out reads with low average quality, and now I am trying to determine the parameters that will ...

Daniel Standage

5,090

asked Feb 14, 2019 at 21:07

6 votes

2 answers

54 views

How to display novel genome assemblies or uncommon genome assemblies using the UCSC Genome Browser?

I want to display E.coli BW25113 (GenBank: CP009273.1) strain in UCSC browser. This strain is not listed in http://microbes.ucsc.edu/ browser. How can I display E.coli BW25113 assembly in the browser?

Supertech

asked Feb 4, 2022 at 0:07

6 votes

1 answer

165 views

Verify a predicted protein in one genome in a different genome of the same species

I have two genome assemblies of the same non-model species, call them Assembly 1 (generated from Illumina data) and Assembly 2 (generated from PacBio data). For Assembly 1, I also have predicted ...

aechchiki

2,706

asked Nov 13, 2017 at 12:10

6 votes

2 answers

193 views

Genome assembly from error-prone reads

I understand how to assemble genome from error-free reads. I implemented like this: Construct directed overlap graph with reads as vertices and edges as maximum overlap between two vertices. ...

gagro

asked Oct 22, 2017 at 12:20

6 votes

1 answer

132 views

Is there a way to assemble contigs starting from a specific sequence?

My work involves searching for marker genes/fragments in metagenomic databases (like the Sequence Read Archive). Once I find these sequences, I would like to know more about the neighboring genomic ...

Laura

1,007

asked Apr 9, 2019 at 11:50

15 30 50 per page

2 3 4 5

…

11 Next

Stack Exchange Network

Questions tagged [assembly]

Why do some assemblers require an odd-length kmer for the construction of de Bruijn graphs?

How to deal with heterozygosity during polishing of genome assembly based on long reads?

How can I improve a long-read assembly with a repetitive genome?

How to make a distinction between the "classical" de Bruijn graph and the one described in NGS papers?

Improve a reference genome with sequencing data

Extract sequence context of high-degree nodes in assembly graphs

Pooling data in metagenome assembly

Is there a standard definition for "assembly polishing"?

estimate genome size: kmer-based approach from PacBio reads

How to calculate overall reference coverage with MUMmer?

wtdbg2: practical implications of k-mer fsize and psize choice

How to display novel genome assemblies or uncommon genome assemblies using the UCSC Genome Browser?

Verify a predicted protein in one genome in a different genome of the same species

Genome assembly from error-prone reads

Is there a way to assemble contigs starting from a specific sequence?

Hot Network Questions

Questions tagged [assembly]

Related Tags