Questions tagged [assembly]
The process of reconstructing the original sequence from the reads generated from it during a sequencing experiment. Can refer to genome assembly, in which case the original sequence is a genome, or transcriptome assembly, in which case the original sequences are RNA transcripts.
152
questions
1
vote
1
answer
88
views
Which one is a more convenient assembly?
I have developed software for de novo genome assembly. Its performance varies gradually according to how much data you employ. At initial stages it often produces contigs that look like that when ...
1
vote
0
answers
18
views
Repeat analysis on eukaryotic assemblies
I have a hundred insect genomes and I'm looking for repeated regions along these assemblies.
First of all, I thought of using <...
1
vote
2
answers
794
views
Is it possible to filter contaminated reads for raw PacBio sequences (not HiFi reads) before assembly?
De novo genome assembly for non-model organisms faces the issue of bacterial contamination. For assembled contigs with mostly bacterial-like sequences (based on BLAST search), the entire contig can be ...
1
vote
1
answer
79
views
Is there a simple command for outputting tab-delimited columns?
I am working on a FASTA file, writing my command in nano on the command line and executing it with Python, also on the command line.
My objective is to get my command to provide me with a tab ...
0
votes
2
answers
2k
views
Creating a tab-delimited file
I am working on a project using a FASTA file. I am writing my command in nano on the command line and executing it with Python, also on the command line.
I would like my command to provide me with a ...
0
votes
1
answer
271
views
Is it possible to convert a BAM file from one genome assembly to another?
I have multiple BAM files, referenced to the UCSC genome assembly GRCh37/hg19, that were read in different time frames. Now I am planning different studies that require assembling all the data ...
3
votes
2
answers
278
views
Contamination on genome assembly
I had a question for the community.
I have a genome of a new species that has been sequenced with 150 bp Illumina paired-end reads.
To verify the quality of the assembly I used the ...
3
votes
1
answer
205
views
How to find all WGS assembly accessions for a species
Some background
Similar to the OP of https://www.biostars.org/p/377840/, I would like to programmatically BLAST a sequence to a local database of all WGS assemblies.
Since this isn't feasible for the ...
1
vote
1
answer
188
views
Interpreting contig alignments to a reference genome
I have applied two de novo genome assembly tools to data from the run SRR12707453, corresponding to a phage (I downloaded the data and have no relation with the authors of the study).
Using rnaSPAdes ...
1
vote
1
answer
998
views
Difference between paired-end, mate-pair and long read
I am writing here because I have some questions for you.
I wondered what the essential differences were between paired-end, ...
2
votes
1
answer
148
views
PacBio long-reads impact in transcriptome de novo assembly?
We are strongly interested in assembling a good reference transcriptome for a non-model organism and building a local database. We have sequenced the same individual with Illumina (150 millions of pair-...
2
votes
2
answers
335
views
Genome assembly of SRR12196449 with SPAdes
I am trying to assemble the run SRR12196449 with SPAdes. The description of their project is:
This project expected to standardize a method for amplification and
...
1
vote
2
answers
212
views
How can I get or create a reference genome for Bacteria?
I am a computer engineer and am currently trying to grasp some concepts of bioinformatics, particularly reference genomes and genomic variants. My aim is to find the effect of sequence features on variant ...
1
vote
1
answer
139
views
RCSB API: limited returned result
Is there a limit of query results on RCSB API?
Interestingly, no matter how I adjust my query criteria, only 10 assemblies are returned. Here is an example query JSON: (truncated from a longer query ...
0
votes
1
answer
137
views
Resources to learn genome assembly workflow for small genomes (like viruses)
I have sequencing data of a few samples of a DNA virus.
I'd like to learn de novo assembly of 'short read' data, to construct a scaffold and then count the abundance of each strain in the data.
I ...