Comments (3)
Hi,
Luckily, I was able to reproduce the error quickly because I already had this dataset on hand. Thanks for letting us know. We will try to have a fix ASAP.
I fixed one bug related to this (a simple parsing error), but there still seems to be a bug related to the alignment ratio. The reads are indeed much better (longer and higher quality) than before: 100% of the reads map and the average read lengths are much longer. This is causing problems when parsing the MAP_align_ratio file that is created.
@cheny19 I'm now convinced that our pre-set bin sizes for read-length ranges are not a good idea. All the reads in the align_ratio file are migrating to the last bin (because the reads are so long), leaving every other bin empty and causing errors when this file is loaded. I can change the bins to scale to the data, but I don't know what effect this will have on the modeling.
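To illustrate the failure mode described above, here is a minimal sketch of fixed-edge binning. The edge values and the `count_per_bin` helper are hypothetical, chosen only for illustration; they are not NanoSim's actual bins or code.

```python
from bisect import bisect_right

# Hypothetical pre-set bin edges in bp (assumption, not NanoSim's real values).
# Bin i covers [edges[i], edges[i + 1]); the last bin is open-ended.
FIXED_EDGES = [0, 1000, 2000, 5000, 10000]


def count_per_bin(read_lengths, edges):
    """Return the number of reads that fall into each fixed bin."""
    counts = [0] * len(edges)
    for length in read_lengths:
        # Index of the last edge that is <= length.
        idx = bisect_right(edges, length) - 1
        counts[idx] += 1
    return counts


# Very long reads all land in the final, open-ended bin.
long_reads = [12000, 15000, 23000, 41000, 18000]
print(count_per_bin(long_reads, FIXED_EDGES))
```

With every read longer than the largest edge, all counts except the last are zero, which is the situation that breaks any downstream step expecting data in each bin.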
Hi Mike,
Sorry I didn't get back to you earlier as I was on vacation.
The binning problem is fixed now. Each bin now holds an equal number of reads, and the user can specify the number of bins they need. The default number of bins is 20.
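A rough sketch of what equal-count binning can look like is below. This is an illustration under assumptions, not the code in NanoSim: the function name `equal_count_bins` and its handling of edge cases are made up here, and only the default of 20 bins comes from this thread.

```python
def equal_count_bins(read_lengths, num_bins=20):
    """Split read lengths into num_bins groups of (nearly) equal size.

    Reads are sorted by length and cut into consecutive slices, so the
    bin boundaries follow the data instead of being fixed in advance.
    """
    ordered = sorted(read_lengths)
    n = len(ordered)
    # Never ask for more bins than reads, so that no bin is empty.
    num_bins = min(num_bins, n)
    bins = []
    for i in range(num_bins):
        start = i * n // num_bins
        end = (i + 1) * n // num_bins
        bins.append(ordered[start:end])
    return bins


# 100 long reads split into the default 20 bins, 5 reads per bin.
lengths = list(range(10000, 10100))
bins = equal_count_bins(lengths)
print(len(bins), [len(b) for b in bins][:3])
```

Because the cut points are taken from the sorted data, no bin is left empty regardless of how long the reads are. One caveat of this simple version is that reads of identical length can end up on both sides of a boundary.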
I still need to test and find out the best default value that works for most of the datasets, but right now, this version is ready to use. I tested it on your dataset and it works pretty well.
Let me know if you have further concerns.
That works fine now, thanks!