Comments (5)
Thanks for the test set, there is definitely something wrong.
non-productive chains should always end up in the extra_chains
column. My first guess would be that there's something wrong how "T"/"F"
are interpreted.
After all
> bool("F")
True
from scirpy.
hmm but the airr schema says that boolean columns should be encoded as str
T
/F
according to https://github.com/airr-community/airr-standards/blob/master/docs/datarep/rearrangements.rst
Boolean values
Boolean values must be encoded as T for true and F for false.
from scirpy.
I think I found it:
Lines 482 to 485 in e954d17
When read_airr
gets a .tsv
file, it uses airr.read_rearrangement
, which correctly converts the types. However, when it gets a data.frame
it doesn't do any type conversion or validation.
Probably it would be best to rely on airr
to validate data frames as well. Unfortunately, the RearrangementReader
isn't designed in a way to easily cope with anything but a tsv
file.
So either I'll need a hacky workaround or dump the dataframe to a temporary tsv
file which isn't particularly efficient.
from scirpy.
Well not that hacky after all:
Lines 78 to 102 in 9e23474
Can you try if this behaves as expected?
#349
from scirpy.
yes amazing! the fix works!
from scirpy.
Related Issues (20)
- tl.define_clonotypes within_group parameter returns ValueError HOT 1
- Integrate TCRdist3 HOT 5
- Retrieving specific portions of the Immune Receptor beyond the junction (or CDR3). HOT 2
- ir_dist alignment stuck HOT 4
- IEDB database cdr3_aa stored as junction_aa HOT 10
- Unclear default value for the Hamming Distance cut-off HOT 1
- Dandelion interoperability
- Where has UMI count for AIR chains gone? HOT 1
- Large dataset tutorial
- Make sure axes of nextwork plots don't have any ticks
- Add the Morisita-Horn index for repertoire overlap similarity scores HOT 1
- Sorting logic in `index_chains()` HOT 3
- Community tutorial page
- ir.tl.ir_query fails with error 'ValueError: max_workers must be greater than 0' HOT 1
- ir.tl.clonotype_modularity - ValueError: Length of values does not match length of index HOT 2
- "read_10x_vdj" not loading data properly HOT 2
- clone definition purely using CDR3 sequence HOT 1
- Optimize TCRdist metric HOT 1
- When running 'ir.tl.define_clonotypes' on MacOS14.4.1, I've got an Error:module 'os' has no attribute 'sched_getaffinity' HOT 2
- TypeError: join() got an unexpected keyword argument 'validate' HOT 9
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from scirpy.