Comments (3)
also (def fkey (comp key first))
as a work-around
from tech.ml.dataset.
My workaround for the moment:
> (->> (ds/group-by-column (ds/->dataset [{:a 1 :b 1} {:a 2 :b 2}]) :a)
(into {})
ffirst)
1
from tech.ml.dataset.
I get it - the java.util.LinkedHashMap$Entry
isn't a clojure.lang.AMapEntry
.
Your workaround is a lot better because it uses less memory.
This is fine for now, will keep any eye out for other cases where it might be helpful to treat the map entries as two element vectors. If those arise can re-think if this is a good idea and how to implement.
from tech.ml.dataset.
Related Issues (20)
- Do `partition` and `partition-by` make any sense here in TMD? HOT 1
- left-join on longer datasets causes an error HOT 1
- CVE-2021-40531 on org.apache.datasketches/datasketches-java HOT 1
- left-join fails when options argument is nil HOT 2
- Documentation and the actual behavior of `select` do not match. HOT 2
- `ds/rows` produces something vector-of-maps-like that transit cannot handle HOT 7
- Arrow, writing nested types. HOT 5
- tribuo changes types between input dataset and prediction HOT 5
- tensor->dataset not working for 2-d arrays HOT 2
- dataset->categorical-maps does not work as documented
- make `invert-categorical-map` more strict on unknown reverse mapping values HOT 4
- add additional arrity for probability-distributions->label-column to specify result-data type
- strange cat map produced with multiple columns HOT 1
- not all comment lines are recognized as comments HOT 4
- Missing column when reading a parquet file HOT 5
- 'exact' type rolling window
- upgrade to org.tribuo 4.3.1
- support jsonl format for read HOT 1
- fastexcel - update documentation to new version of fastexcel-reader
- add conversion from tensor to/from org.tribuo.math.la.Matrix HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tech.ml.dataset.