Comments (3)
Sounds good. We are setting up nightly builds of pyarrow (using the conda-forge toolchain for builds) so that if you require a new function in between conda-forge releases (which are loosely tied to Apache releases) you aren't affected by that. I'll first try to reproduce the existing test case but using dynamically generated IPC messages. We should be able to remove the Flatbuffers files from libgdf also
cc @cpcloud
from cudf.
I'm adding some notes about the hardcoded binary data.
The existing hardcoded data are generated from mapd via pymapd's select_ipc_gpu
. The related source line in mapd are:
- schema writing: https://github.com/mapd/mapd-core/blob/fae2f9079fb3fa1f60200212443adaac752e3030/QueryEngine/ResultSetConversion.cpp#L418
- recordbatch writing: https://github.com/mapd/mapd-core/blob/fae2f9079fb3fa1f60200212443adaac752e3030/QueryEngine/ResultSetConversion.cpp#L458
Moving all the gpu serialization logic into arrow will allow us to simplify the tests here in pygdf, and to make it easier for all GOAI projects to implement the same serialization format.
from cudf.
Fixed by #200
from cudf.
Related Issues (20)
- [BUG]when setting dask.config.set({"dataframe.backend": "cudf"}), ddf.explode("col1") and apply customized function cannot work correctly anymore? HOT 5
- [FEA] Add support for `cudf.unique` HOT 1
- [QST] Should cudf.Series.unique return a ndarray or cudf.Series? HOT 2
- [FEA] Align cudf Python's `ColumnBase.__init__` subclass signatures with their base class and add construction validation
- [FEA] Support three-valued (Kleene) logic in any and all aggregations
- [FEA] Support order-preserving groupby option in cudf-polars
- [FEA] Support scan-based aggregations in cudf-polars
- [FEA] Support cross-casting to/from strings in cudf-polars
- [FEA] Full coverage of stringfunction methods in cudf polars
- [FEA] Full coverage of datetime methods in cudf-polars
- [FEA] Run polars test suite in cudf-polars CI, and keep track of pass rates
- [FEA] Accelerate cupy array creation from DataFrame.values HOT 4
- [FEA] Support batch construction of strings columns HOT 1
- [FEA] Have `cudf::make_empty_column(cudf::type_id::STRING)` return a column with a child column of empty offsets HOT 3
- [PERF] looping through dataframe is 100x slower than when running without cudf HOT 3
- [BUG] cuDF and Pandas return different results for ... HOT 4
- [FEA] HOT 1
- [FEA] Adjust libcudf to not load cuFile by default
- [FEA] Add support for `cudf.DataFrame.aggregate`
- [BUG] Series.value_counts hangs with over 1B rows of input HOT 6
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cudf.