Comments (4)
You're very welcomed Artyom.
There are limits to what can be done in parallel.
Basically, compressing / decompressing a block involves 2 parts :
- Match Copy
- Entropy Coding
For the time being, entropy coding is completely independent within each block (this could change in the future). That means it's possible to encode / decode this stage within multiple blocks in parallel, store the result, and make it available to the next stage.
Unfortunately, Match Copy is not so kind. Especially during the decompression stage, decompressing a block requires access to previous blocks, which must therefore be already decoded. Serial in nature.
Going around this limitation would require independent blocks.
This is a valid use case btw : in many data structures, random access is important, so blocks are independent and tiny.
Small independent blocks hurt compression badly though. A mitigation solution is to use a shared dictionary, a kind of "static prefix" which is going to be used for all blocks. All block operations remain independent, and thus can be parallelized.
As one can guess, it's a pretty advanced topic. I believe a bit more work is required to get there.
In the meantime, if you are interested in challenging your CUDA skills, I can only invite you to try compressing / decompressing small independent blocks. It's going to be pretty difficult already, and will require you to get a good understanding of zstd internals.
from zstd.
When I was doing audio DSP programming, I found CUDA only helped in some scenarios, when processing savings was worth the data transfer costs. Usually I found SSE optimizations easier & less friction.
from zstd.
Should we keep this issue opened ?
Any new item in the foreseeable future ?
from zstd.
Close this issue, please. It is impossible to integrate cuda support for the foreseeable future...
from zstd.
Related Issues (20)
- erro
- Add library and cli flags for file format with embedded dictionary
- Question about ZSTD protocole HOT 2
- Building on MacOS 13 and targeting MacOS 11 and SDK 11.3 (or any other MacOS version) does not work HOT 2
- Integrating the library with an external thread pool HOT 2
- Is it safe to move compression and decompression contexts between threads? HOT 1
- ZDICT_trainFromBuffer_cover is not thread safe HOT 17
- zstd compression output differens with the same options between 1.5.5 and 1.5.6 HOT 5
- Warning message for `zstd -v --train` is missing line breaks
- How to accelerate the process of dictionary training in zstd? HOT 5
- tests/cli-tests/cltools/zstdless.sh fails with newer version of less HOT 3
- Please promote thread pools from experimental to stable HOT 1
- The CMake build script breaks check_ipo_supported
- Dynamic decompression HOT 3
- Change `dictionary_compression.c` example to use API for dictionary creation
- Enable weak symbol support for Risc-V? HOT 1
- Possibly missing check for truncated initial states in Huffman weight block HOT 4
- Poor compressor behavior on interleaved data HOT 2
- zstd 1.5.5+ has worse performance on Graviton2 nodes than v1.4.4 HOT 4
- [Not a bug] Dictionary building strategy HOT 7
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from zstd.