Comments (3)
Should I assume the same as with when creating a tar?
ClickHouse/src/IO/Archives/TarArchiveWriter.cpp
Lines 11 to 18 in fe92c92
Then we can also compress when archiving into a zip but what compression methods are supported?
ClickHouse/src/IO/Archives/ZipArchiveWriter.cpp
Lines 344 to 362 in fe92c92
You can read the code located in this folder yourself :) There are not so many lines written there.
So, you can either determine the compression method by a file extension or force the archive "algorithm" to use a specific codec internally if it is supported.
from clickhouse.
I briefly looked into the code and compression_method
is only used when ClickHouse writes a backup in an archive. Moreover, data is already compressed well (columnar format, etc), so I'm not sure if it can improve things significantly.
What are all the options in terms of file extension names and what exactly do they do?
They are used to determine which archive algorithm to use. The whole list of possible extensions can be found here:
ClickHouse/src/IO/Archives/createArchiveWriter.cpp
Lines 29 to 30 in fe92c92
from clickhouse.
Thanks for taking a look at this for me.
So if I understand correctly, I can either make a tar, a zip or neither.
If I create a tar then I can compress with various different methods, based on the file extensions the options are:
- zstandard
- lzma
- xz
- bzip2
- gunzip
Then we can also compress when archiving into a zip but what compression methods are supported? Should I assume the same as with when creating a tar?
from clickhouse.
Related Issues (20)
- Flaky test `02922_deduplication_with_zero_copy.sh` after 2024-06-18 HOT 10
- parameters not successfully substituted within a window function
- Test http_external_tables_memory_tracking is flaky
- Parts created on older version have different move_ttl_info.expression and are not moved to disk on newer CH version HOT 1
- BACKUP/RESTORE ... SETTINGS don't allow settings which not related to BACKUP/RESTORE command
- Protection from the same server UUID on cluster (can lead to 'Session was killed' and other issues)
- Unexpected JSON output for zero values
- The query result of the distributed table groupBitmapOr is inconsistent with the expected result HOT 3
- ASSUME CONSTRAINT not optimizing queries with MATERIALIZED COLUMNS HOT 5
- CTE query may produce unexpected result.
- `bitShiftLeft` may produce unexpected result.
- `bitTest` may produce unexpected result. HOT 2
- Order of checking max_concurrent_queries_* settings
- Insertion into distributed table causes Segmentation fault HOT 2
- ClickHouse always prints `<jemalloc>: Number of CPUs detected is not deterministic. Per-CPU arena disabled.` on t2.micro machines on AWS HOT 1
- Format `One` should not read files. HOT 1
- Flaky `test_checking_s3_blobs_paranoid/test.py::test_when_s3_broken_pipe_at_upload_is_retried` HOT 2
- DEFAULT_MARK_CACHE_MAX_SIZE = 5368_MiB HOT 1
- NOT_FOUND_COLUMN_IN_BLOCK exception on merge deduplicate propagated into projection
- Cleanup passwords from inside the query in the command line parameter.
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from clickhouse.