hyrise / hyrise-v1 Goto Github PK

HYRISE In-Memory Hybrid Storage Engine (archived, now developed in hyrise/hyrise repo)

Home Page: https://github.com/hyrise/hyrise

License: MIT License

Makefile 0.82% CSS 2.61% JavaScript 2.06% Python 3.98% Shell 0.27% C++ 87.74% C 0.69% Ruby 1.15% OpenEdge ABL 0.05% HTML 0.12% Perl 0.51%

hyrise-v1's Introduction

Welcome to Hyrise

Hyrise is a research in-memory database system that has been developed by HPI since 2009 and has been entirely rewritten in 2017. Our goal is to provide a clean and flexible platform for research in the area of in-memory data management. Its architecture allows us, our students, and other researchers to conduct experiments around new data management concepts. To enable realistic experiments, Hyrise features comprehensive SQL support and performs powerful query plan optimizations. Well-known benchmarks, such as TPC-H or TPC-DS, can be executed with a single command and without any preparation.

This readme file focuses on the technical aspects of the repository. For more background on our research and for a list of publications, please visit the Hyrise project page.

You can still find the (archived) previous version of Hyrise on Github.

Citation

When referencing this version of Hyrise, please use the following bibtex entry:

(click to expand)

@inproceedings{DBLP:conf/edbt/DreselerK0KUP19,
  author    = {Markus Dreseler and
               Jan Kossmann and
               Martin Boissier and
               Stefan Klauck and
               Matthias Uflacker and
               Hasso Plattner},
  editor    = {Melanie Herschel and
               Helena Galhardas and
               Berthold Reinwald and
               Irini Fundulaki and
               Carsten Binnig and
               Zoi Kaoudi},
  title     = {Hyrise Re-engineered: An Extensible Database System for Research in
               Relational In-Memory Data Management},
  booktitle = {Advances in Database Technology - 22nd International Conference on
               Extending Database Technology, {EDBT} 2019, Lisbon, Portugal, March
               26-29, 2019},
  pages     = {313--324},
  publisher = {OpenProceedings.org},
  year      = {2019},
  url       = {https://doi.org/10.5441/002/edbt.2019.28},
  doi       = {10.5441/002/edbt.2019.28},
  timestamp = {Mon, 18 Mar 2019 16:09:00 +0100},
  biburl    = {https://dblp.org/rec/conf/edbt/DreselerK0KUP19.bib},
  bibsource = {dblp computer science bibliography, https://dblp.org}
}

Supported Systems

Hyrise is developed for Linux (preferrably the most current Ubuntu version) and optimized to run on server hardware. We support Mac to facilitate the local development of Hyrise, but do not recommend it for benchmarking.

Supported Benchmarks

We support a number of benchmarks out of the box. This makes it easy to generate performance numbers without having to set up the data generation, loading CSVs, and finding a query runner. You can run them using the ./hyriseBenchmark* binaries.

Note that the query plans are generated in our CI pipeline with possibly many stages in parallel and different CI runs might be executed on different machines. Reported runtimes are not to be taken as solid benchmark performance numbers.

Benchmark	Notes
TPC-DS	Query Plans
TPC-H	Query Plans
Join Order	Query Plans
Star Schema	Query Plans
JCC-H	Call the hyriseBenchmarkTPCH binary with the -j flag.
TPC-C	In development, no proper optimization done yet

Getting started

Have a look at our contributor guidelines.

You can find definitions of most of the terms and abbreviations used in the code in the glossary. If you cannot find something that you are looking for, feel free to open an issue.

The Step by Step Guide is a good starting point to get to know Hyrise.

Native Setup

You can install the dependencies on your own or use the install_dependencies.sh script (recommended) which installs all of the therein listed dependencies and submodules. The install script was tested under macOS Monterey (12.4) and Ubuntu 22.04.

See dependencies for a detailed list of dependencies to use with brew install or apt-get install, depending on your platform. As compilers, we generally use recent versions of clang and gcc (Linux only). Please make sure that the system compiler points to the most recent version or use cmake (see below) accordingly. Older versions may work, but are neither tested nor supported.

Setup using Docker

If you want to create a Docker-based development environment using CLion, head over to our dedicated tutorial.

Otherwise, to get all dependencies of Hyrise into a Docker image, run

docker build -t hyrise .

You can start the container via

docker run -it hyrise

Inside the container, you can then checkout Hyrise and run ./install_dependencies.sh to download the required submodules.

Building and Tooling

It is highly recommended to perform out-of-source builds, i.e., creating a separate directory for the build. Advisable names for this directory would be cmake-build-{debug,release}, depending on the build type. Within this directory call cmake .. to configure the build. By default, we use very strict compiler flags (beyond -Wextra, including -Werror). If you use one of the officially supported environments, this should not be an issue. If you simply want to test Hyrise on a different system and run into issues, you can call cmake -DHYRISE_RELAXED_BUILD=On .., which will disable these strict checks. Subsequent calls to CMake, e.g., when adding files to the build will not be necessary, the generated Makefiles will take care of that.

Compiler choice

CMake will default to your system's default compiler. To use a different one, call cmake -DCMAKE_C_COMPILER=clang -DCMAKE_CXX_COMPILER=clang++ .. in a clean build directory. See dependencies for supported compiler versions.

Unity Builds

Starting with cmake 3.16, you can use -DCMAKE_UNITY_BUILD=On to perform unity builds. For a complete (re-)build or when multiple files have to be rebuilt, these are usually faster, as the relative cost of starting a compiler process and loading the most common headers is reduced. However, this only makes sense for debug builds. See our blog post on reducing the compilation time for details.

ccache

For development, you may want to use ccache, which reduces the time needed for recompiles significantly. Especially when switching branches, this can reduce the time to recompile from several minutes to one or less. On the downside, we have seen random build failures on our CI server, which is why we do not recommend ccache anymore but merely list it as an option. To use ccache, add -DCMAKE_CXX_COMPILER_LAUNCHER=ccache to your cmake call. You will need to adjust some ccache settings either in your environment variables or in your ccache config so that ccache can handle the precompiled headers. On our CI server, this worked for us: CCACHE_SLOPPINESS=file_macro,pch_defines,time_macros CCACHE_DEPEND=1.

Build

Simply call make -j*, where * denotes the number of threads to use.

Usually debug binaries are created. To configure a build directory for a release build make sure it is empty and call CMake like cmake -DCMAKE_BUILD_TYPE=Release

Lint

./scripts/lint.sh (Google's cpplint is used for the database code. In addition, we use flake8 for linting the Python scripts under /scripts.)

Format

./scripts/format.sh (clang-format is used for the database code. We use black for formatting the Python scripts under /scripts.)

Test

Calling make hyriseTest from the build directory builds all available tests. The binary can be executed with ./<YourBuildDirectory>/hyriseTest. Subsets of all available tests can be selected via --gtest_filter=.

Coverage

./scripts/coverage.sh will print a summary to the command line and create detailed html reports at ./coverage/index.html

Requires clang on macOS and Linux.

Address/UndefinedBehavior Sanitizers

cmake -DENABLE_ADDR_UB_LEAK_SANITIZATION=ON will generate Makefiles with AddressSanitizer, LeakSanitizer, and Undefined Behavior options. Compile and run them as normal - if any issues are detected, they will be printed to the console. It will fail on the first detected error and will print a summary. To convert addresses to actual source code locations, make sure llvm-symbolizer is installed (included in the llvm package) and is available in $PATH. To specify a custom location for the symbolizer, set $ASAN_SYMBOLIZER_PATH to the path of the executable. This seems to work out of the box on macOS - if not, make sure to have llvm installed. The binary can be executed with LSAN_OPTIONS=suppressions=asan-ignore.txt ./<YourBuildDirectory>/hyriseTest.

cmake -DENABLE_THREAD_SANITIZATION=ON will work as above but with the ThreadSanitizer. Some sanitizers are mutually exclusive, which is why we use two configurations for this.

Compile Times

When trying to optimize the time spent building the project, it is often helpful to have an idea how much time is spent where. scripts/compile_time.sh helps with that. Get usage instructions by running it without any arguments.

Maintainers

Martin Boissier
Daniel Lindner
Marcel Weisgut

Contact: [email protected]

Maintainers Emeriti

Markus Dreseler
Stefan Halfpap
Jan Kossmann

Contributors

Yannick Bäumer
Lawrence Benson
Jasper Blum
Lukas Budach
Timo Djürken
Alexander Dubrawski
Fabian Dumke
Leonard Geier
Richard Ebeling
Fabian Engel
Ben-Noah Engelhaupt
Moritz Eyssen
Martin Fischer
Christian Flach
Pedro Flemming
Mathias Flüggen
Johannes Frohnhofen
Pascal Führlich
Carl Gödecken
Adrian Holfter
Theresa Hradilak
Ben Hurdelhey
Sven Ihde
Ivan Illic
Jonathan Janetzki
Michael Janke
Max Jendruk
Tobias Jordan
David Justen
Youri Kaminsky
Marvin Keller
Mirko Krause
Eva Krebs
Henok Lachmann
Sven Lehmann
Till Lehmann
Tom Lichtenstein
Alexander Löser
Jan Mattfeld
Arne Mayer
Dominik Meier
Julian Menzler
Torben Meyer
Leander Neiß
Vincent Rahn
Hendrik Rätz
Niklas Riekenbrauck
Alexander Riese
Marc Rosenau
Johannes Schneider
David Schumann
Simon Siegert
Arthur Silber
Furkan Simsek
Toni Stachewicz
Daniel Stolpe
Jonathan Striebel
Nils Thamm
Hendrik Tjabben
Justin Trautmann
Carsten Walther
Leo Wendt
Lukas Wenzel
Fabian Wiebe
Tim Zimmermann

hyrise-v1's People

Contributors

Stargazers

Watchers

Forkers

grundprinzip bastih timbokopter jwust kateyy jlumqz martinfaust mrks irruputuncu lanice ollixy cfrahnow dgimb89 vandsaini came mtin brandlukas jmarten aaronelmore yousraabdullah nevermatch aspi92 mindis dukeharris charsyam thommyh ypsitau k34n3 sungsoo anukat2015 hanumathrao b-xiang quotfyproperty siddta gabrielcc2 mjendruk weiminzhang mingtu evilmcjerkface a6802739 nannancy pombredanne seanpm2001

hyrise-v1's Issues

Inconsistent State in MVCC

In the current implementation, MVCC allows for inconsistent reads. For a record deleted by another transaction, it is impossible to determine whether it would have been valid or not in the context of the current transaction. Example:

T1 starts with last_CID=6, does things
T2 starts, inserts record A, commits with CID=7
T3 starts, deletes record A, commits with CID=8
T1 reads A with valid=0 and CID (8) > last_CID (6) ==> current implementation assumes that value was valid for T1's read

Radix Join and PCs

Currently the RadixJoin does not work on PointerCalculator objects since the class does not implement the getAttributeVectors() interface. Even if it would it would break since the positions inside the vector might not be the same as the input positions.

Possible solutions: If the input is a pointer calculator, rewrite the positions to match the input vector and extend the handling to multiple horizontal partitions.

This is related to #18

Duplicate inputs to operators are currently removed

This makes writing Operators that expect identical data inputs twice (such as self joins) a pain.

Anyone knows a case where it makes sense to filter inputs for duplicates? Pl let me know, otherwise, this filter will be removed.

Add caveats section to readme

It might be nice to add a caveat section on the intended audience and what someone can expect from this chunk of code.

Assign unique table ids for logging

For logging, we need to identify to which table a certain entry was written. This could be a unique table id.

We need to discuss the scope of such an ID. Is it a Store or a Table? What information do we need to non-ambiguously identify where a logged value id belongs?
In the following, "table" means the logical construct, not the Table class
Table IDs must be unique over restarts of Hyrise. When I create the table "customers" and it gets the ID 5, it should have 5 when I restart Hyrise.
For this, we also need to save meta information (column [names], ...) about the table
We only need to log data in the delta - the main is persisted using snapshots
It would be great to have small table ids, not GUIDs
The table id shall be stored close to the data (in the Store class?). We should not need to go to the StorageManager every time we want to log something

Fails on clang++ 3.2

Fails in AbstractCoreBoundTask::launchThread when taking address of executeTask with "pure virtual function called".

Barrier has weird semantics

Well Barrier is kind of weird, because it uses the length n of _field_definition to forward the first n elements, regardless of actually set values in _field_definition.

We should make sure that its semantics are basically output == input.

Simplify Table Loading

There should be a way to simply access the tables stored on disk without really need to specify always the load operation explicitly.

setTableName in CreateIndex is misleading

Should be "setIndexName" or similar.

Add AbstractResource concept & unify operations data handling

Implement a base class AbstractResource that is used to transport resource between different operations in a plan.

This should replace multiple lists of resource types in OperationData

Fully namespace everything in src/lib or completely remove namespaces

The current half/half situation doesn't really help anyone.

Github Pages and Documentation

Can we use github pages to store the recent version of the documenation. I think this should be possible...

StorageManager distinction between tables and indexes is awkward

Move towards a storage manager that only stores AbstractResource objects with a name ready for retrieval. (Also remove the loading shortcuts in StorageManager while we are at it).

SegFault enhancement mit backtrace und stackframes

Remove getSlice/getSliceWidth and adjacent methods from AbstractTable

Partitioning of tables for parallel execution

Distribute method in PlanOperation returns first and last as array positions (0 and 999 for a table with 1000 elements) -> following iterator-style you would expect last to work as an exit condition for iterating over input table.

Add HAVING operator

RFC: Improve logging to explicitly distinguish hardware counters and make the results clearer

{
    /*... other stuff ... */
    "performance_data": {
        /* ... other operands, too */
        "operandId": {
            "timing": { /* all these values are explicit not measured through hardware counters */
                "start": 10 /*ns since query start*/ ,
                "stop": 20 /*ns since query start*/ ,
                "start_epoch": 100000000123 /*ns since epoch*/ ,
                "stop_epoch": 100000000133 /*ns since epoch*/ ,
            },
            "counters": {
                /* these values are explicitly measured through hardware counters */
                "PAPI_TOT_INS": 10,
                "PAPI_TOT_CYC": 20,
                "PAPI_L2_TCM": 30
            },
            "input": {
                tables: []
            },
            "output": {
                /* same as input */
            },
            "custom": {
                /* op may emit whatever JSON it deems sensible */
            }
        }
    }
}

in request, we specify what we want
for logging

{ "operations": [ /*...*/ ], "logging" = ["papi", "input", "output", "timing"] }

Issues with this proposal: "counters" would currently only count what happens in executePlanOperation

Logfile Management (DESIGN DISCUSSION)

Currently, all transactions log into a single logfile, which consequently grows over time and never gets deleted or truncated. Also, once a table is merged, its previous log entries must be removed/invalidated to avoid redo recovery in case of failure.

Solution 1: One logfile per table. Problems: increased logging overhead, commits must either be written into all logfiles or in a separate commit log.

Solution 2: One logfile for all tables, checkpoint entries when table is merged (i.e. "ignore previous log entries for table X"), new logfile after merge. When all tables have been merged once, the first logfile can be deleted, the second after all tables were merged twice and so on. Problems: Might take a while until all tables are merged.

Solution 3: One logfile for all tables, checkpoint entries when table is merged (i.e. "ignore previous log entries for table X"), new logfile after merge. Separate worker reads old logfiles on every table merge, removes entries and writes truncated log back. Problems: Potentially costly operation, IO overhead

Binutils-dev missing in vagrant/chef setup

I ran vagrant up. On the virtual box I tried to compile Hyrise, but the binutils-dev package is missing (the -lbfd flag fails in the linker). A simple apt-get install binutils-dev rectified the situation.

TransactionManager: Aborting transaction does not clean up rows that have been committed

The commit implementation allows Store->updateCommitId to fail in some way, but does not clean up any changes that have been made.

MergeJoin does not merge correctly

Entries in the right-hand table only appear once in the result table, namely for the first match. Instead, they should appear for every matching entry in the left-hand table.

Cleanup pointer calculator design

Replace pointer based fields and pos_list with non-pointer members
Replace const pos_list * with const pos_list&

Increase ease of use of PointerCalculator.

How to handle Pointer Calculator and Tables in Operators?

prototype for typeswitches

Improve Papi Initialization

PAPI initialization takes a while for small ops. Maybe only initialize once per thread.

Extract taskscheduler unit tests into own binary

UUID assignment unnecessarily broad and slow!

The generic assignment of UUIDs to all containers through placement in the AbstractTable places an unnecessary burden on the overall system.

tests before uuid: make test 3.34s user 1.19s system 77% cpu 5.843 total
tests after uuid: make test 8.84s user 4.61s system 103% cpu 12.975 total

There is no reason that every single AbstractTable needs a UUID. It might even be debatable if every Store needs a UUID.

Potential fixes:

replace with cheaper ID-mechanism (atomic is probably faster than drawing a new number out of a mersenne_twister RNG)
lazily generate when needed
move to stores(?)
replace with just using the pointer casted to size_t - the pointer will be unique for a given cycle of server-start to next server start

A way to write self-contained JSON tests

A problem with sharing error cases is that you also need to share a file or a database connection to be able to run the test.

If instead, we could load a table from JSON only, this would reduce the work needed to reproduce bugs.

So either enhance the existing string loader with one that can also provide a table, or provide JSON-based loaders, that allows for loading from the operation parameters ie:

  {
    "type": "JsonTable",
    "table": { "header" : ["a", "b"],
                  "types" : ["INT", "STRING"],
                 /*optional*/ partitions: [ "1_R", "2_R" ],
                  "data" : [ [ 1, "USA" ], [ 2, "GERMANY" ] ] }
  }

JoinType being ignored in JoinScan

Setting _join_type in JoinScan does not have any effect on the join being performed

Store: Delta resize is not thread-safe

While only one thread may resize the delta, other threads working on delta at the time of resizing may end up working on deallocated memory, since the underlying vectors may change during a reserve.

Correct Dictionary cannot be found in Horizontal table, if tables with diff dictionaries are unified

when getvalue searches the right dictionary, column AND row needs to be taken into account

TaskSchedulerAdjustment, ThreadpoolAdjustment, SettingsOperation redundant

The Plan Operations TaskSchedulerAdjustment and ThreadpoolAdjustment do exactly the same, SettingsOperation also does the same but leaves out one step. Therefore it seems that only one of these three is actually necessary and the other two should be removed

Remove template-based Allocators in favor of polymorphic allocators.

Currently, the facilities surrounding template-based allocators are completely underused and are not actually necessary. Instead, it would be good to use polymorphic allocators similar to what is implemented in bsl https://github.com/bloomberg/bsl/wiki/BDE-Allocator-model and will be part of the c++14 standard.

Make Commit Parallel

Currently only one TX can be committed at a time, this should be improved :)

Remove vname from operators

Since 6576e8d, much boilerplate isn't necessary any longer when implementing operators.

Thus, we should remove all name/vname methods and use registerPlanOperation<Type>("name"); instead.

Add Meta Information about currently loaded Tables

There should be a meta table that shows which tables are there, to follow the SQL spec this should as well contain information about the columns etc as well.

Something like information_schema etc from MySQL & Co.

Update Documentation

The current status of the documentation is pre-c++11 and our shared_ptr usage, so we should find some time to update it.

Remove TPCCHQ1Scan PlanOp?

The TPCCHQ1Scan Plan Operation seems unnecessary and can potentially be removed unless required for special purposes

Radix Join Hashing for non-int columns

Radix Join should work on non int columns as well. The RadixCluster plan operation has to be adapted templated to work independent of the dictionary type.

Create tests for limit/offset parameters in ResponseTask

Enhance Expression Support

Currently only rudimentary expression support is available for HYRISE. It should be extended to support.

Expressions: mod( (a * b), 1000)
Expressions: like - string matching
Expressions: exists with subselect
Expressions: substr()
Expressions: ascii()
Expressions: extract (year from data)

@JWUST any hints on that one?

Implement client-side driven transactions

Instead of implicitly assuming every request encloses a transaction, a transaction may last for several requests. Thus, we need:

BeginTransaction Operator -> returns txid (maybe even full context?)
Let queries run in a user specified transaction context

segfault in MysqlTableloader during parallel execution

Add support for "IN subquery" predicates

HYRISE should be able to support filter predicates based on subqueries. Something like

SELECT * FROM table where attr in (SELECT id FROM othertab);

hyrise / hyrise-v1 Goto Github PK

hyrise-v1's Introduction

Welcome to Hyrise

Citation

Supported Systems

Supported Benchmarks

Getting started

Native Setup

Setup using Docker

Building and Tooling

Compiler choice

Unity Builds

ccache

Build

Lint

Format

Test

Coverage

Address/UndefinedBehavior Sanitizers

Compile Times

Maintainers

Maintainers Emeriti

Contributors

hyrise-v1's People

Contributors

Stargazers

Watchers

Forkers

hyrise-v1's Issues

Recommend Projects

Recommend Topics

Recommend Org