Comments (4)
Btw, it is probably worth spec'ing out which parts of Materialize exist in which offerings. My sense is:
1. There is a very reasonable, non-durable, non-autoscaling, non-cluster-managed thing that each user could try out on their laptop / in a browser / etc., which is a simple binary download without GBs of Apache cruft, and which gets them up and running in minutes.
2. There is also a very reasonable durable, autoscaling, cluster-managed thing that wraps around this, that is worth paying for, and that serious people would agree they need, even though the above did not crash during their demo runs.
That's one hypothetical partitioning of value, but with something like that in mind we can more clearly determine which features need to go where. At the moment interactive is the closest thing to (1) above, but it isn't obvious (to me) that we want to add all of the features to it, both because they may make the initial experience more awkward, and because it may be "useful" to hold them back.
from materialize.
Currently, @benesch is working on an intermediate API, 'metastore'. The intention is for metastore to be backed by ZooKeeper for (2). But it's equally possible to have a shim 'zookeeper' that's just a linked list, which satisfies (1) with no additional Apache cruft.
ZooKeeper is such an attractive option because, if the primary streaming data ingest layer is Kafka, Kafka users typically have ZooKeeper running anyhow.
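To make the two-backend idea concrete, here is a minimal sketch of what such an interface could look like. Everything in it is an assumption for illustration: the `MetaStore` trait, its method names, and the `MemStore` type are hypothetical and are not the actual 'metastore' API. The in-memory shim uses a `BTreeMap` rather than a literal linked list, simply because it makes prefix listing easy; a ZooKeeper-backed type for (2) would implement the same trait.

```rust
use std::collections::BTreeMap;

/// Hypothetical minimal interface for a metadata store.
/// These names are illustrative, not the real `metastore` API.
pub trait MetaStore {
    /// Stores `value` under `key`, returning the previous value if any.
    fn put(&mut self, key: &str, value: Vec<u8>) -> Option<Vec<u8>>;
    /// Returns the value stored under `key`, if present.
    fn get(&self, key: &str) -> Option<&[u8]>;
    /// Removes `key`, returning the value that was stored, if any.
    fn delete(&mut self, key: &str) -> Option<Vec<u8>>;
    /// Lists all keys that begin with `prefix`, in sorted order.
    fn list(&self, prefix: &str) -> Vec<String>;
}

/// In-memory, non-durable backend for the laptop/demo offering (1).
/// All state is lost when the process exits.
#[derive(Default)]
pub struct MemStore {
    entries: BTreeMap<String, Vec<u8>>,
}

impl MetaStore for MemStore {
    fn put(&mut self, key: &str, value: Vec<u8>) -> Option<Vec<u8>> {
        self.entries.insert(key.to_string(), value)
    }

    fn get(&self, key: &str) -> Option<&[u8]> {
        self.entries.get(key).map(|v| v.as_slice())
    }

    fn delete(&mut self, key: &str) -> Option<Vec<u8>> {
        self.entries.remove(key)
    }

    fn list(&self, prefix: &str) -> Vec<String> {
        self.entries
            .keys()
            .filter(|k| k.starts_with(prefix))
            .cloned()
            .collect()
    }
}
```

With a split like this, the rest of the system only depends on the trait, so choosing between the shim and a durable backend becomes a packaging decision rather than a code change.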
Closing this out. I'm pretty happy with the end result, which uses ZooKeeper to store the metadata; the Chosen Node (worker node 0) then pushes metadata changes through a timely sequencer to give them a consistent ordering.
For posterity: getting ZooKeeper to expose a consistent stream of events is literally impossible. It's not built for that. I think it might be possible with etcd, but even if it is, it's definitely awkward. If we want that, we'd need to bundle a Raft implementation, and that seems like serious overkill. The solution of using ZooKeeper for persistence and timely's sequencer for ordering actually seems like the best option.
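The division of labor described here (a store for persistence, a single node for ordering) can be sketched without any dependencies. This is only an illustration of the idea, not timely's sequencer API and not the code that landed: `Change`, `Orderer`, and `Replica` are hypothetical names. The chosen node stamps each change with a sequence number, and every worker applies changes strictly in stamp order, buffering any that arrive early.

```rust
use std::collections::BTreeMap;

/// A change observed in the metadata store (hypothetical shape).
#[derive(Clone, Debug, PartialEq)]
pub struct Change {
    pub key: String,
    /// `None` means the key was deleted.
    pub value: Option<Vec<u8>>,
}

/// Runs only on the chosen node: assigns each change the next sequence number.
#[derive(Default)]
pub struct Orderer {
    next_seq: u64,
}

impl Orderer {
    pub fn stamp(&mut self, change: Change) -> (u64, Change) {
        let seq = self.next_seq;
        self.next_seq += 1;
        (seq, change)
    }
}

/// Runs on every worker: applies stamped changes in sequence order,
/// holding back any change whose predecessors have not arrived yet.
#[derive(Default)]
pub struct Replica {
    next_expected: u64,
    pending: BTreeMap<u64, Change>,
    pub applied: Vec<Change>,
}

impl Replica {
    pub fn receive(&mut self, seq: u64, change: Change) {
        self.pending.insert(seq, change);
        // Drain every change that is now contiguous with what was applied.
        while let Some(next) = self.pending.remove(&self.next_expected) {
            self.applied.push(next);
            self.next_expected += 1;
        }
    }
}
```

Because only one node hands out sequence numbers, two workers that receive the same stamped changes in different arrival orders still end up with identical `applied` logs, which is the consistency property the store alone does not provide.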