Comments (13)
What you're trying to do is possible using the shard allocation awareness feature. It's not documented in our docs yet, but you can take a look at the elasticsearch documentation here http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/modules-cluster.html#allocation-awareness
We've it on our roadmap to work on cross-dc stuff. So it should pop up in our own docs eventually.
Hope that helps.
Update:
Just as an addition in case you don't know Elasticsearch. Crate uses Elasticsearch as a library and is responsible for various tasks. That also involves some parts of the shard/replication logic which is why the settings that are documented on the site I linked should work if you apply them in the crate.yml file.
from crate.
hi there, for people who have never worked with elasticsearch its not that easy, today i setup my first crate server, only one node... seems to work, i still habe problems[https://github.com/crate/crate-admin/issues/86] but anyway... could someone please provide a simple info how the replication has to be setup between 2 diffrent locations ? its not the part we are too lazy.. hours went away to setup all this... so please have some mercy and help me/us here with a simple config example, that would be great! thanks a lot....
from crate.
michabb: do you just having problems that your nodes won't join the same cluster? than maybe you need to configure unicast instead of the multicast (default) on each node, this blog post should help: https://crate.io/blog/using-crate-in-multinode-setup/
otherwise please give us some more hints about your wanted setup.
from crate.
private: hi sebastian (übrigens, wir sprachen auf der IPC14, ich hatte die idee/vorschlag mit dem mysql-plugin, vielleicht erinnerst du dich ? ;) ich hab also gleich mal euer crate getestet, aber derzeit nur eine node... eine zweite hatte ich noch nicht probiert, weil ich nirgendwo was in der doku dazu gefunden hatte, daher: vielen dank für den blog-link, ich werde es testen ;-)
public: thanks for the blog link, i will try ;)
from crate.
There is some wording in elasticsearch docs that say:
"Avoid clusters that span multiple data-centers, even if the data-centers are colocated in close proximity. Definitely avoid clusters that span large geographic distances."
Found at: http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/hardware.html#_network
It is pretty clear wording, do you know why they say this?
from crate.
Yes the reason they say this is the increased latency:
Elasticsearch clusters assume that all nodes are equal…not that half the nodes are actually 150ms distant in another datacenter. Larger latencies tend to exacerbate problems in distributed systems and make debugging and resolution more difficult.
This of course also applies to Crate.
from crate.
btw... i have a 2 node cluster right now, but everybody trying this should take care.. the default port for node 2 node communication is 4300, not 4200 ... this simple fact cost me half an hour g
from crate.
@michabbb
you are right, we should document the ports more prominent and clear. those ports are currently only covered in the configuration documentation found at https://crate.io/docs/stable/configuration.html?#ports
we will update the documentation
from crate.
there are already users using the routing allocation https://crate.io/docs/stable/configuration.html?#routing-allocation to implement a multi zone deployment on AWS.
it is possible with the forced allocation awareness to ensure that queries only touch nodes in the same zone. and also to make sure that replicas are distributed accross such zones.
we will create a document explaining this explicitly and update this issue when done.
from crate.
@weswam @ahto @michabbb
we've added a best practice document about multi zone setup / cross datacenter replication to our documentation.
you can find it here https://crate.io/docs/en/latest/best_practice/multi_zone_setup.html
from crate.
thanks
from crate.
Guess this can be closed since it is now documented properly.
from crate.
How to do this across 2 data centres for version 1.0.2?
I have a lab instance consisting 6 nodes across 2 zones as described in
https://crate.io/docs/crate/guide/en/latest/scaling/multi-zone-setup.html
When i start replication (number of replication =3 ) shard suppose 100 should start 400 shards .
But replication isnt working only 25% replicated data (i.e original copy with 100 shrads running)
Please advice
from crate.
Related Issues (20)
- Eligible filters are not pushed down if any are ineligible, resulting in none being pushed down HOT 3
- Implement position(substring in string)
- Unexpected result when using escape character in `LIKE` HOT 2
- Cast to geo_shape not working on generated column HOT 2
- NPE on `MIN` aggregation on a mixed-version cluster 5.8.0/5.7.3 HOT 1
- Alternating exceptions/results with geo_shape UDF and MATCH USING WITHIN
- Enable logging in ECS JSON format
- Allow users with `AL` privileges to manage session settings defaults for other users HOT 1
- `COUNT()` from a nested view should use `COUNT` operator instead of `COLLECT`
- StaticInformationSchemaQueryTest.testIsNotNull test failure HOT 2
- Success `statusCode` for failed bulk inserts with HTTP endpoint HOT 1
- Extra flags to provide further details on failed records on HTTP endpoint bulk insertion HOT 1
- Improve `ARRAY_LENGTH` performance for arrays within object arrays HOT 1
- Improve performance in HyperLogLogDistinctAggregation
- Invalid results when filtering with `ARRAY_LENGTH` on an array in array of objects HOT 2
- IndexOutOfBoundsException on query with ORDER BY in 3 nodes cluster HOT 3
- ShardInfo in NodeInfo MXBean is missing the schema name leading to duplicates HOT 4
- Table with dynamic column policy is silently failing first insert if it includes an empty array HOT 5
- Unexpected result when using empty `OBJECT` literal HOT 2
- Behaviour of numeric scale and casting inconsistent / unspecified HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from crate.