Cross Data Center Replication (crate, closed)

crate commented on September 24, 2024
Cross Data Center Replication

from crate.

Comments (13)

mfussenegger commented on September 24, 2024

What you're trying to do is possible using the shard allocation awareness feature. It's not covered in our docs yet, but you can take a look at the Elasticsearch documentation here: http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/modules-cluster.html#allocation-awareness

We have cross-DC work on our roadmap, so it should show up in our own docs eventually.

Hope that helps.

Update:

Just as an addition in case you don't know Elasticsearch: Crate uses Elasticsearch as a library, and Elasticsearch is responsible for various tasks, including parts of the shard/replication logic. That is why the settings documented on the page I linked should work if you apply them in the crate.yml file.
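
A minimal sketch of what such settings could look like in crate.yml, assuming two data centers. The attribute name `zone` and the labels `dc1`/`dc2` are placeholders, not taken from this thread; the setting names follow the Elasticsearch allocation awareness documentation linked above and should be checked against the Crate version in use.

```yaml
# crate.yml on every node in the first data center (label "dc1" is hypothetical)
node.zone: dc1

# Same value on all nodes: take the "zone" attribute into account when
# placing primary shards and their replicas.
cluster.routing.allocation.awareness.attributes: zone
```

Nodes in the second data center would set `node.zone: dc2` and keep the second setting unchanged.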

michabbb commented on September 24, 2024

Hi there. For people who have never worked with Elasticsearch this is not that easy. Today I set up my first Crate server, only one node. It seems to work; I still have problems [https://github.com/crate/crate-admin/issues/86], but anyway: could someone please explain in simple terms how replication has to be set up between 2 different locations? It's not that we are too lazy; hours went into setting all this up. So please have some mercy and help me/us here with a simple config example, that would be great! Thanks a lot.

seut commented on September 24, 2024

michabbb: is the problem just that your nodes won't join the same cluster? Then maybe you need to configure unicast instead of multicast (the default) on each node. This blog post should help: https://crate.io/blog/using-crate-in-multinode-setup/
Otherwise, please give us some more details about the setup you want.
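
For reference, a unicast setup might look roughly like this, assuming the Elasticsearch zen discovery settings of that era apply unchanged in crate.yml. The host names are placeholders, and the port is the node-to-node transport port, which defaults to 4300.

```yaml
# crate.yml on each node; host names below are hypothetical
discovery.zen.ping.multicast.enabled: false
discovery.zen.ping.unicast.hosts:
  - node1.example.com:4300
  - node2.example.com:4300
```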

michabbb commented on September 24, 2024

private: Hi Sebastian (by the way, we talked at IPC14; I had the idea/suggestion about the MySQL plugin, maybe you remember? ;) So I tested your Crate right away, but only with one node so far. I hadn't tried a second one yet because I couldn't find anything about it anywhere in the docs, so: many thanks for the blog link, I will test it ;-)

public: Thanks for the blog link, I will try it ;)

ahto commented on September 24, 2024

There is some wording in the Elasticsearch docs that says:

"Avoid clusters that span multiple data-centers, even if the data-centers are colocated in close proximity. Definitely avoid clusters that span large geographic distances."

Found at: http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/hardware.html#_network

That is pretty clear wording. Do you know why they say this?

mfussenegger commented on September 24, 2024

Yes, the reason they say this is the increased latency:

Elasticsearch clusters assume that all nodes are equal…not that half the nodes are actually 150ms distant in another datacenter. Larger latencies tend to exacerbate problems in distributed systems and make debugging and resolution more difficult.

This of course also applies to Crate.

michabbb commented on September 24, 2024

By the way, I have a 2-node cluster running now, but everybody trying this should take care: the default port for node-to-node communication is 4300, not 4200. This simple fact cost me half an hour *g*

dobe commented on September 24, 2024

@michabbb
You are right, we should document the ports more prominently and clearly. Those ports are currently only covered in the configuration documentation found at https://crate.io/docs/stable/configuration.html?#ports

We will update the documentation.

dobe commented on September 24, 2024

There are already users using the routing allocation settings (https://crate.io/docs/stable/configuration.html?#routing-allocation) to implement a multi-zone deployment on AWS.

With forced allocation awareness it is possible to ensure that queries only touch nodes in the same zone, and also to make sure that replicas are distributed across such zones.

We will create a document explaining this explicitly and update this issue when done.
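
Until that document exists, here is a rough sketch of forced allocation awareness in crate.yml. The setting names are the Elasticsearch ones and are assumed to carry over to Crate; the zone names are hypothetical examples, not values from this thread.

```yaml
# crate.yml, per node: the zone this node runs in (hypothetical AWS zone name)
node.zone: us-east-1a

# crate.yml, same on all nodes
cluster.routing.allocation.awareness.attributes: zone
# List every zone up front, so that replicas stay unassigned rather than
# piling up in a single zone while the other zone is unavailable.
cluster.routing.allocation.awareness.force.zone.values: us-east-1a,us-east-1b
```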

chaudum commented on September 24, 2024

@weswam @ahto @michabbb
We've added a best practice document about multi-zone setup / cross data center replication to our documentation.
You can find it here: https://crate.io/docs/en/latest/best_practice/multi_zone_setup.html

michabbb commented on September 24, 2024

thanks

mfussenegger commented on September 24, 2024

Guess this can be closed since it is now documented properly.

sahil98541 commented on September 24, 2024

How do I do this across 2 data centres with version 1.0.2?
I have a lab instance consisting of 6 nodes across 2 zones, as described in
https://crate.io/docs/crate/guide/en/latest/scaling/multi-zone-setup.html

When I start replication (number of replicas = 3), a table with, say, 100 shards should end up with 400 started shards.
But replication isn't working: only 25% of the shards are started, i.e. just the original copy with its 100 shards is running.

Please advise.
