Coder Social home page Coder Social logo

Comments (8)

eshioji avatar eshioji commented on May 20, 2024 3

For people who want to have a quick hacky solution to this. One can create a machine in the VPC and launch a cluster from that machine using private DNS/IP.

However, one needs to do the following:

  • Hack flintrock (https://github.com/eshioji/flintrock/pull/1/files)
  • Assign additional security group that allows access to necessary ports (the one flintrock assigns didn't work for my VPC setting)
  • Make it so that the machines can resolve their own hostname (the default AMI gets hostnames like ip-xx-xx-xx-xx and Spark fails to launch because it can't resolve it).

For the last problem, I did the following:
Create a script with the following content and let it run at startup using rc.local. Not the nicest way but it works.

OWN_IP=`curl -s http://169.254.169.254/latest/meta-data/local-ipv4`
HOSTNME=`hostname`
echo "$OWN_IP $HOSTNME" | sudo tee --append /etc/hosts

Finally thank you @nchammas , I wanted to use a custom AMI and this project was a life-saver!

from flintrock.

pferrel avatar pferrel commented on May 20, 2024 2

is something like the chaordic fork implemented here? If so are their instructions? I'm trying to run Spark behind the VPC with not outside traffic.

from flintrock.

MuksSharma avatar MuksSharma commented on May 20, 2024

Thanks @nchammas. This use case is becoming more common in enterprise setting. Any thoughts on when this feature might become available?

from flintrock.

nchammas avatar nchammas commented on May 20, 2024

As with the typical open source project, there are no promises on when a feature might be delivered. The things that usually get worked on are critical bug fixes, fun stuff, and everything else, in that order. 😄

My current intent is to work next on adding support for resizing existing clusters and improving launch progress reporting. If I get some time at work to devote to Flintrock (e.g. for this issue), I may work on it after I get those 2 features out.

from flintrock.

rmessner avatar rmessner commented on May 20, 2024

It is exactly our case ( and we don't even have hostname resolution, and that would need some hack for spark and hadoop ), I will work out something and submit it to you @nchammas.

My use case will be :

  • Downloading spark from a custom location ( thanks to #104 )
  • Use private DNS names instead of public ones
  • Define several origins ( multiple CIDR addresses to access the cluster for the security groups )

from flintrock.

rmessner avatar rmessner commented on May 20, 2024

In case of the private VPC, I'm struggling with the ssh access when installing and configuring, because i'm outside the security group and it take my public IP to access an private IP so doesn't work.

I'm thinking about 2 solutions :

  • Opening a security breach for the install and configuration step
  • Bind the access_origins to the VPC ( means my private IP is within the range of the origins )

What do you think about it @nchammas ?

from flintrock.

nchammas avatar nchammas commented on May 20, 2024

Hey @rmessner, I updated the description of the issue and commented on your PRs as well.

from flintrock.

nchammas avatar nchammas commented on May 20, 2024

Related PRs from others who have implemented their own fixes for this:

from flintrock.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.