Coder Social home page Coder Social logo

[question] about mist HOT 6 CLOSED

geoHeil avatar geoHeil commented on May 28, 2024
[question]

from mist.

Comments (6)

spushkarev avatar spushkarev commented on May 28, 2024

Hi @geoHeil !
Thanks for the question. To list a few key differences.

  1. Fundamentally job server is designed to create jobs to be run by big data engineers. Hydrosphere Mist provides higher level decoupling for web developers.
    Job server API:
    POST /jars/app_name
    POST /contexts/context_for_myapp
    POST /data/params_for_my_job
    POST /jobs/my_job

Versus Mist API for the same action:
POST /weather-forecast
{
lat: “37.777114”,
long: “-122.419631”
radius: 100
}

  1. Mist provides reactive and true asynchronous API - i.e. you could run streaming job and subscribe for updates. Also you'll be able to run parametrized streaming job or modify a running job to receive new results.

  2. Synchronous realtime API for high throughput - we are working on adding a model serving support for online queries with low latency.

  3. Python jobs support - you could implement Spark programs in Python to be executed from Mist.

  4. Spark contexts isolation and resource management on driver nodes.

  5. And others

Take a look at the blog post for some background details
https://medium.com/@stepanpushkarev/architecting-smart-applications-on-top-of-apache-spark-b0fcab6ea400

pipeline.io is not a product - it's just a demo for training classes with all the buzzwords inside a single docker container :) While it is cool I'm not sure how to use it.

seldom is is a platform with all the dependencies. Also I do not see big data technologies there.

Mist is strongly focused on providing model serving and data pipeline API on top of Spark.
Regarding testing, deployment and monitoring - we are working on other independent micro products which will complement Mist in a long term.

from mist.

geoHeil avatar geoHeil commented on May 28, 2024

Sounds pretty awesome.

Synchronous realtime API for high throughput - we are working on adding a model serving support for online queries with low latency.
When do you plan tho have low latency serving up and running?

from mist.

spushkarev avatar spushkarev commented on May 28, 2024

Its a tough problem and it depends on actual implementation of the serving action. The goal is to re-use existing Spark code and not to export the model to PMML->NoSQL DB->custom online serving layer.
We hope to have the 1st version ready at January.
If you are interested we'll share a design vision and then will collaborate on implementation.

from mist.

geoHeil avatar geoHeil commented on May 28, 2024

I am interested but not sure if I will really be able to support you. This
will depend on if we will use mist internally.
spushkarev [email protected] schrieb am Di. 22. Nov. 2016 um 23:53:

Its a tough problem and it depends on actual implementation of the serving
action. The goal is to re-use existing Spark code and not to export the
model to PMML->NoSQL DB->custom online serving layer.
We hope to have the 1st version ready at January.
If you are interested we'll share a design vision and then will
collaborate on implementation.


You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
#93 (comment),
or mute the thread
https://github.com/notifications/unsubscribe-auth/ABnc9Krt8YMBKa0K8xJMGh3kojq70x9Aks5rA3J2gaJpZM4K5me6
.

from mist.

geoHeil avatar geoHeil commented on May 28, 2024

Do I understand that for your solution containers (docker) are mandatory? Or would it work in a yarn only world as well?

from mist.

spushkarev avatar spushkarev commented on May 28, 2024

Docker is not mandatory, Mist is Scala-Akka application and could be run anywhere.

from mist.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.