Coder Social home page Coder Social logo

.Net for Spark about spark-daria HOT 3 CLOSED

dbeavon avatar dbeavon commented on June 2, 2024
.Net for Spark

from spark-daria.

Comments (3)

MrPowers avatar MrPowers commented on June 2, 2024

@dbeavon - thanks for creating the issue. I didn't even know that Microsoft has a project to bring Spark language bindings to .NET. I don't know anything about the .NET Spark library ecosystem, so don't have any valuable info for you unfortunately.

from spark-daria.

dbeavon avatar dbeavon commented on June 2, 2024

@MrPowers One of the main challenges on the .Net side of things is that you don't find any results when you google, there aren't many samples, and there aren't many utilities or libraries yet to help people get started.

For about a year I've been developing code in scala/spark, and relying on community projects in that ecosystem. But when I'm migrating any of my work to the .Net side of spark, then I'm discovering that I have to write custom code for some very basic purposes (eg. dataframe validation and what-not). There aren't a lot of community projects that offer utilities or extension API's.

The first thing I tried was a google search for .net and daria, but turned up nothing.

Would it be a problem if I someone were to create a project that reused "daria" in its name, for the sake of google searches (maybe "daria-spark-net" or whatever). It would be very helpful for anyone that had to migrate code from scala when it was built with some of the utilities in this github project.

FYI, In terms of the technical architecture, .Net works like python. It relies on the same integration features that python uses. .Net core is hosted out-of-process in a way that behaves like a side-car for a spark worker node. Just like python it uses apache arrow. Despite the comparisons with python, the c# language itself is very similar to scala and I think a good c# developer will feel relatively comfortable in scala and vice-versa. .... All this is to say that for .Net developers it will probably be more "natural" to use some equivalent of a scala API, and not some equivalent of a python library.

Please let me know if you have any strong opinions about potentially creating a .Net variation of "daria".

from spark-daria.

MrPowers avatar MrPowers commented on June 2, 2024

@dbeavon - I support building a similar project for .net users, but I'd rather a different name gets used if that's alright. I used quinn for the PySpark version of this lib. I'm sure these helper functions would be useful for the .net community. Thanks for checking and hope you're finding this lib useful!

from spark-daria.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.