Coder Social home page Coder Social logo

Comments (10)

GeorgeS2019 avatar GeorgeS2019 commented on May 27, 2024

Found answer:

Lemma only supported in English

image

from corenlp.

AngledLuffa avatar AngledLuffa commented on May 27, 2024

Technically a Python lemmatizer could be converted to Java if someone were to implement the seq2seq inference code in Java, but as we have discussed with other annotators that exist for German in Python, there's zero drive to do that for multiple reasons. If you want to talk with @manning about commissioning such work, please feel free. Otherwise, there's a lemmatizer in Stanza for German.

https://github.com/stanfordnlp/stanza

from corenlp.

GeorgeS2019 avatar GeorgeS2019 commented on May 27, 2024

@AngledLuffa
I am still exploring CoreNLP for German.
Still at a preliminary stage to explore feasibility.

Thanks for all feedback.

Python is great, it is challenging when used in other business use case.

Have you heard of TorchSharp?

from corenlp.

AngledLuffa avatar AngledLuffa commented on May 27, 2024

I had not, but I have never used C# for anything, so perhaps not surprising

from corenlp.

GeorgeS2019 avatar GeorgeS2019 commented on May 27, 2024

TorchSharp is attempt to bring PyTorch to c#

We started having so much opposition from the c# community because the syntax is python like but the programming langauge is c#

Therefore it is a best of both worlds

Writing PyTorch like code in c# with all the support of PyTorch community.

We are working towards using TorchSharp to access PyTorch models e.g. HuggingFace

from corenlp.

GeorgeS2019 avatar GeorgeS2019 commented on May 27, 2024

@AngledLuffa

Is stanza based on Torch or pytoch?

from corenlp.

GeorgeS2019 avatar GeorgeS2019 commented on May 27, 2024

seq2seq inference code

This is a regular code in TorchSharp.

Perhaps using the lemma model in Stanza and doing it in TorchSharp, by passing the CoreNLP in Java, but using CoreNLP in C#

from corenlp.

AngledLuffa avatar AngledLuffa commented on May 27, 2024

Is stanza based on Torch or pytorch?

It's Python, using pytorch

Perhaps using the lemma model in Stanza and doing it in TorchSharp, by passing the CoreNLP in Java, but using CoreNLP in C#

There's a fundamental problem here which is

  • no one here uses C# for anything
  • even the number of Java projects is very limited. it's basically me, another research engineer, and our PI who work on this as needed
  • everyone here thinks that because Stanza is available for German, with a complete suite of models, there's no point in adding more functionality to the CoreNLP side of things. the one asterisk being that there is currently no Stanza constituency parser
  • if we did spend time converting a model such as the lemmatizer to Java, it would not benefit the group in any way (no publications, no citations, no $$) and it would not benefit the individual in any way (seeing as how publications and citations are needed to advance our careers). therefore no one will ever wake up one day and say "today is the day / week / month to convert the German lemmatizer to Java"

The only ways I see to change this dynamic is to do it yourself, to propose a research project that for some reason needs a German lemmatizer in Java with a high likelihood of being published, or offer our PI to buy a commercial license for CoreNLP specifically with a German lemmatizer. You are welcome to pursue any of those three options, and otherwise I think we're reaching the limit of what can be accomplished with polite conversation about the limitations of the CoreNLP models for German.

from corenlp.

GeorgeS2019 avatar GeorgeS2019 commented on May 27, 2024

@AngledLuffa

Currently not in position related to commercial.

Imagine, a smart German learning app that will help Ukrainian to speed up their integration (by speaking German in record time) in Germany.

This is the direction I could think of. Is there funding that would support this from US?

from corenlp.

AngledLuffa avatar AngledLuffa commented on May 27, 2024

from corenlp.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.