Comments (10)
Found answer:
Lemma only supported in English
from corenlp.
Technically a Python lemmatizer could be converted to Java if someone were to implement the seq2seq inference code in Java, but as we have discussed with other annotators that exist for German in Python, there's zero drive to do that for multiple reasons. If you want to talk with @manning about commissioning such work, please feel free. Otherwise, there's a lemmatizer in Stanza for German.
https://github.com/stanfordnlp/stanza
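For anyone landing here, a minimal sketch of what German lemmatization with Stanza might look like in Python. The helper names `build_german_pipeline` and `lemmatize` and the sample sentence are made up for illustration. The `stanza.download` and `stanza.Pipeline` calls follow Stanza's documented usage, and the models are downloaded on first run.

```python
def build_german_pipeline():
    """Build a Stanza pipeline for German (hypothetical helper name)."""
    import stanza  # imported lazily so lemmatize() can be used without it

    stanza.download("de")  # fetches the default German models on first use
    # The lemmatizer relies on tokenization, multi-word token expansion and POS tags.
    return stanza.Pipeline("de", processors="tokenize,mwt,pos,lemma")


def lemmatize(nlp, text):
    """Return (word, lemma) pairs for every word in the text."""
    doc = nlp(text)
    return [(word.text, word.lemma)
            for sentence in doc.sentences
            for word in sentence.words]


if __name__ == "__main__":
    nlp = build_german_pipeline()
    # Sample sentence chosen only for illustration.
    for word, lemma in lemmatize(nlp, "Die Kinder spielten im Garten."):
        print(word, lemma)
```

Passing the pipeline in as an argument means it is built once and reused, since loading the models is the slow part.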
@AngledLuffa
I am still exploring CoreNLP for German.
I am still at a preliminary stage, assessing feasibility.
Thanks for all the feedback.
Python is great, but it is challenging to use in other business use cases.
Have you heard of TorchSharp?
I had not, but I have never used C# for anything, so perhaps that's not surprising.
TorchSharp is an attempt to bring PyTorch to C#.
We started out with a lot of opposition from the C# community because the syntax is Python-like but the programming language is C#.
Even so, it is the best of both worlds:
writing PyTorch-like code in C# with all the support of the PyTorch community.
We are working towards using TorchSharp to access PyTorch models, e.g. from HuggingFace.
Is stanza based on Torch or pytorch?
seq2seq inference code
This is regular code in TorchSharp.
Perhaps using the lemma model in Stanza and doing it in TorchSharp, bypassing the CoreNLP in Java, but using CoreNLP in C#
Is stanza based on Torch or pytorch?
It's Python, using pytorch
Perhaps using the lemma model in Stanza and doing it in TorchSharp, bypassing the CoreNLP in Java, but using CoreNLP in C#
There's a fundamental problem here which is
- no one here uses C# for anything
- even the number of Java projects is very limited. it's basically me, another research engineer, and our PI who work on this as needed
- everyone here thinks that because Stanza is available for German, with a complete suite of models, there's no point in adding more functionality to the CoreNLP side of things. the one asterisk being that there is currently no Stanza constituency parser
- if we did spend time converting a model such as the lemmatizer to Java, it would not benefit the group in any way (no publications, no citations, no $$) and it would not benefit the individual in any way (seeing as how publications and citations are needed to advance our careers). therefore no one will ever wake up one day and say "today is the day / week / month to convert the German lemmatizer to Java"
The only ways I see to change this dynamic are to do it yourself, to propose a research project that for some reason needs a German lemmatizer in Java with a high likelihood of being published, or to offer to buy a commercial license for CoreNLP from our PI, specifically with a German lemmatizer. You are welcome to pursue any of those three options, and otherwise I think we're reaching the limit of what can be accomplished with polite conversation about the limitations of the CoreNLP models for German.
I am currently not in a position to pursue the commercial option.
Imagine a smart German learning app that helps Ukrainians speed up their integration in Germany (by speaking German in record time).
This is the direction I could think of. Is there funding from the US that would support this?