xgeric / ucphrase-exp Goto Github PK
View Code? Open in Web Editor NEWThis project provides an unsupervised framework for mining and tagging quality phrases on text corpora with pretrained language models (KDD'21).
License: Apache License 2.0
This project provides an unsupervised framework for mining and tagging quality phrases on text corpora with pretrained language models (KDD'21).
License: Apache License 2.0
Can cpu run this code?
and
How to run ?
Hi,
Thanks for opening your code for this paper.
I read your paper and I found that you compared your results with pretrained model such as stanfordNLP. I am curious about how to extract phrases/mentions by stanfordNLP. Do you use "coref" annotator to get coreferent mentions as extracted phrase? Or there are other tools based on stanfordCoreNLP?
Wait for your reply.
Thanks in advance.
The data folder download website can't be opened
When I download the data from https://www.dropbox.com/s/1bv7dnjawykjsji/data.zip?dl=0 and unzip the package, no file of '/standard/kp20k.tagging.human_2.json'.
怎么跑自己的数据呀??
This might be a repeat, but I didn't find an answer.
Could you please specify where can I find actual resulting phrases after running the code?
I scanned the code and all the configs multiple times, but i don't see it.
Unless it's the spans in Attmap.cs_roberta_base.3layers, but in that case i'm lost at trying to interpret it.
Cheers!
Edit: I checked devdata-cs_roberta_base-core.CNN.3layers/model and these spans make much more sense, still could you please confirm those are the ones and how do I interpret them?
How can we run this pip line on our own dataset?
Have you tried your model in other languages? Can you provide the corpus, such as Chinese?
Hi! Excuse! What the labels of the CNN model ? The input of the CNN is 'attention map' and the output of the model is 'silver labels' ? However, the sliver labels are not all correct ?
error: could not create 'match/keywordprocessor.cpython-37m-x86_64-linux-gnu.so': No such file or directory
Steps to reproduce:
The dataset cannot be downloaded
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.