xgeric / ucphrase-exp

This project provides an unsupervised framework for mining and tagging quality phrases on text corpora with pretrained language models (KDD'21).

License: Apache License 2.0

Shell 0.11% Python 74.38% Cython 25.51%

ucphrase-exp's Issues

How to extract phrases using the Stanford CoreNLP tool?

Hi,

Thanks for releasing the code for this paper.
I read your paper and saw that you compared your results with pretrained models such as StanfordNLP. I am curious how you extracted phrases/mentions with StanfordNLP. Did you use the "coref" annotator and take the coreferent mentions as the extracted phrases? Or did you use other tools based on Stanford CoreNLP?

Looking forward to your reply.

Thanks in advance.

How do I run this on my own data?

Results interpretation

This might be a duplicate, but I didn't find an answer.
Could you please specify where I can find the actual resulting phrases after running the code?
I scanned the code and all the configs multiple times, but I don't see them.
Unless they are the spans in Attmap.cs_roberta_base.3layers, but in that case I'm lost trying to interpret them.
Cheers!

Edit: I checked devdata-cs_roberta_base-core.CNN.3layers/model and these spans make much more sense. Could you please confirm that those are the results, and explain how to interpret them?

What are the labels of the CNN?

Hi! Excuse me, what are the labels of the CNN model? Is the input of the CNN the attention map, and is the model trained against the silver labels? But the silver labels are not all correct, are they?
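
For readers unfamiliar with the term, the toy sketch below illustrates the general idea of "silver labels": word sequences that repeat within a single document are treated as noisy positive examples, so they are not expected to be all correct. This is an illustration under stated assumptions, not the repository's implementation; the function name repeated_ngrams and its parameters min_count and max_len are hypothetical.

```python
from collections import Counter
from typing import List, Set, Tuple

Ngram = Tuple[str, ...]


def _contains(longer: Ngram, shorter: Ngram) -> bool:
    """Return True if `shorter` occurs as a contiguous run inside `longer`."""
    n = len(shorter)
    return any(longer[i:i + n] == shorter for i in range(len(longer) - n + 1))


def repeated_ngrams(tokens: List[str], min_count: int = 2, max_len: int = 4) -> Set[Ngram]:
    """Collect n-grams (2 <= n <= max_len) seen at least `min_count` times in one document.

    Hypothetical helper: an n-gram is dropped when a longer repeated n-gram
    with the same count contains it, so only maximal sequences remain.
    """
    counts = Counter(
        tuple(tokens[i:i + n])
        for n in range(2, max_len + 1)
        for i in range(len(tokens) - n + 1)
    )
    frequent = {g for g, c in counts.items() if c >= min_count}
    return {
        g for g in frequent
        if not any(
            len(h) > len(g) and counts[h] == counts[g] and _contains(h, g)
            for h in frequent
        )
    }


doc = "we study topic modeling . topic modeling is useful".split()
silver = repeated_ngrams(doc)
print(silver)
```

Such labels are noisy by construction: a repeated sequence is not always a quality phrase, and many quality phrases occur only once in a document.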

Getting "Killed" error when running on the kp20k dataset

When I run python exp.py --gpu 0 --dir_data ../data/kp20k, the process stops with a "Killed" message.
While debugging, I found that it happens at return torch.tensor(padded_tensor, dtype=torch.float32) (line 66 of src/model_att/model.py).

Is there a way to fix this?
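
A bare "Killed" with no Python traceback often means the operating system stopped the process for running out of memory, which would be consistent with one very large padded tensor being built in a single step. The sketch below is an assumption, not the repository's code or a confirmed fix: the helpers pad_batch and iter_padded_batches are hypothetical, and the data is simplified to 1-D float sequences. It only illustrates padding one batch at a time, so that no single allocation has to cover the whole dataset.

```python
from typing import Iterator, List, Sequence


def pad_batch(seqs: Sequence[Sequence[float]], pad_value: float = 0.0) -> List[List[float]]:
    """Right-pad every sequence to the longest length within this batch only."""
    max_len = max(len(s) for s in seqs)
    return [list(s) + [pad_value] * (max_len - len(s)) for s in seqs]


def iter_padded_batches(
    seqs: Sequence[Sequence[float]], batch_size: int
) -> Iterator[List[List[float]]]:
    """Yield padded batches lazily instead of padding the full dataset at once."""
    for start in range(0, len(seqs), batch_size):
        yield pad_batch(seqs[start:start + batch_size])


data = [[1.0], [1.0, 2.0], [3.0, 4.0, 5.0]]
for batch in iter_padded_batches(data, batch_size=2):
    # Each batch is small enough to convert on its own, for example with
    # torch.tensor(batch, dtype=torch.float32), if PyTorch is in use.
    print(batch)
```

Because each batch is padded only to its own longest sequence, peak memory depends on the batch size rather than on the longest item in the whole corpus.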
