deepapi's Introduction

Deep API Learning

Code for the FSE 2016 paper Deep API Learning.

Two Versions

We release both Theano and PyTorch code of our approach, in the theano and pytorch folders, respectively.

  • The theano folder contains the code to run the experiments presented in the paper. The code is frozen to what it was when we originally wrote the paper. (NOTE: we updated some deprecated API invocations to work with the latest Python and Theano.)

  • The pytorch folder is the bleeding-edge repository, where we packaged the code up, improved its quality, and added some features.

If you are interested in using DeepAPI, check out the PyTorch version and feel free to contribute.

For more information, please refer to the README files under the directory of each component.

Tool Demo

An online tool demo can be found at http://211.249.63.55/ (currently shut down due to limited budget).

Citation

If you find it useful and would like to cite it, the following would be appropriate:

@inproceedings{gu2016deepapi,
    author = {Gu, Xiaodong and Zhang, Hongyu and Zhang, Dongmei and Kim, Sunghun},
    title = {Deep API Learning},
    booktitle = {Proceedings of the 2016 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering},
    series = {FSE 2016},
    year = {2016},
    location = {Seattle, WA, USA},
    pages = {631--642},
    publisher = {ACM},
    address = {New York, NY, USA},
}

deepapi's Issues

Please provide testing API or trained model.

Hi,
I am a researcher at York University and I am trying to reproduce your work. Is there any way to use your trained model for performance evaluation now? The README says the testing API is not available due to budget.
If not, could you please share the trained model? My email is [email protected]
Regards,
Moshi

Missing training hdf5 files

I am unable to run the PyTorch code. It is looking for the training h5 file train.apiseq.shuf.h5:

raise IOError("``%s`` does not exist" % (filename,))
OSError: ``./data/train.apiseq.shuf.h5`` does not exist

Where can I get the training files?

please provide accurate training data

Hello!

Thanks for your amazing work!
I am a researcher at York University and I am trying to reproduce your work. I ran into a problem:

I am trying to extract training data from the h5 file using the data loader, but I get nonsensical labels:
image
I am sure that this is not a parsing error, because:

The index matches the dictionary:
image
dictionary:
image

I am using the original APIdata class
image
and I decode the API answer using the original script:
image

Please confirm this problem and upload the most recent valid dataset.
My email is [email protected]

Best,
Moshi

Question about BLEU metric

Dear authors,

I ran sample.py and the script output three values: recall, precision, and F1.

image

Which of these values does Table 1 of your paper report: recall, precision, or F1?
image

Thank you for your clarification!

Is the training model trustworthy?

I downloaded the 120,000 model that you trained on the dataset and tested it. On 1,000 test questions I obtained the following results:
Avg Recall BLEU 34.955057, Avg Precision BLEU 34.955057, F1 34.955057. All three values are identical. Is there a problem?
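
One observation that may help triage this report: if F1 is computed as the usual harmonic mean of precision and recall (an assumption, since the thread does not show how the script derives it), then identical recall and precision necessarily yield an identical F1. The matching F1 would then follow from the first two values, and the open question is why recall and precision coincide. A minimal sketch, using a hypothetical helper `f1_score` that is not taken from the repository:

```python
import math


def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall.

    Hypothetical helper for illustration; not the repository's code.
    """
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Value reported in the issue for both recall and precision.
reported = 34.955057
f1 = f1_score(reported, reported)

# When precision == recall, the harmonic mean equals that common value.
print(math.isclose(f1, reported))
```

Under this assumption, only the equality of recall and precision needs explaining; it could arise, for example, if both are computed from the same pair of sequences, but the thread does not confirm the cause.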

How is the training set generated

Dear Professor Gu, how is the training set generated? When I opened the 'train.apiseq.h5' file, I found two datasets: indices and phrases. What does each of them represent? In addition, the indices dataset contains two attributes, length and pos. How are they generated? Even after reading the paper I could not work this out, and it has caused me a lot of trouble. I sincerely look forward to your reply. Thank you!
Here are some of the datasets I could see.
image
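
For readers with the same question, one common way to store variable-length sequences in a single HDF5 file is a flat array plus an index table. The sketch below assumes, without confirmation from the authors, that phrases is such a flat array of token ids and that each indices row gives the start offset (pos) and token count (length) of one sequence. The toy data and the helper `get_sequence` are hypothetical, and plain Python lists stand in for the real file.

```python
# Toy stand-ins for the two datasets named in the question (values are made up).
phrases = [11, 12, 13, 21, 22, 31, 32, 33, 34]
indices = [
    {"pos": 0, "length": 3},
    {"pos": 3, "length": 2},
    {"pos": 5, "length": 4},
]


def get_sequence(i):
    """Return the i-th sequence by slicing the flat array (assumed layout)."""
    row = indices[i]
    return phrases[row["pos"]: row["pos"] + row["length"]]


# Rebuild every sequence from the flat array and the index table.
sequences = [get_sequence(i) for i in range(len(indices))]
print(sequences)
```

If this assumption holds, mapping each id through the vocabulary dictionary would recover the tokens of a sequence.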

can't import ScheduledOptim in seq2seq.py

Dear Author,
When I tried to run the PyTorch code, the ScheduledOptim class imported in seq2seq.py could not be found. I did not find this class in modules.py. How can I solve this problem?
Thank you!
