Coder Social home page Coder Social logo

gpt2sp's Introduction

GPT2SP Replication Package


GPT2SP

A web-based Agile story point estimator
View Demo »

Table of Contents
  1. How to replicate
  2. Acknowledgements
  3. License

How to replicate

About the Datasets

All of the datasets for 16 different projects are available in the marked_data folder Each dataset has the following 5 columns:

  1. issuekey: Issue ID
  2. title: Issue Title
  3. description: Issue Description
  4. storypoint: Assigned Story Point of the Issue
  5. split_mark: Represent whether the row was used as training, validation, or testing
issuekey title description storypoint split_mark
... ... ... ... ...

About the Models

Model Naming Convention

All of the models on HuggingFace Model Hub and Google Drive has the same naming convention as described in the following table:

Model ID Model Specification Experiment Scenario
#0 BPE GPT2 Tokenizer + Custom Pre-trained GPT-2 (GPT2SP) Within-Project
#00 BPE GPT2 Tokenizer + Custom Pre-trained GPT-2 (GPT2SP) Within-Repository
#000 BPE GPT2 Tokenizer + Custom Pre-trained GPT-2 (GPT2SP) Cross-Repository
#2 Word-levelSP Tokenizer + Custom Pretrained GPT-2 Within-Project
#22 Word-levelSP Tokenizer + Custom Pretrained GPT-2 Within-Repository
#222 Word-levelSP Tokenizer + Custom Pretrained GPT-2 Cross-Repository
#6 WordPieceSP Tokenizer + Custom Pretrained GPT-2 Within-Project
#66 WordPieceSP Tokenizer + Custom Pretrained GPT-2 Within-Repository
#666 WordPieceSP Tokenizer + Custom Pretrained GPT-2 Cross-Repository
#7 SentencePieceSP Tokenizer + Custom Pretrained GPT-2 Within-Project
#77 SentencePieceSP Tokenizer + Custom Pretrained GPT-2 Within-Repository
#777 SentencePieceSP Tokenizer + Custom Pretrained GPT-2 Cross-Repository

How to access the models

About the Model Training Process

About the GPT2SP Web App

  • Access the GPT2SP web app here to interact with our GPT2SP model and navigate the datasets

Acknowledgements

  • Special thanks to DeepSE's developers for providing the datasets and the replication package.
  • Special thanks to developers from PyTorch, HuggingFace, Streamlit, Transformers Interpret for providing amazing frameworks for the community

License

MIT License

gpt2sp's People

Contributors

michaelfu1998-create avatar vtawosi avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.