Coder Social home page Coder Social logo

fsdl-2022-weak-supervision-project's Introduction

fsdl-2022-weak-supervision-project's People

Contributors

bergr7 avatar diegoquintanav avatar edabati avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

fsdl-2022-weak-supervision-project's Issues

Create model UI for users

Effort: 1 person (max 2)

We should create a simple UI for users to query and interact with the model.
The UI should require an input text from the user, request the model prediction using the model API endpoint in services/api-serveless and show the prediction in an informative way.

Ideally it should be setup to run in a docker container (or AWS EC2 like the example course labs?).

Options for implementation:

  • Gradio
  • Streamlit

Train 1st model using Weak Labels

Effort: 1 person (max 2)

We created a labeled dataset created with Weak Supervision available here: https://huggingface.co/datasets/bergr7/weakly_supervised_ag_news

We should now train the first model on that and save it to a place accessible for the rest of the team.

Suggested steps:

  • fine-tune distilbert
  • train using huggingface Transformers API
  • track experiment in W&B: docs
  • store the Model in the W&B model registry and/or hugging face hub

We should have a team space in W&B if I am not wrong.

Finish Active Learning script + documentation/readme

Effort: 2 people (max 3)

First attempt of an active learning loop with connection to Rubrix for labelling was made in branch active-learning-rubrix. See this notebook

Some parts are missing:

  • use unlabelled data resulted from weak labelling
  • decide on loop initialisation (probably using prediction of model created in #7)
  • decide sampling strategy
  • decide whether small-text should be used or we should code something from scratch
  • decide if we can switch away from the @listener API to something more straightforward like this example

Also we should create a documentation explaining the design decisions, describing how an active learning loop could be used and deployed, etc.

FYI @bergr7

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.