Coder Social home page Coder Social logo

mass-labeling's Introduction

Mass Labeling

version License node mongodb

Description

Mass Labeling is an open source project for data assessment. It may be used to assess data for classification tasks.

Advantages

The main advantages of this project are:

  • easy to deploy,
  • easy to manage,
  • data is not distributed,
  • no need to manually collect results.

The last two advantages are worth describing in detail.

"Private data" case

If you decide to use any proprietary online data assessment service, then you should transfer your data to a server belonging to the service. Sometimes it is not comfortable, and sometimes it is not even possible due to the privacy rights to the data. In this case, you need to host an assessment tool on your own server.

"Offline tool" case

If you decide to use an offline data assessment tool, then you should distribute data between assessors. So, each of them gets a whole dataset, which is valuable for you or your company. Also, after the assessment job is done the new problem occurs. You will need to collect all the labels and merge them into one dataset.

So, if you wish to keep all the data and labels on your server together, then mass-labeling is the right choice for you.

Useful features

Mass Labeling has

  • built-in slider mechanism to look throw labeled data,
  • statistics to measure assessors work quality,
  • multilanguage support.

Technogical stack

  • node.js
  • mongodb

Installation

Installation is described in this guide

User guide

See the user guide which describes the basic user and administrator operations.

Contribution

This project was separated from the family of internal projects. So, some variables in the code may be confusing. The refactoring is welcome.

License

Apache License 2.0

mass-labeling's People

Contributors

v3il avatar vlad-iakovlev avatar romansteinberg avatar

Watchers

James Cloos avatar

mass-labeling's Issues

Импорт иногда некорректно отрабатывает, при этом не возникает ошибки в процессе импорта

При попытке импортировать некорректный датасет через cli (например, NaN в поле "url") скрипт не сообщает об ошибке. При этом этот датасет не попадает в список доступных датасетов. Приходится самостоятельно догадываться, в чем же проблема и почему некорректен датасет.

Forms edit

We need to develop editing mode for forms assessment.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.