Coder Social home page Coder Social logo

clianer's Introduction

CLI-a-ner

image

Installation

First, have the OpusCleaner installed on your system.

Then, clone this repository and install the additional requirements (at this point it's only urwid beyond what you already need to install to get a working install of OpusCleaner)

Usage

Set up the DATA_PATH (and perhaps the SAMPLE_SIZE) environment variables (these are used by OpusCleaner as usual). Then, run the app with ./main.py.

For example:

export DATA_PATH='/home/helcl/hplt/translation-models/en-cs/*.*.gz'
export SAMPLE_SIZE=100
cd path/to/clianer/
./main.py

Controls

Most of the controls are listed in the bottom bar of the app frame. However, there are some other controls depending the current application focus. Move focus between filter view and dataset view using left and right arrow.

Common controls

These work independently or whether focus is in the filter view or in the dataset view.

  • F2 opens up a new dataset
  • F3 adds a new filter
  • F6 show clean version of the data in the dataset view
  • F7 assign categories to current dataset
  • F10, q exit the application
  • Down, Up move within the focused window (PgUp and PgDn also work)

Filter view controls

  • F4 edit filter
  • F5 import filter pipeline from a different dataset (careful, this overwrites whatever is the current pipeline)
  • F8 remove filter
  • w, s move selected filter up or down
  • d mark filter for diffing
  • r reset diffing

Dataset view controls

  • F4 show diff (select which filter steps to diff in the filter view)
  • F5 show clean version of the data

clianer's People

Contributors

jindrahelcl avatar

Watchers

 avatar

Forkers

hplt-project

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.