This project forked from bigdata-ustc/edunlp

A library for advanced Natural Language Processing towards multi-modal educational items.

License: Apache License 2.0

Python 99.88% Makefile 0.12%

edunlp's Introduction

EduNLP

EduNLP is a Python library for advanced Natural Language Processing and one of the projects in the EduX plan of BDAA. It is built on the latest research and was designed from day one to be used in real educational products.

EduNLP now comes with pretrained pipelines and currently supports segmentation, tokenization, and vectorization. It supports various kinds of preprocessing for NLP in educational scenarios, such as formula parsing and multi-modal segmentation.

EduNLP is commercial open-source software, released under the Apache-2.0 license.

Quickstart

Installation

Clone the repository and install with pip:

# basic installation
pip install .

# full installation
pip install .[full]

Or install from PyPI:

# basic installation
pip install EduNLP

# full installation
pip install EduNLP[full]

Usage

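from EduNLP import get_pretrained_i2v

# Load the pretrained "d2v_all_300" model, using ./model as the local model directory
i2v = get_pretrained_i2v("d2v_all_300", "./model")
# Vectorize a batch of items; returns item-level and token-level vectors
item_vector, token_vector = i2v(["the content of item 1", "the content of item 2"])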
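Once items are vectorized, a common next step is to compare them. The following is a minimal sketch, not part of EduNLP itself: it assumes each entry of `item_vector` is a fixed-length sequence of numbers, and it uses small stand-in values so that it runs without downloading a model. The helper `cosine_similarity` is a hypothetical name introduced here for illustration.

```python
import math
from typing import Sequence


def cosine_similarity(u: Sequence[float], v: Sequence[float]) -> float:
    """Return the cosine of the angle between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        # Convention chosen for this sketch: a zero vector is similar to nothing.
        return 0.0
    return dot / (norm_u * norm_v)


# Stand-in values (assumption); in practice these would come from `item_vector` above.
item_vector = [[1.0, 0.0, 2.0], [2.0, 0.0, 4.0]]
similarity = cosine_similarity(item_vector[0], item_vector[1])
print(f"similarity between item 1 and item 2: {similarity:.3f}")
```

If the vectors returned by the model are NumPy arrays or tensors, the same function should still work because it only iterates over the elements, although a vectorized implementation would be faster for large batches.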

Tutorial

For more details, please refer to the full documentation (latest | stable).

Resource

We will continue to publish new datasets in Standard Item Format (SIF) to encourage related research. The data resources can be accessed via another EduX project, EduData.

Contribute

EduNLP is still under development. More algorithms and features will be added, and we always welcome contributions that help make EduNLP better. If you would like to contribute, please follow this guideline (development guide).

Citation

If this repository is helpful to you, please cite our work:

@misc{bigdata2021edunlp,
  title={EduNLP},
  author={bigdata-ustc},
  publisher = {GitHub},
  journal = {GitHub repository},
  year = {2021},
  howpublished = {\url{https://github.com/bigdata-ustc/EduNLP}},
}

edunlp's People

Contributors

baooooom, tswsxk, kenelmqlh, pingzhili, nnnyt, karin0018, fannazya, shangzixue, wintermelon008