EduNLP

EduNLP is a library for advanced Natural Language Processing in Python and is one of the projects of EduX plan of BDAA. It's built on the very latest research, and was designed from day one to be used in real educational products.

EduNLP now comes with pretrained pipelines and currently supports segment, tokenization and vertorization. It supports varies of preprocessing for NLP in educational scenario, such as formula parsing, multi-modal segment.

EduNLP is commercial open-source software, released under the Apache-2.0 license.

Quickstart

Installation

Git and install by pip

# basic installation
pip install .

# full installation
pip install .[full]

or install from pypi:

# basic installation
pip install EduNLP

# full installation
pip install EduNLP[full]

Usage

from EduNLP import get_pretrained_i2v
i2v = get_pretrained_i2v("d2v_all_300", "./model")
item_vector, token_vector = i2v(["the content of item 1", "the content of item 2"])

Tutorial

For more details, please refer to the full documentation (latest | stable).

Resource

We will continuously publish new datasets in Standard Item Format (SIF) to encourage the relevant research works. The data resources can be accessed via another EduX project EduData

Contribute

EduNLP is still under development. More algorithms and features are going to be added and we always welcome contributions to help make EduNLP better. If you would like to contribute, please follow this guideline(开发指南).

Citation

If this repository is helpful for you, please cite our work

@misc{bigdata2021edunlp,
  title={EduNLP},
  author={bigdata-ustc},
  publisher = {GitHub},
  journal = {GitHub repository},
  year = {2021},
  howpublished = {\url{https://github.com/bigdata-ustc/EduNLP}},
}

Name		Name	Last commit message	Last commit date
Latest commit History 1,142 Commits
.github		.github
EduNLP		EduNLP
asset/_static		asset/_static
docs		docs
examples		examples
scripts/extlib		scripts/extlib
static/test_data		static/test_data
tests		tests
.gitignore		.gitignore
.readthedocs.yml		.readthedocs.yml
AUTHORS.md		AUTHORS.md
CHANGE.txt		CHANGE.txt
CONTRIBUTE.md		CONTRIBUTE.md
CONTRIBUTE_CH.md		CONTRIBUTE_CH.md
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
Makefile		Makefile
README.md		README.md
pytest.ini		pytest.ini
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EduNLP

Quickstart

Installation

Usage

Tutorial

Resource

Contribute

Citation

About

Releases 11

Packages

Contributors 9

Languages

License

bigdata-ustc/EduNLP

Folders and files

Latest commit

History

Repository files navigation

EduNLP

Quickstart

Installation

Usage

Tutorial

Resource

Contribute

Citation

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 11

Packages 0

Contributors 9

Languages

Packages