-
Notifications
You must be signed in to change notification settings - Fork 18
/
CHANGE.txt
60 lines (51 loc) · 2.28 KB
/
CHANGE.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
v1.0.0
1. Support cuda for I2V and T2V.
2. Add demos for downstream tasks including knowledge & difficulty & discrimination prediction, similarity prediction and paper segmentation.
3. Refactor quesnet for pretrain and vectorization.
4. Update documents about tutorials and API.
v0.0.9
1. Refactor tokenizer Basic Tokenizer and Pretrained Tokenizer
2. Refactor model structures following huggingface styles for Elmo, BERT, DisenQNet and QuesNet
3. Add PreprocessingPipeline and Pipeline
4. Add downstream task: knowledge prediction and property prediction
5. Fix a bug in RNN which causes ELMo not converging
6. Move all the test models to modelhub
7. Update test data files
v0.0.8
1. add Emlo
2. add DisenQNet
3. add QuesNet
4. add tal-edu-bert
5. add dynamic mapping table from modelhub
6. fix cuda error
7. update pretrained models
v0.0.7:
1. add BERT and pretrained model (luna_bert)
2. speed up the process in sif
3. handling OOV in word2vec
4. add English tutorials
5. add api docs and prettify tutorials
6. fix the np.error in gensim_vec.W2V.infer_vector
7. fix the parameters lost in tokenization
v0.0.6:
1. dev: add half-pretrained rnn model
2. important!!!: rename TextTokenizer to PureTextTokenizer, and add a new tokenizer named TextTokenizer (the two have similar but not the same behaviours).
3. sif: add $\textf{}$ syntax
4. add two pretrained w2v model: w2v_sci_300 and w2v_lit_300
v0.0.5:
1. fix the missing stopwords.txt when use pip install
v0.0.4:
1. fix the project errors
v0.0.3:
1. update formula ast: supporting more symbols and functions defined in katex
2. add tokens to vector tools, including word2vec and doc2vec using gensim
3. sci4sif support tokenization grouped by segments
4. add special tokens: \SIFTag and \SIFSep
5. add item to vector tools
6. add interface for getting pretrained models, where the supported model names can be accessed by `edunlp i2v` in the command console
v0.0.2:
1. fix potential ModuleNotFoundError
v0.0.1:
1. Add Formula class to parse latex formula, which will generate the abstract syntax tree.
2. Add SIF v0.0.2.
3. Add sif4sci function which serves as a preprocess function for downstream tasks.