Minor version +1: v1.3.0 adds an IO adapter interface, a memory pool, Taiwan Traditional (臺灣正體) and Hong Kong Traditional (香港繁體)
hankcs committed Sep 24, 2016
1 parent 37ca334 commit 4e668b0
Showing 2 changed files with 7 additions and 5 deletions.
6 changes: 3 additions & 3 deletions README.md
@@ -39,7 +39,7 @@ HanLP: Han Language Processing
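 * Tones
 > * Simplified/Traditional Chinese conversion
 * Traditional Chinese word segmentation
-* Simplified/Traditional divergent words
+* Simplified/Traditional divergent words (Simplified, Traditional, Taiwan Traditional 臺灣正體, Hong Kong Traditional 香港繁體)
 > * Text recommendation
 * Semantic recommendation
 * Pinyin recommendation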
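@@ -57,7 +57,7 @@ HanLP: Han Language Processing
 * CoNLL UA/LA/DA evaluation tool


-While providing rich functionality, **HanLP** keeps its internal modules loosely coupled, loads models lazily, exposes services statically, and publishes dictionaries in plain text, which makes it very convenient to use. It also ships with some corpus-processing tools to help users train their own corpus
+While providing rich functionality, **HanLP** keeps its internal modules loosely coupled, loads models lazily, exposes services statically, and publishes dictionaries in plain text, which makes it very convenient to use. It also ships with some corpus-processing tools to help users train their own models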

------

@@ -79,7 +79,7 @@ Solr 5.x and Lucene 5.x plugin: https://github.com/hankcs/hanlp-solr-plugin
 <dependency>
     <groupId>com.hankcs</groupId>
     <artifactId>hanlp</artifactId>
-    <version>portable-1.2.11</version>
+    <version>portable-1.3.0</version>
 </dependency>
```

6 changes: 4 additions & 2 deletions src/main/java/com/hankcs/hanlp/seg/Segment.java
@@ -278,10 +278,12 @@ private static void combineWords(Vertex[] wordNet, int start, int end, CoreDicti
         else
         {
             StringBuilder sbTerm = new StringBuilder();
-            for (int j = start; j < end; ++j)
+            for (int j = start; j < end;)
             {
-                sbTerm.append(wordNet[j]);
+                String realWord = wordNet[j].realWord;
+                sbTerm.append(realWord);
                 wordNet[j] = null;
+                j += realWord.length();
             }
             wordNet[start] = new Vertex(sbTerm.toString(), value);
         }
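
The loop change in `combineWords` above can be read as follows; this is a minimal sketch, not the project's code. It assumes (the commit does not state this) that `wordNet` is indexed by character offset, so a word of length n sits in the slot of its first character and the next n-1 slots are empty. `CombineSketch` and its cut-down `Vertex` are hypothetical stand-ins for illustration, and the `value` attribute is omitted.

```java
public class CombineSketch {
    /** Hypothetical stand-in for HanLP's Vertex; only the field used here. */
    static final class Vertex {
        final String realWord;

        Vertex(String realWord) {
            this.realWord = realWord;
        }
    }

    /**
     * Merges the words in [start, end) into one vertex stored at start.
     * Assumption: slot j holds the word that begins at character offset j,
     * so the remaining slots covered by that word are null.
     */
    static void combineWords(Vertex[] wordNet, int start, int end) {
        StringBuilder sbTerm = new StringBuilder();
        for (int j = start; j < end; ) {
            String realWord = wordNet[j].realWord;
            sbTerm.append(realWord);
            wordNet[j] = null;
            // Jump to the offset where the next word starts.
            j += realWord.length();
        }
        wordNet[start] = new Vertex(sbTerm.toString());
    }

    public static void main(String[] args) {
        // "ab" starts at offset 0, "c" at offset 2, "de" at offset 3.
        Vertex[] wordNet = new Vertex[5];
        wordNet[0] = new Vertex("ab");
        wordNet[2] = new Vertex("c");
        wordNet[3] = new Vertex("de");
        combineWords(wordNet, 0, 5);
        System.out.println(wordNet[0].realWord); // prints "abcde"
    }
}
```

Under the same assumption, the earlier `++j` loop would also visit the empty slots, and `sbTerm.append(wordNet[j])` on a null slot appends the literal text `null` to the merged term.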