Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

如何把hanlp 与 solr的DataImportHandler对接? #13

Open
tabascobath opened this issue Sep 6, 2016 · 4 comments
Open

如何把hanlp 与 solr的DataImportHandler对接? #13

tabascobath opened this issue Sep 6, 2016 · 4 comments
Labels

Comments

@tabascobath
Copy link

你好:
本人solr新手,麻烦详细指教
我已经导入hanlp-sor-plugin并且在solr里面运行analysis正常了

之后我想从PDF, word,... 等档案格式中提取数据导入索引
请参考 http://iamyida.iteye.com/blog/2214600

请问hanlp是否也有同tika一样(TikaEntityProcessor)提供相应的processor?
或者有其他方法呢?

@hankcs
Copy link
Owner

hankcs commented Sep 6, 2016

你好,

各司其职,DataImportHandler由solr提供,Processor由tika提供,Analyzer由hanlp(或类似ik等分词器)提供。

@hankcs hankcs added the question label Sep 6, 2016
@tabascobath
Copy link
Author

谢谢你的说明, 我有成功作到data import了!

另外一个初学者的问题
我想要通过class path下的hanlp.properties进行配置
在tomcat环境下的sole应该要把hanlp.properties放在哪里呢?
如果辞典成功载入是否会在console里面显示呢?

已经尝试放再多个地方 可是切词看起来没有根据CustomDictionary.txt里面的关键字来切词
麻烦说明 谢谢!

@tabascobath
Copy link
Author

謝謝!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants