-
Notifications
You must be signed in to change notification settings - Fork 46
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
关于preprocess.py预处理结果 #10
Comments
|
你好,当前版本针对lcsts 1.0。 |
针对原始文本抽取预处理的文件在githun没有放出来,楼主可以提供一下吗@shumingma |
原始文本的抽取,需要有什么格式么? 下面我尝试出现了错误。希望能得到指点。。。 |
楼上是在遍历的时候不支持张量。加一个tonumpy方法可以解决上述的问题 |
您好,请问您拿到抽取原始文本的文件了吗? |
我在prune函数里面for i in idx[:size]:前面加了idx = idx.numpy() |
谢谢楼主! |
您好,请问当前版本的preprocess.py是针对LCSTS2.0数据集吗?
(LCSTS2.0的数据文件中有大量<>tag,但似乎没有见到去除这些tag的操作?)
想了解一下您从LCSTS2.0到lcsts.low.share.train.pt的操作,谢谢!
(是因为在预处理其他数据集时发现,处理后的结果运行时报错)
The text was updated successfully, but these errors were encountered: