neuralconvo.ini + missing data/train.enc #3
Comments
The .gitignore excluded his data folder from being checked in. |
How do you create the .enc files from the movie-dialogs corpus he put in the readme? EDIT: Okay, got them: https://github.com/suriyadeepan/datasets/tree/master/seq2seq/cornell_movie_corpus/ |
doh, can't perform git-lfs check for the repo properly because the account is over quota: |
To rebuild them: These files are just lines of text. I guess matching line numbers between enc and dec are conversation pairs. |
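If that guess is right, the pairing could be sketched roughly as below. This is a minimal illustration, assuming line i of the .enc file is the prompt and line i of the .dec file is the reply. The helper names `write_pairs` and `read_pairs` and the sample sentences are hypothetical and not part of the repo.

```python
import os
import tempfile


def write_pairs(pairs, enc_path, dec_path):
    """Write (prompt, reply) pairs so that line i of each file belongs together."""
    with open(enc_path, "w", encoding="utf-8") as enc, \
            open(dec_path, "w", encoding="utf-8") as dec:
        for prompt, reply in pairs:
            enc.write(prompt.strip() + "\n")
            dec.write(reply.strip() + "\n")


def read_pairs(enc_path, dec_path):
    """Read both files back and zip matching line numbers into pairs."""
    with open(enc_path, encoding="utf-8") as enc, \
            open(dec_path, encoding="utf-8") as dec:
        enc_lines = [line.rstrip("\n") for line in enc]
        dec_lines = [line.rstrip("\n") for line in dec]
    if len(enc_lines) != len(dec_lines):
        raise ValueError("enc and dec files must have the same number of lines")
    return list(zip(enc_lines, dec_lines))


# Hypothetical sample data, only for illustration.
sample = [("Hi, how are you?", "Fine, thanks."), ("Where are you going?", "Home.")]
tmp_dir = tempfile.mkdtemp()
enc_file = os.path.join(tmp_dir, "train.enc")
dec_file = os.path.join(tmp_dir, "train.dec")
write_pairs(sample, enc_file, dec_file)
pairs = read_pairs(enc_file, dec_file)
```

The length check matters because a single missing line in either file would silently shift every later pair.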
I had to grab that script manually, as I couldn't check out the repo as mentioned. It appears to be training now. |
Same here, had to correct a few issues with python 3.5 and the use of re.split that caused errors. |
What was the re.split issue? |
The problem is with this line (line 8, in get_id2line): it gives this error and will not let the file execute. How did you guys fix it? |
Got it: you have to run prepare_data.py in Python 2.7. You also have to uncomment the last lines so that prepare_seq2seq_files executes. Make sure you have a Python 2.7 environment for prepare_data.py, and remember that the assignment code is in Python 2.7 as well. |
@shlomis The re.split issue was that 're' cannot apply a bytes pattern to a string during training. I had to replace line 50 of data_utils.py: |
@Niko2756 I am able to run it on Python 3 with a minimal amount of tweaks (a few print statements to change and maybe a small error message to fix) |
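For readers hitting the same TypeError, here is one way such a fix could look. It is a sketch only, not the commenter's actual change. It assumes `_WORD_SPLIT` was originally compiled from a bytes literal, which would explain the error; the punctuation set in the pattern is also an assumption. The idea is to make the pattern and the input the same type (str).

```python
import re

# Assumption: the original pattern was compiled from a bytes literal (b"...").
# A str pattern is used here so that it can split str input under Python 3.
_WORD_SPLIT = re.compile(r"([.,!?\"':;)(])")


def basic_tokenizer(sentence):
    """Split a sentence into word and punctuation tokens."""
    if isinstance(sentence, bytes):
        # Decode so that the pattern and the input are both str.
        sentence = sentence.decode("utf-8")
    words = []
    for space_separated_fragment in sentence.strip().split():
        words.extend(re.split(_WORD_SPLIT, space_separated_fragment))
    # re.split leaves empty strings around captured punctuation; drop them.
    return [w for w in words if w]


tokens = basic_tokenizer("Hello, how are you?")
```

The alternative is to keep the bytes pattern and open the data files in binary mode, so that every sentence is bytes. Either way, the two types have to match.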
I think this could be related. |
Here is my fix for python 3:
and of course change print to print() |
@drewp - sorry, python newbie here. I get this error when ran
UPDATE - I switched to python2 and it ran successfully. |
I have been running the training for a week almost. currently |
No, it doesn't end... it's an infinite loop. You can stop it at any time and it should pick up the latest checkpoint. |
It doesn't end; however, your results aren't correct. Check that you're running the right Python and TensorFlow versions. You should also check in the .ini file that the train_dec line is correctly set to a .dec file. |
I am running with Python 2, I think. I did not know it was an infinite loop. Thank you. I checked the .ini file, and it looks fine. Also, how are you supposed to use test mode? Can I still use it as a chatbot (i.e. talk to it)? What is the output about? |
Can I test my chatbot? I'm getting this; why is it doing it?
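The .ini check suggested above can be automated. The following is a minimal sketch using the standard-library `configparser`; the sample text mirrors the [strings] keys quoted in the issue description, and the helper name `check_data_paths` is hypothetical. It reports paths with an unexpected suffix and paths that do not exist on disk.

```python
import configparser
import os

# Sample config text mirroring the keys quoted in the issue description.
EXAMPLE_INI = """
[strings]
mode = train
train_enc = data/train.enc
train_dec = data/train.dec
test_enc = data/test.enc
test_dec = data/test.enc
"""

# Assumption: encoder files end in .enc and decoder files end in .dec.
EXPECTED_SUFFIX = {
    "train_enc": ".enc",
    "train_dec": ".dec",
    "test_enc": ".enc",
    "test_dec": ".dec",
}


def check_data_paths(ini_text, section="strings"):
    """Return a list of human-readable problems found in the data paths."""
    parser = configparser.ConfigParser()
    parser.read_string(ini_text)
    problems = []
    for key, suffix in EXPECTED_SUFFIX.items():
        path = parser.get(section, key, fallback=None)
        if path is None:
            problems.append("%s is missing" % key)
            continue
        if not path.endswith(suffix):
            problems.append("%s = %s does not end in %s" % (key, path, suffix))
        if not os.path.isfile(path):
            problems.append("%s = %s does not exist" % (key, path))
    return problems


problems = check_data_paths(EXAMPLE_INI)
for problem in problems:
    print(problem)
```

Run from the repo root, a check like this would also surface the missing data/ folder before training starts, rather than as a NotFoundError partway through.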
Reading model parameters from working_dir/seq2seq.ckpt-266100
|
I trained using the default values in the seq2seq.ini file, shown below. After the checkpoint at 16200 (Reading model parameters from working_dir/seq2seq.ckpt-16200), I always get responses like _UNK _UNK _UNK _UNK _UNK _UNK _UNK _UNK _UNK _UNK _UNK _UNK _UNK
[strings]
working_directory = working_dir/
[ints]
enc_vocab_size = 20000
num_layers = 3
layer_size = 256
max_train_data_size = 0
[floats]
Any help would be appreciated. Thanks. |
Has anyone tried training the neuralConvo.ini model? How do we test it? Should we enter the questions listed in test.enc and expect the predicted output from test.dec, or is there another way to test it? |
Did you read the introductory TensorFlow tutorial, @hariom-yadaw? It is at https://www.tensorflow.org/tutorials/seq2seq/ and roughly explains the .ini settings. I had the UNK issue as well, and could not get to a stage where you would have something like a conversational experience, no matter whether I ran a few or a lot (> 2 hrs) of training iterations. |
@2075 Yes, I had gone through the tutorial. I also trained it overnight (> 12 hrs), but the UNK issue is always there. Can you please explain how to overcome this issue? Thanks |
I only get "facing Klein Chub Chub Chub Chub Strip Strip Strip Strip" :( |
@hariom-yadaw In my case, with 3 layers and a layer size of 256, I get somewhat usable results after more than a day of training. Before that I tried smaller layer sizes and fewer and more layers, but my dual GTX 690 quad SLI setup cannot crunch all of it. As long as your training sources are good, you should get some kind of result. The longer I train, the fewer UNK replies come back; it is far from a real conversation, but a worthwhile test. |
@2075 I was following the details below about LSTM networks. I'm not sure what num_layers = 3 and layer_size = 256 refer to here. I want to play with these parameters, which are related to network size, but I don't have a clear understanding of them. Can you (or anyone else) please explain them and how they affect performance? Thanks! |
I am having issues with the TensorFlow chatbot and was wondering if I could get pointed in the right direction. When running execute.py, I get the error
Traceback (most recent call last): Process finished with exit code 1 I also tried the suggested corrections and still nothing |
@jonsanti Check out the issue I started: #34 |
@Denisolt - thanks |
I fixed the tensorflow.model import error by downloading the models module of TensorFlow and changing the reference to "tensorflow.models.tutorials.rnn", which is the correct path. |
@drewp But there is no |
Changed the readme to include that we have to put the train and test encoder and decoder files in the data folder otherwise this does not run and throws an error as per the closed issue llSourcell#3. Added Dropbox links to download the files.
'module' object has no attribute 'seq2seq' |
#61 gaoshuming |
pywrap_tensorflow.TF_GetCode(status)) NotFoundError: NewRandomAccessFile failed to Create/Open: data/train.enc : The system cannot find the path specified. |
Hello Dears, |
How to solve the UNK problem? |
Hi guys, do you face this problem? |
TypeError: sampled_loss() got an unexpected keyword argument 'logits' |
Hi, I am getting this error. Can someone help me with it? |
On Windows you can fix this by getting a piece of software called Miniconda, installing it on your system, and then creating a Python 2.7 environment to run the importer for the movie pack. |
also noteworthy - https://github.com/vyraun/chatbot-MemN2N-tensorflow |
I'm able to run python execute.py after making changes to data_utils.py and seq2seq_model.py. It has started training the model, and it tests once mode is changed to test. Thanks to Chrisfauerbach for the correction. |
Hi guys, can someone help me fix this problem: >> Mode : test Traceback (most recent call last): |
@uccmen you have to change |
I get the error below. How should I fix it? File "D:\Anaconda\lib\site-packages\tensorflow\python\framework\errors_impl.py", line 466, in raise_exception_on_not_ok_status NotFoundError: NewRandomAccessFile failed to Create/Open: data/train.enc : The system cannot find the path specified. |
Preparing data in working_dir/ Can anyone help? |
The following change fixed it for me: #Fix for Error: TypeError: sampled_loss() got an unexpected keyword argument 'logits' #def sampled_loss(inputs, labels): |
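The error message says that sampled_loss() is being called with a keyword named `logits` that its signature does not accept, presumably because the installed TensorFlow version passes the arguments by keyword. The mechanism can be reproduced without TensorFlow. In the sketch below, `call_loss` is a hypothetical stand-in for whatever library code invokes the loss function, and both loss functions are dummies that only echo their arguments. It is not the commenter's actual patch.

```python
def call_loss(loss_fn, labels, logits):
    """Hypothetical stand-in for library code that calls the loss by keyword."""
    return loss_fn(labels=labels, logits=logits)


def sampled_loss_old(inputs, labels):
    # Old-style signature: it has no parameter named 'logits'.
    return ("old", inputs, labels)


def sampled_loss_new(labels, logits):
    # Parameter names match the keywords used by the caller.
    return ("new", labels, logits)


try:
    call_loss(sampled_loss_old, labels=[1], logits=[0.5])
    error_message = None
except TypeError as exc:
    # Same kind of failure as reported in this thread.
    error_message = str(exc)

result = call_loss(sampled_loss_new, labels=[1], logits=[0.5])
print(error_message)
print(result)
```

In the real seq2seq_model.py, the renamed function would still have to forward its arguments to the underlying TensorFlow loss in the order that version expects, which the sketch does not cover.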
Dear Friends, Gurus and Experts, I have one general question about creating a bot using this code. In this exercise we have a knowledge set with train.enc having one part of the conversation and train.dec having the other part (replies) of the conversation to train the bot. Any help or a pointer in this regard will be very much appreciated. |
The neuralconvo.ini specifies the following files:
[strings]
Mode : train, test, serve
mode = train
train_enc = data/train.enc
train_dec = data/train.dec
test_enc = data/test.enc
test_dec = data/test.enc
But there is no data folder in the repo.
There is the working_dir.
python3 execute.py
Preparing data in working_dir/
Tokenizing data in data/train.enc
Traceback (most recent call last):
File "execute.py", line 313, in <module>
train()
File "execute.py", line 127, in train
enc_train, dec_train, enc_dev, dec_dev, _, _ = data_utils.prepare_custom_data(gConfig['working_directory'],gConfig['train_enc'],gConfig['train_dec'],gConfig['test_enc'],gConfig['test_dec'],gConfig['enc_vocab_size'],gConfig['dec_vocab_size'])
File "/Users/johndpope/Documents/gitWorkspace/tensorflow_chatbot/data_utils.py", line 137, in prepare_custom_data
data_to_token_ids(train_enc, enc_train_ids_path, enc_vocab_path, tokenizer)
File "/Users/johndpope/Documents/gitWorkspace/tensorflow_chatbot/data_utils.py", line 121, in data_to_token_ids
normalize_digits)
File "/Users/johndpope/Documents/gitWorkspace/tensorflow_chatbot/data_utils.py", line 100, in sentence_to_token_ids
words = basic_tokenizer(sentence)
File "/Users/johndpope/Documents/gitWorkspace/tensorflow_chatbot/data_utils.py", line 50, in basic_tokenizer
words.extend(re.split(_WORD_SPLIT, space_separated_fragment))
File "/usr/local/Cellar/python3/3.5.2_1/Frameworks/Python.framework/Versions/3.5/lib/python3.5/re.py", line 203, in split
return _compile(pattern, flags).split(string, maxsplit)
TypeError: cannot use a bytes pattern on a string-like object