Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

run vgg_16_cifar wrong. #63

Closed
zhuyong0000 opened this issue Sep 10, 2016 · 5 comments
Closed

run vgg_16_cifar wrong. #63

zhuyong0000 opened this issue Sep 10, 2016 · 5 comments
Labels

Comments

@zhuyong0000
Copy link

zhuyong0000 commented Sep 10, 2016

运行./train.sh 出现下面错误:
I0910 08:11:05.670004 1881 GradientMachine.cpp:134] Initing parameters..
I0910 08:11:06.422868 1881 GradientMachine.cpp:141] Init parameters done.
/usr/local/bin/paddle: line 46: 1881 Killed ${DEBUGGER} $MYDIR/../opt/paddle/bin/paddle_trainer ${@:2}
No data to plot. Exiting!

运行的环境是paddledev/paddle:cpu-demo-latest,paddle版本为:
PaddlePaddle 0.8.0b, compiled with
with_avx: OFF
with_gpu: OFF
with_double: OFF
with_python: ON
with_rdma: OFF
with_glog: ON
with_gflags: ON
with_metric_learning:
with_timer: OFF
with_predict_sdk:

@sss534534
Copy link

Maybe Out of Memory, See in /var/log/messages

@reyoung
Copy link
Collaborator

reyoung commented Sep 12, 2016

It says that Killed. So maybe just out of memory. Please

  • set the batch_size smaller
  • set dataprovider's pool size smaller.

We will give a documentation to guide how to use less memory to train the job.

@reyoung reyoung changed the title 运行vgg_16_cifar 错误 run vgg_16_cifar wrong. Sep 12, 2016
@zhangscth
Copy link

遇到了相同的问题。。

@reyoung
Copy link
Collaborator

reyoung commented Oct 8, 2016

@zhangscth https://github.com/baidu/Paddle/pull/128/files#diff-d718076d937b4f0e340765bf95c122c3

如果是killed的话,看一下如何减少内存占用。这块文档还在写

@qingqing01
Copy link
Contributor

@zhangscth @zhuyong0000 如何减少内存的文档:http://www.paddlepaddle.org/doc_cn/faq/index.html#id1

在这个demo中您可以尝试减少DataProvider缓存、或者减少batch size.

@reyoung reyoung closed this as completed Nov 29, 2016
qingqing01 pushed a commit to qingqing01/Paddle that referenced this issue Apr 30, 2020
thisjiang pushed a commit to thisjiang/Paddle that referenced this issue Oct 28, 2021
refactor Object.As to as to remind it lack type check
gglin001 pushed a commit to graphcore/Paddle-fork that referenced this issue Dec 8, 2021
* add InstanceNorm and LayerNorm Ops

* add annotation

* transpose

* merge develop-ipu

* merge develop-ipu

* add GetOutputNode

* delete layernorm from backend

* groupnorm, instancenorm , layernorm

* transpose, reshape

* reshape, transpose unitest

* pre-commit
wangxicoding pushed a commit to wangxicoding/Paddle that referenced this issue Dec 9, 2021
* Update express_ner example

* update run_bigru_crf

* fix msra_ner example
zhoutianzi666 pushed a commit to zhoutianzi666/Paddle that referenced this issue May 23, 2022
AnnaTrainingG pushed a commit to AnnaTrainingG/Paddle that referenced this issue Sep 19, 2022
* Update README
* Change docs dir to en_US and zh_CN
* Add paper reference
* Fix link
zmxdream pushed a commit to zmxdream/Paddle that referenced this issue Oct 10, 2023
expand slot's feasign for cvr model in fused_cvm_op
lizexu123 pushed a commit to lizexu123/Paddle that referenced this issue Feb 23, 2024
Fridge003 pushed a commit to Fridge003/Paddle that referenced this issue Mar 13, 2024
Implement iterator vars fetching in ReduceOp
Galaxy1458 added a commit that referenced this issue Apr 24, 2024
SigureMo added a commit that referenced this issue Apr 26, 2024
zmxdream pushed a commit to zmxdream/Paddle that referenced this issue May 16, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

5 participants