This repository was archived by the owner on Jan 24, 2024. It is now read-only.

vgg16 model randomly hits "Segmentation fault"  #76

@guochaorong

In the CE framework, vgg16 has hit a segmentation fault twice.

First job: http://18.222.34.7:8080/viewLog.html?buildId=1383&buildTypeId=Paddle_ContinuousEvaluation&tab=buildLog

Second job: http://180.76.57.222:8111/viewLog.html?buildId=118&buildTypeId=PaddleCe_CEBuild&tab=buildLog&_focus=7990

:19][Step 1/1] Pass: 0, Loss: 4.501836, Train Accuray: 0.000000
[17:55:19][Step 1/1] 
[17:55:19][Step 1/1] 
[17:55:19][Step 1/1] Total examples: 3040, total time: 68.43846, 44.41947 examples/sed
[17:55:19][Step 1/1] 
[17:55:19][Step 1/1] *** Aborted at 1531245319 (unix time) try "date -d @1531245319" if you are using GNU date ***
[17:55:19][Step 1/1] PC: @                0x0 (unknown)
[17:55:19][Step 1/1] *** SIGSEGV (@0x58) received by PID 4890 (TID 0x7fbc3a8c7700) from PID 88; stack trace: ***
[17:55:19][Step 1/1]     @     0x7fbcc2fe37e0 (unknown)
[17:55:19][Step 1/1]     @     0x7fbcc32f650c PyEval_EvalFrameEx
[17:55:19][Step 1/1]     @     0x7fbcc32ff37d PyEval_EvalCodeEx
[17:55:19][Step 1/1]     @     0x7fbcc3276905 (unknown)
[17:55:19][Step 1/1]     @     0x7fbcc3244d33 PyObject_Call
[17:55:19][Step 1/1]     @     0x7fbcc32fa0a2 PyEval_EvalFrameEx
[17:55:19][Step 1/1]     @     0x7fbcc32fce9e PyEval_EvalFrameEx
[17:55:19][Step 1/1]     @     0x7fbcc32fce9e PyEval_EvalFrameEx
[17:55:19][Step 1/1]     @     0x7fbcc32ff37d PyEval_EvalCodeEx
[17:55:19][Step 1/1]     @     0x7fbcc3276830 (unknown)
[17:55:19][Step 1/1]     @     0x7fbcc3244d33 PyObject_Call
[17:55:19][Step 1/1]     @     0x7fbcc325374d (unknown)
[17:55:19][Step 1/1]     @     0x7fbcc3244d33 PyObject_Call
[17:55:19][Step 1/1]     @     0x7fbcc32f5897 PyEval_CallObjectWithKeywords
[17:55:19][Step 1/1]     @     0x7fbcc3341f32 (unknown)
[17:55:19][Step 1/1]     @     0x7fbcc2fdbaa1 start_thread
[17:55:19][Step 1/1]     @     0x7fbcc269dbcd clone
[17:55:19][Step 1/1]     @                0x0 (unknown)
[17:55:19][Step 1/1] ./run.xsh: line 14:  4890 Segmentation fault      FLAGS_benchmark=true FLAGS_fraction_of_gpu_memory_to_use=0.0 python model.py --device=GPU --batch_size=32 --data_set=flowers --iterations=100 --gpu_id=$cudaid
[17:55:20][Step 1/1] 4887
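
For triage, the two `***` lines in the log above can be decoded mechanically. The sketch below is only an illustration: `summarize_crash` and the regular expressions are hypothetical helpers written for this log format, not part of Paddle or the CE framework. It converts the abort time to UTC and pulls out the faulting address, PID, and TID.

```python
import re
from datetime import datetime, timezone

# Patterns written against the two "***" lines in this log (assumed format).
ABORT_RE = re.compile(r"\*\*\* Aborted at (\d+) \(unix time\)")
SEGV_RE = re.compile(
    r"\*\*\* SIGSEGV \(@(0x[0-9a-fA-F]+)\) received by PID (\d+) "
    r"\(TID (0x[0-9a-fA-F]+)\) from PID (\d+)"
)


def summarize_crash(log_text):
    """Hypothetical helper: extract the crash time and fault details."""
    summary = {}
    match = ABORT_RE.search(log_text)
    if match:
        aborted = datetime.fromtimestamp(int(match.group(1)), tz=timezone.utc)
        summary["aborted_at_utc"] = aborted.isoformat()
    match = SEGV_RE.search(log_text)
    if match:
        summary["fault_address"] = int(match.group(1), 16)
        summary["pid"] = int(match.group(2))
        summary["tid"] = int(match.group(3), 16)
        summary["reported_sender_pid"] = int(match.group(4))
    return summary


# The two relevant lines, copied from the log in this issue.
SAMPLE = (
    '[17:55:19][Step 1/1] *** Aborted at 1531245319 (unix time) try '
    '"date -d @1531245319" if you are using GNU date ***\n'
    "[17:55:19][Step 1/1] *** SIGSEGV (@0x58) received by PID 4890 "
    "(TID 0x7fbc3a8c7700) from PID 88; stack trace: ***\n"
)

if __name__ == "__main__":
    print(summarize_crash(SAMPLE))
```

The faulting address `0x58` is a small offset from zero, which usually points to a field access through a null pointer. The "from PID 88" value is most likely not a real sender: 88 is just `0x58` in decimal, since for SIGSEGV that field shares storage with the fault address.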

Both crashes occurred in the final prediction (test) stage:

        if args.with_test:
            pass_test_acc = test(exe)
        break

Model code:
https://github.com/PaddlePaddle/paddle-ce-latest-kpis/blob/master/vgg16/model.py
