Skip to content

CRNN-CTC cannot download file data.tar.gz #897

@sfraczek

Description

@sfraczek

Problem description

I cannot download CRNN-CTC dataset https://github.com/PaddlePaddle/models/blob/develop/fluid/ocr_recognition/ctc_reader.py#L14.
Sever resets connection. I already retried on different networks and machines with the same result. Can you upload the file somewhere else? What is this file? Can I find it elsewhere?

Command run

python ctc_train.py

Output

λ 2b5c263aa3a5 /dataset/sfraczek/paddle-models/fluid/ocr_recognition {develop} python ctc_train.py
-----------  Configuration Arguments -----------
average_window: 0.15
batch_size: 32
eval_period: 15000
init_model: None
log_period: 1000
max_average_window: 15625
min_average_window: 10000
parallel: False
pass_num: 100
save_model_dir: ./models
save_model_period: 15000
use_gpu: True
------------------------------------------------
/usr/local/lib/python2.7/dist-packages/paddle/fluid/evaluator.py:64: Warning: The EditDistance is deprecated, because maintain a modified program inside evaluator cause bug easily, please use fluid.metrics.EditDistance instead.
  % (self.__class__.__name__, self.__class__.__name__), Warning)
file md5 b5df92d1c0ab0d585f4771e062a065e1 1de60d54d19632022144e4e58c2637b5
Cache file /root/.cache/paddle/dataset/ctc_data/data.tar.gz not found, downloading http://cloud.dlnel.org/filepub/?uuid=df937251-3c0b-480d-9a7b-0080dfeee65c
Traceback (most recent call last):
  File "ctc_train.py", line 150, in <module>
    main()
  File "ctc_train.py", line 146, in main
    train(args, data_reader=ctc_reader)
  File "ctc_train.py", line 52, in train
    train_list_file=train_list)
  File "/dataset/sfraczek/paddle-models/fluid/ocr_recognition/ctc_reader.py", line 166, in train
    data_dir = download_data()
  File "/dataset/sfraczek/paddle-models/fluid/ocr_recognition/ctc_reader.py", line 194, in download_data
    DATA_URL, CACHE_DIR_NAME, DATA_MD5, save_name=SAVED_FILE_NAME)
  File "/usr/local/lib/python2.7/dist-packages/paddle/v2/dataset/common.py", line 90, in download
    shutil.copyfileobj(r.raw, f)
  File "/usr/lib/python2.7/shutil.py", line 49, in copyfileobj
    buf = fsrc.read(length)
  File "/usr/local/lib/python2.7/dist-packages/requests/packages/urllib3/response.py", line 311, in read
    flush_decoder = True
  File "/usr/lib/python2.7/contextlib.py", line 35, in __exit__
    self.gen.throw(type, value, traceback)
  File "/usr/local/lib/python2.7/dist-packages/requests/packages/urllib3/response.py", line 244, in _error_catcher
    raise ProtocolError('Connection broken: %r' % e, e)
requests.packages.urllib3.exceptions.ProtocolError: ('Connection broken: IncompleteRead(15973 bytes read, 411 more expected)', IncompleteRead(15973 bytes read, 411 more expected))

@wanghaoshuang maybe you could help me?

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions