Skip to content

data parser doesn't work #4

@zparcheta

Description

@zparcheta

the example from README.md python -m extras.make_dataset --parser brsp \ --input_parser mfcc --label_parser simple_char_parser returns the following error:

File "/home/zparcheta/anaconda2/lib/python2.7/runpy.py", line 174, in _run_module_as_main
"main", fname, loader, pkg_name)
File "/home/zparcheta/anaconda2/lib/python2.7/runpy.py", line 72, in _run_code
exec code in run_globals
File "/data/forked/asr-study/extras/make_dataset.py", line 32, in
regex=True)
File "utils/generic_utils.py", line 62, in get_from_module
(name, module, ', '.join(members.keys())))
KeyError: 'brsp not found in datasets*.\n Valid values are: dummy, sid, brsd, voxforge, lapsbm, cslu, datasetparser'

If I change brsp for brsd (which is the available parser in dataset folder) then

datasets.dataset_parser.BRSD: WARNING File /data/forked/asr-study/data/lapsbm/LapsBM-F019/LapsBM_0378.wav has a forbidden label: "acertou o alvo em quarenta e três por cento das suas chances". Skipping
Traceback (most recent call last):
File "/home/zparcheta/anaconda2/lib/python2.7/runpy.py", line 174, in _run_module_as_main
"main", fname, loader, pkg_name)
File "/home/zparcheta/anaconda2/lib/python2.7/runpy.py", line 72, in _run_code
exec code in run_globals
File "/data/forked/asr-study/extras/make_dataset.py", line 46, in
override=args.override)
File "datasets/dataset_parser.py", line 128, in to_h5
group = f.create_group(dataset)
File "/home/zparcheta/anaconda2/lib/python2.7/site-packages/h5py/_hl/group.py", line 52, in create_group
gid = h5g.create(self.id, name, lcpl=lcpl)
File "h5py/_objects.pyx", line 54, in h5py._objects.with_phil.wrapper (/home/ilan/minonda/conda-bld/h5py_1490028130695/work/h5py/_objects.c:2846)
File "h5py/_objects.pyx", line 55, in h5py._objects.with_phil.wrapper (/home/ilan/minonda/conda-bld/h5py_1490028130695/work/h5py/_objects.c:2804)
File "h5py/h5g.pyx", line 151, in h5py.h5g.create (/home/ilan/minonda/conda-bld/h5py_1490028130695/work/h5py/h5g.c:2929)
ValueError: Unable to create group (Name already exists)

The warning appears for each line of text and is skipping it.
How can I prepare data to training? I have already downloaded the data in data folder.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions