Fetcher cannot serialize unicode pages #10

@GiovanH

On pages containing Unicode symbols, self._response2namespace(response).dump(self._url2path(url)) fails with:

UnicodeEncodeError: 'charmap' codec can't encode character '\u2696' in position 6182: character maps to <undefined>

.dump() seems to be writing the YAML output with the platform's default encoding, when it needs to force UTF-8.
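
For reference, here is a minimal stdlib-only sketch of what I think is going on. It does not use the real .dump(); `dump_text` is a hypothetical stand-in that just writes already-serialized text to a file, and cp1252 is only my assumption for the default encoding (the 'charmap' wording is typical of Windows code pages). It shows the same character failing under a legacy code page and writing fine once the encoding is forced to UTF-8.

```python
import os
import tempfile

SCALES = "\u2696"  # the character named in the error above


def dump_text(text, path, encoding=None):
    """Hypothetical stand-in for a .dump() that writes serialized text to disk.

    With encoding=None, open() falls back to the platform default,
    which is where I suspect the real failure comes from.
    """
    with open(path, "w", encoding=encoding) as f:
        f.write(text)


def demo():
    payload = "title: " + SCALES + "\n"
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "page.yaml")

        # Assumed legacy default (cp1252): U+2696 has no mapping there.
        try:
            dump_text(payload, path, encoding="cp1252")
            legacy_failed = False
        except UnicodeEncodeError:
            legacy_failed = True

        # Forcing UTF-8 writes the same payload without error.
        dump_text(payload, path, encoding="utf-8")
        with open(path, encoding="utf-8") as f:
            round_trip = f.read()

    return legacy_failed, round_trip == payload


if __name__ == "__main__":
    print(demo())
```

If the real .dump() opens the target file itself, passing an explicit encoding="utf-8" at that open() call would probably be the smallest fix, but I have not checked how it is implemented.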
