Skip to content

Problem in nltkvid18.py (the charset shows unknown 8 bit in positive.txt and negative.txt) #3

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
pratimaupadhyay02 opened this issue May 26, 2017 · 1 comment

Comments

@pratimaupadhyay02
Copy link

Sir
When running nltkvid18.py, the following error is encountered:
UnicodeDecodeError: 'ascii' codec can't decode byte 0xed in position 6: ordinal not in range(128)
I am attaching the screenshot here
issue1
I found out that the files positive.txt and negative.txt show unknown charset type instead of ascii or utf-8 so the file was not able to read it.
I used " cat -v negative.txt > negative1.txt" and cat -v positive.txt > positive1.txt inorder to change the unknown charset type to ascii in a new file named positive1.txt and negative1.txt . After using these files it worked.

@harrypotter0
Copy link

Thanks it helped.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants