Description
Describe the bug
I get the following error when I try to run the example of creating a report:
error happended in column:PassengerId
Traceback (most recent call last):
File "", line 1, in
File "/home/user/anaconda3/envs/test-data-prep/lib/python3.8/site-packages/dataprep/eda/create_report/init.py", line 68, in create_report
"components": format_report(df, cfg, mode, progress),
File "/home/user/anaconda3/envs/test-data-prep/lib/python3.8/site-packages/dataprep/eda/create_report/formatter.py", line 76, in format_report
comps = format_basic(edaframe, cfg)
File "/home/user/anaconda3/envs/test-data-prep/lib/python3.8/site-packages/dataprep/eda/create_report/formatter.py", line 274, in format_basic
data, completions = basic_computations(df, cfg)
File "/home/user/anaconda3/envs/test-data-prep/lib/python3.8/site-packages/dataprep/eda/create_report/formatter.py", line 383, in basic_computations
variables_data = compute_variables(df, cfg)
File "/home/user/anaconda3/envs/test-data-prep/lib/python3.8/site-packages/dataprep/eda/create_report/formatter.py", line 318, in compute_variables
data[col] = cont_comps(df.frame[col], cfg)
File "/home/user/anaconda3/envs/test-data-prep/lib/python3.8/site-packages/dataprep/eda/distribution/compute/univariate.py", line 200, in cont_comps
data["chisq"] = chisquare(data["hist"][0])
File "/home/user/anaconda3/envs/test-data-prep/lib/python3.8/site-packages/dask/array/stats.py", line 136, in chisquare
return power_divergence(f_obs, f_exp=f_exp, ddof=ddof, axis=axis, lambda="pearson")
File "/home/user/anaconda3/envs/test-data-prep/lib/python3.8/site-packages/dask/array/stats.py", line 144, in power_divergence
if lambda not in scipy.stats.stats._power_div_lambda_names:
File "/home/user/anaconda3/envs/test-data-prep/lib/python3.8/site-packages/scipy/stats/stats.py", line 54, in getattr
raise AttributeError(
AttributeError: scipy.stats.stats is deprecated and has no attribute _power_div_lambda_names. Try looking in scipy.stats instead.
To Reproduce
from dataprep.datasets import load_dataset
df = load_dataset("titanic")
from dataprep.eda import create_report
report = create_report(df)
Expected behavior
To get the EDA report.
Desktop (please complete the following information):
- OS: Ubuntu 20.04.4 LTS
- Platform [Python script]
- Platform Version [PyCharm 2021.3.2 (Community Edition)]
- Python Version [3.8.12]
- Dataprep Version [0.4.2]
Additional context
I have tested in a fresh conda env with pip install dataprep. Here are the packages installed:
# Name Version Build Channel
_libgcc_mutex 0.1 main
_openmp_mutex 4.5 1_gnu
aiohttp 3.8.1 pypi_0 pypi
aiosignal 1.2.0 pypi_0 pypi
argon2-cffi 21.3.0 pypi_0 pypi
argon2-cffi-bindings 21.2.0 pypi_0 pypi
asttokens 2.0.5 pypi_0 pypi
async-timeout 4.0.2 pypi_0 pypi
attrs 21.4.0 pypi_0 pypi
backcall 0.2.0 pypi_0 pypi
bleach 4.1.0 pypi_0 pypi
bokeh 2.4.2 pypi_0 pypi
ca-certificates 2021.10.26 h06a4308_2
certifi 2021.10.8 py38h06a4308_2
cffi 1.15.0 pypi_0 pypi
charset-normalizer 2.0.12 pypi_0 pypi
click 8.0.4 pypi_0 pypi
cloudpickle 2.0.0 pypi_0 pypi
cycler 0.11.0 pypi_0 pypi
dask 2021.12.0 pypi_0 pypi
dataprep 0.4.2 pypi_0 pypi
debugpy 1.5.1 pypi_0 pypi
decorator 5.1.1 pypi_0 pypi
defusedxml 0.7.1 pypi_0 pypi
entrypoints 0.4 pypi_0 pypi
executing 0.8.2 pypi_0 pypi
flask 2.0.3 pypi_0 pypi
flask-cors 3.0.10 pypi_0 pypi
fonttools 4.29.1 pypi_0 pypi
frozenlist 1.3.0 pypi_0 pypi
fsspec 2022.2.0 pypi_0 pypi
idna 3.3 pypi_0 pypi
importlib-resources 5.4.0 pypi_0 pypi
ipykernel 6.9.1 pypi_0 pypi
ipython 8.1.0 pypi_0 pypi
ipython-genutils 0.2.0 pypi_0 pypi
ipywidgets 7.6.5 pypi_0 pypi
itsdangerous 2.1.0 pypi_0 pypi
jedi 0.18.1 pypi_0 pypi
jinja2 3.0.3 pypi_0 pypi
joblib 1.1.0 pypi_0 pypi
jsonpath-ng 1.5.3 pypi_0 pypi
jsonschema 4.4.0 pypi_0 pypi
jupyter-client 7.1.2 pypi_0 pypi
jupyter-core 4.9.2 pypi_0 pypi
jupyterlab-pygments 0.1.2 pypi_0 pypi
jupyterlab-widgets 1.0.2 pypi_0 pypi
kiwisolver 1.3.2 pypi_0 pypi
ld_impl_linux-64 2.35.1 h7274673_9
levenshtein 0.16.0 pypi_0 pypi
libffi 3.3 he6710b0_2
libgcc-ng 9.3.0 h5101ec6_17
libgomp 9.3.0 h5101ec6_17
libstdcxx-ng 9.3.0 hd4cf53a_17
locket 0.2.1 pypi_0 pypi
markupsafe 2.1.0 pypi_0 pypi
matplotlib 3.5.1 pypi_0 pypi
matplotlib-inline 0.1.3 pypi_0 pypi
metaphone 0.6 pypi_0 pypi
mistune 0.8.4 pypi_0 pypi
multidict 6.0.2 pypi_0 pypi
nbclient 0.5.11 pypi_0 pypi
nbconvert 6.4.2 pypi_0 pypi
nbformat 5.1.3 pypi_0 pypi
ncurses 6.3 h7f8727e_2
nest-asyncio 1.5.4 pypi_0 pypi
nltk 3.7 pypi_0 pypi
notebook 6.4.8 pypi_0 pypi
numpy 1.22.2 pypi_0 pypi
openssl 1.1.1m h7f8727e_0
packaging 21.3 pypi_0 pypi
pandas 1.4.1 pypi_0 pypi
pandocfilters 1.5.0 pypi_0 pypi
parso 0.8.3 pypi_0 pypi
partd 1.2.0 pypi_0 pypi
pexpect 4.8.0 pypi_0 pypi
pickleshare 0.7.5 pypi_0 pypi
pillow 9.0.1 pypi_0 pypi
pip 21.2.4 py38h06a4308_0
ply 3.11 pypi_0 pypi
prometheus-client 0.13.1 pypi_0 pypi
prompt-toolkit 3.0.28 pypi_0 pypi
ptyprocess 0.7.0 pypi_0 pypi
pure-eval 0.2.2 pypi_0 pypi
pycparser 2.21 pypi_0 pypi
pydantic 1.9.0 pypi_0 pypi
pygments 2.11.2 pypi_0 pypi
pyparsing 3.0.7 pypi_0 pypi
pyrsistent 0.18.1 pypi_0 pypi
python 3.8.12 h12debd9_0
python-crfsuite 0.9.7 pypi_0 pypi
python-dateutil 2.8.2 pypi_0 pypi
python-stdnum 1.17 pypi_0 pypi
pytz 2021.3 pypi_0 pypi
pyyaml 6.0 pypi_0 pypi
pyzmq 22.3.0 pypi_0 pypi
rapidfuzz 1.8.3 pypi_0 pypi
readline 8.1.2 h7f8727e_1
regex 2021.11.10 pypi_0 pypi
scipy 1.8.0 pypi_0 pypi
send2trash 1.8.0 pypi_0 pypi
setuptools 58.0.4 py38h06a4308_0
six 1.16.0 pypi_0 pypi
sqlite 3.37.2 hc218d9a_0
stack-data 0.2.0 pypi_0 pypi
terminado 0.13.1 pypi_0 pypi
testpath 0.6.0 pypi_0 pypi
tk 8.6.11 h1ccaba5_0
toolz 0.11.2 pypi_0 pypi
tornado 6.1 pypi_0 pypi
tqdm 4.62.3 pypi_0 pypi
traitlets 5.1.1 pypi_0 pypi
typing-extensions 4.1.1 pypi_0 pypi
varname 0.8.1 pypi_0 pypi
wcwidth 0.2.5 pypi_0 pypi
webencodings 0.5.1 pypi_0 pypi
werkzeug 2.0.3 pypi_0 pypi
wheel 0.37.1 pyhd3eb1b0_0
widgetsnbextension 3.5.2 pypi_0 pypi
wordcloud 1.8.1 pypi_0 pypi
xz 5.2.5 h7b6447c_0
yarl 1.7.2 pypi_0 pypi
zipp 3.7.0 pypi_0 pypi
zlib 1.2.11 h7f8727e_4