-
Notifications
You must be signed in to change notification settings - Fork 367
Add index_col for spark IO reads. #769
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
@ueshin , @HyukjinKwon Could you take a look at this maybe if you available? :) |
Codecov Report
@@ Coverage Diff @@
## master #769 +/- ##
==========================================
- Coverage 93.83% 93.82% -0.01%
==========================================
Files 32 32
Lines 5744 5753 +9
==========================================
+ Hits 5390 5398 +8
- Misses 354 355 +1
Continue to review full report at Codecov.
|
4b7e673 to
c2ca3b4
Compare
1179fe3 to
5eec6b5
Compare
Softagram Impact Report for pull/769 (head commit: 0c05081)⭐ Change Overview
📄 Full report
Impact Report explained. Give feedback on this report to [email protected] |
ueshin
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, pending tests.
|
Thanks! merging. |
Resolves #765 , I applied the same logic(as worked on #769) to all of the functions mentioned in above issue. So when we work with spark IO read, and also know about index column name, now we can use these functions with index_col like below and avoid creation of default index: ```python >>> ks.read_parquet(path, index_col=['i32', 'i64']) f bhello i32 i64 0 1 6.0 people 1 2 7.0 yo ```

related with #765 , I've added index_col for spark IO reads
If we know the index column already, We can prevent the creation of a default index by explicitly typing an index column as function arguments.
For example, now we can use 'read_table' like below.
Currently only added to 'read_table' functions.
And If you think this way is okay, I'm going to create a PR with all the other functions.