feat: export the data from a table to parquet files#1000
feat: export the data from a table to parquet files#1000v0y4g3r merged 15 commits intoGreptimeTeam:developfrom
Conversation
9a54859 to
3fe9f2f
Compare
Codecov Report
@@ Coverage Diff @@
## develop #1000 +/- ##
===========================================
- Coverage 86.18% 85.82% -0.37%
===========================================
Files 439 444 +5
Lines 63866 64459 +593
===========================================
+ Hits 55044 55321 +277
- Misses 8822 9138 +316
Flags with carried forward coverage won't be shown. Click here to find out more.
Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here. |
ed893da to
ab8258e
Compare
212ad30 to
bbfe8db
Compare
|
PTAL @MichaelScofield |
923e71e to
4076880
Compare
MichaelScofield
left a comment
There was a problem hiding this comment.
there're conflicts
|
@fengjiachun Maybe we can impl |
Good suggestion, I can implement it in a new PR |
4076880 to
f9c91c2
Compare
f9c91c2 to
ca4f564
Compare
This might be too small. 10 ~ 50 million should be better choices. |
I was worried about the high memory usage, I adjusted the default value to 5M, and in the future I think we should add an Option to configure it. |
|
Maybe it's time for us to find a way to support stream writes of SST files. Let me create an issue for this. |
Co-authored-by: Lei, HUANG <6406592+v0y4g3r@users.noreply.github.com>
Co-authored-by: Lei, HUANG <6406592+v0y4g3r@users.noreply.github.com>
Co-authored-by: fys <40801205+Fengys123@users.noreply.github.com>
* feat: copy table parser * feat: coopy table * chore: minor fix * chore: give stmt a more clearer name * chore: unified naming * chore: minor change * chore: add a todo * chore: end up with an empty file when occur an empty table * feat: format with copy table * feat: with options * chore: by cr * chore: default 5M rows per segment * Update src/datanode/src/sql/copy_table.rs Co-authored-by: Lei, HUANG <6406592+v0y4g3r@users.noreply.github.com> * Update src/datanode/src/sql/copy_table.rs Co-authored-by: Lei, HUANG <6406592+v0y4g3r@users.noreply.github.com> * Update src/datanode/src/error.rs Co-authored-by: fys <40801205+Fengys123@users.noreply.github.com> --------- Co-authored-by: Lei, HUANG <6406592+v0y4g3r@users.noreply.github.com> Co-authored-by: fys <40801205+Fengys123@users.noreply.github.com>
I hereby agree to the terms of the GreptimeDB CLA
What's changed and what's your intention?
To export the data from a table to a Parquet file, use the COPY statement.
If the data is too large, it will be automatically split into multiple files, one file per 5 million rows.
Checklist
Refer to a related PR or issue link (optional)