Migrate several indexing and compaction integration tests to embedded-tests #18207
base: master
ITPerfectRollupParallelIndexTest and ITBestEffortRollupParallelIndexTest to embedded-tests
…druid into add_embedded_perfect_rollup_test
...ests/src/test/java/org/apache/druid/testing/embedded/compact/EmbeddedAutoCompactionTest.java
...rc/test/java/org/apache/druid/testing/embedded/indexing/EmbeddedKafkaClusterMetricsTest.java
Bug: Concurrent append uses lock of type APPEND which always uses a lock version of epoch 1970-01-01. This can cause data loss in a flow as follows:
- Ingest data using an APPEND task to an empty interval
- Mark all the segments as unused
- Re-run the APPEND task
- Data is not visible since old segment IDs (now unused) are allocated again
Fix: In segment allocation, do not reuse an old segment ID, used or unused. This fix was already done for some cases back in #16380. An embedded test for this has been included in #18207.
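The flow described in this bug can be illustrated with a small, self-contained model. This is only a sketch under stated assumptions: `SegmentAllocationSketch` and all of its methods are hypothetical names, not Druid classes, and the model assumes that publishing an ID that already exists in the metadata store is a no-op, which is one way the re-ingested data could end up invisible.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical, simplified model of segment allocation. Not Druid's actual code.
class SegmentAllocationSketch
{
  // APPEND locks always use the epoch as the lock version (per the bug description).
  static final String APPEND_VERSION = "1970-01-01T00:00:00.000Z";

  // Every segment ID ever published, mapped to whether it is currently used.
  private final Map<String, Boolean> knownSegmentIds = new HashMap<>();

  static String segmentId(String dataSource, String interval, int partitionNum)
  {
    return dataSource + "_" + interval + "_" + APPEND_VERSION + "_" + partitionNum;
  }

  // Buggy variant: only used segments block an ID, so unused IDs get handed out again.
  String allocateIgnoringUnused(String dataSource, String interval)
  {
    int partitionNum = 0;
    while (Boolean.TRUE.equals(knownSegmentIds.get(segmentId(dataSource, interval, partitionNum)))) {
      partitionNum++;
    }
    return segmentId(dataSource, interval, partitionNum);
  }

  // Fixed variant: never reuse an old segment ID, used or unused.
  String allocate(String dataSource, String interval)
  {
    int partitionNum = 0;
    while (knownSegmentIds.containsKey(segmentId(dataSource, interval, partitionNum))) {
      partitionNum++;
    }
    return segmentId(dataSource, interval, partitionNum);
  }

  // Assumption of this sketch: publishing an already-known ID changes nothing.
  void publish(String id)
  {
    knownSegmentIds.putIfAbsent(id, true);
  }

  void markAllUnused()
  {
    knownSegmentIds.replaceAll((id, used) -> false);
  }

  boolean isUsed(String id)
  {
    return Boolean.TRUE.equals(knownSegmentIds.get(id));
  }

  public static void main(String[] args)
  {
    SegmentAllocationSketch store = new SegmentAllocationSketch();
    String interval = "2013-08-31/2013-09-01";
    String first = store.allocate("tiny_wiki", interval);
    store.publish(first);
    store.markAllUnused();
    System.out.println("buggy re-allocation: " + store.allocateIgnoringUnused("tiny_wiki", interval));
    System.out.println("fixed re-allocation: " + store.allocate("tiny_wiki", interval));
  }
}
```

In this model the buggy variant returns the same ID as the first run, so the second publish has no effect and the segment stays unused, while the fixed variant moves on to the next partition number.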
@@ -0,0 +1,3 @@
{"timestamp": "2013-08-31T01:02:33Z", "page": "Gypsy Danger", "language" : "en", "tags": ["t1", "t2"], "user" : "nuclear", "unpatrolled" : "true", "newPage" : "true", "robot": "false", "anonymous": "false", "namespace":"article", "continent":"North America", "country":"United States", "region":"Bay Area", "city":"San Francisco", "added": 57, "deleted": 200, "delta": -143}
drive-by comment/nit: I know this isn't new, but IMO we should stop referring to this dataset as 'wikipedia'. It is easily confused with the quickstart wikipedia data, which is also going to be used in some tests, and this data only has a vaguely similar schema. Maybe tiny-wikipedia, or something else that indicates it's a very small dataset, would help clear things up?
Thanks for the suggestion! I will rename these datasets accordingly.
InlineSchemaDataSourceCompactionConfig
    .builder()
    .forDataSource(dataSource)
    .withSkipOffsetFromLatest(Period.seconds(0))
    .withMaxRowsPerSegment(compactedMaxRowsPerSegment)
Check notice (Code scanning / CodeQL): Deprecated method or constructor invocation [Note] [test]
Summary
Changes
- TaskBuilder utility to create Task objects using fluent syntax
- embedded-tests
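To give a feel for what "fluent syntax" means for building task objects, here is a minimal sketch. It is not the `TaskBuilder` added in this PR: the class `FluentTaskSketch`, its method names, and the map-based task representation are all hypothetical and chosen only for illustration.

```java
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical fluent builder. The real TaskBuilder in this PR may look quite different.
final class FluentTaskSketch
{
  private final Map<String, Object> spec = new LinkedHashMap<>();

  private FluentTaskSketch(String type)
  {
    spec.put("type", type);
  }

  // Entry point: start building a task of the given type.
  static FluentTaskSketch ofType(String type)
  {
    return new FluentTaskSketch(type);
  }

  // Each setter returns `this` so that calls can be chained.
  FluentTaskSketch dataSource(String dataSource)
  {
    spec.put("dataSource", dataSource);
    return this;
  }

  FluentTaskSketch interval(String interval)
  {
    spec.put("interval", interval);
    return this;
  }

  FluentTaskSketch maxRowsPerSegment(int maxRowsPerSegment)
  {
    spec.put("maxRowsPerSegment", maxRowsPerSegment);
    return this;
  }

  // Validates required fields and returns an immutable copy of the spec.
  Map<String, Object> build()
  {
    if (!spec.containsKey("dataSource")) {
      throw new IllegalStateException("dataSource is required");
    }
    return Collections.unmodifiableMap(new LinkedHashMap<>(spec));
  }

  public static void main(String[] args)
  {
    Map<String, Object> task = FluentTaskSketch
        .ofType("index_parallel")
        .dataSource("tiny_wiki")
        .interval("2013-08-31/2013-09-01")
        .maxRowsPerSegment(1000)
        .build();
    System.out.println(task);
  }
}
```

The appeal of this style in tests is that each test states only the fields it cares about, in one readable chain, instead of editing a JSON template.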
Test migrations
- ITPerfectRollupParallelIndexTest -> EmbeddedIndexParallelTaskTest
- ITBestEffortRollupParallelIndexTest -> EmbeddedIndexParallelTaskTest, added as a new test parameter which uses dynamic partitioning
- ITAutoCompactionTest -> EmbeddedAutoCompactionTest
- ITAutoCompactionLockContentionTest -> EmbeddedAutoCompactionUpgradeTest
- ITAutoCompactionLockContentionTest -> EmbeddedKafkaClusterMetricsTest, method test_ingestClusterMetrics_compactionSkipsLockedIntervals()
- ITCompactionTaskTest -> EmbeddedCompactionTaskTest
- ITCompactionSparseColumnTest -> EmbeddedCompactionSparseColumnTest
- ITOverlordResourceTest -> EmbeddedOverlordClientTest
- ITOverlordResourceNotFoundTest -> EmbeddedOverlordClientTest
New nested tests
- EmbeddedCentralizedSchemaPublishFailureTest for the group cds-task-schema-publish-disabled
- EmbeddedCentralizedSchemaMetadataQueryDisabledTest for the group cds-coordinator-metadata-query-disabled
Test run times
- EmbeddedIndexParallelTaskTest (indexer)
- EmbeddedAutoCompactionTest (indexer)
- EmbeddedCompactionSparseColumnTest (indexer)
- ITPerfectRollupParallelIndexTest
- ITPerfectRollupParallelIndexTest (Indexer, shuffle deep store test, only 1 config changed)
- ITPerfectRollupParallelIndexTest (MM, shuffle deep store test, only 1 config changed)
- ITBestEffortRollupParallelIndexTest
- ITBestEffortRollupParallelIndexTest
- ITAutoCompactionTest (middle manager)
- ITAutoCompactionTest (indexer)
- ITCompactionSparseColumnTest (indexer)
This PR has: