NC | Concurrency & refactoring | Add delay, version move checks and GPFS refactoring #8419

romayalon · 2024-09-29T15:59:14Z

Explain the changes

NamespaceFS -
1.1. Added to the retries a delay, the delay is the sum of a base of 70 + random(0,50). Delay is common when using retries mechanism, specifically in multithreading, if another request moved the files in a way that caused a failure to our current request, retrying right away might still have the issue if the second request didn't finish its work, therefore we wait a bit and let the other request finish its work. I saw that usually above 100 ms is good, and having an addition of random ms is nice to have.
1.2. moved is_gpfs ? check to inside _open_files_gpfs() function
1.3. _delete_single_object_versions() - added 2 calls to _check_version_moved() for checking if the version moved between the latest version location and .versions/ at the time of the deletion in order to make sure that we will delete the version even if it moved. the check will throw VERSION_MOVED error that will trigger a retry, in the next try we will locate the new location of the version and remove it.
1.4. _check_version_moved() function receives key, version_id and cur_path, if cur_path is the latest version location, we will check if it was moved to .versions/, if cur_path is in .versions/ we will check if the latest version has the same version_id as the version_id param.

Issues: Fixed #xxx / Gap #xxx

Fixed NSFS | Versioning | Concurrent delete latest & delete object by ID which is also the latest #8414

Testing Instructions:

sudo jest --testRegex=jest_tests/test_versioning_conc -t 'concurrent delete objects by version id/latest'

Doc added/updated
Tests added

src/sdk/namespace_fs.js

shirady · 2024-10-01T11:06:04Z

@romayalon Could you please add a short explanation about "Added to the retries a delay, the delay is the sum of a base of 70 + random(0,50)." in the PR description? (Why the delay solves the issue, Why we chose it, etc.).

src/test/unit_tests/jest_tests/test_versioning_concurrency.test.js

romayalon · 2024-10-01T11:23:02Z

@romayalon Could you please add a short explanation about "Added to the retries a delay, the delay is the sum of a base of 70 + random(0,50)." in the PR description? (Why the delay solves the issue, Why we chose it, etc.).

Delay is common when using retries mechanism, specifically in multithreading, if another request moved the files in a way that caused a failure to our current request, retrying right away might still have the issue if the second request didn't finish its work, therefore we wait a bit and let the other request finish its work. I saw that usually above 100 ms is good, and having an addition of random ms is nice to have.

shirady · 2024-10-01T12:41:39Z

@romayalon Could you please add a short explanation about "Added to the retries a delay, the delay is the sum of a base of 70 + random(0,50)." in the PR description? (Why the delay solves the issue, Why we chose it, etc.).

Delay is common when using retries mechanism, specifically in multithreading, if another request moved the files in a way that caused a failure to our current request, retrying right away might still have the issue if the second request didn't finish its work, therefore we wait a bit and let the other request finish its work. I saw that usually above 100 ms is good, and having an addition of random ms is nice to have.

@romayalon Why the base is 70 and not 100?

shirady

LGTM

src/sdk/namespace_fs.js

…PFS refactoring Signed-off-by: Romy <[email protected]>

pull-request-size bot added the size/L label Sep 29, 2024

romayalon force-pushed the romy-gpfs-refactoring branch 3 times, most recently from 6cd3731 to 15972de Compare October 1, 2024 07:07

romayalon marked this pull request as ready for review October 1, 2024 07:36

romayalon force-pushed the romy-gpfs-refactoring branch from 15972de to fe34ee2 Compare October 1, 2024 07:57

romayalon requested review from nadavMiz and shirady October 1, 2024 08:51

nadavMiz reviewed Oct 1, 2024

View reviewed changes

src/sdk/namespace_fs.js Show resolved Hide resolved

src/sdk/namespace_fs.js Outdated Show resolved Hide resolved

src/sdk/namespace_fs.js Outdated Show resolved Hide resolved

shirady reviewed Oct 1, 2024

View reviewed changes

src/test/unit_tests/jest_tests/test_versioning_concurrency.test.js Show resolved Hide resolved

romayalon requested review from nadavMiz and shirady October 1, 2024 12:21

romayalon force-pushed the romy-gpfs-refactoring branch from fe34ee2 to 46d8b9f Compare October 1, 2024 12:38

shirady approved these changes Oct 1, 2024

View reviewed changes

src/sdk/namespace_fs.js Outdated Show resolved Hide resolved

src/sdk/namespace_fs.js Show resolved Hide resolved

src/sdk/namespace_fs.js Outdated Show resolved Hide resolved

src/sdk/namespace_fs.js Show resolved Hide resolved

NC | Concurrency & refactoring | Add delay, version move checks and G…

bf7d5be

…PFS refactoring Signed-off-by: Romy <[email protected]>

romayalon force-pushed the romy-gpfs-refactoring branch from 2c23a0c to bf7d5be Compare October 6, 2024 15:13

romayalon merged commit 405905b into noobaa:master Oct 6, 2024
10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

NC | Concurrency & refactoring | Add delay, version move checks and GPFS refactoring #8419

NC | Concurrency & refactoring | Add delay, version move checks and GPFS refactoring #8419

Uh oh!

romayalon commented Sep 29, 2024 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

shirady commented Oct 1, 2024

Uh oh!

Uh oh!

romayalon commented Oct 1, 2024

Uh oh!

shirady commented Oct 1, 2024

Uh oh!

shirady left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

NC | Concurrency & refactoring | Add delay, version move checks and GPFS refactoring #8419

NC | Concurrency & refactoring | Add delay, version move checks and GPFS refactoring #8419

Uh oh!

Conversation

romayalon commented Sep 29, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Explain the changes

Issues: Fixed #xxx / Gap #xxx

Testing Instructions:

Uh oh!

Uh oh!

Uh oh!

Uh oh!

shirady commented Oct 1, 2024

Uh oh!

Uh oh!

romayalon commented Oct 1, 2024

Uh oh!

shirady commented Oct 1, 2024

Uh oh!

shirady left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

romayalon commented Sep 29, 2024 •

edited

Loading