
Forward CoreInfo via a digest to the runtime #9002


Open · bkchr wants to merge 27 commits into master

Conversation

@bkchr (Member) commented Jun 26, 2025

Before this pull request we had the rather inflexible SelectCore type in parachain-system. It simply took the last byte of the block number as the core selector, which resulted in issues like #8893. While it was not totally static, it was very complicated to forward the needed information to the runtime. When running with block bundling (500ms blocks), multiple blocks are actually validated on the same core, and finding out the selector and offset without access to the claim queue is rather hard. The claim queue could be forwarded to the runtime, but that would waste PoV size, as we would need to include the entire claim queue of all parachains.

This pull request solves the problem by moving the entire core selection to the collator side. From there, the information is passed to the runtime via a PreRuntime digest. The CoreInfo contains the selector, claim_queue_offset and number_of_cores. Doing this on the collator side is fine as long as parachain slot durations are not lower than the relay chain slot duration. Since we have agreed that parachain slot durations are always equal to or greater than the relay chain's, this change should not cause any problems.
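To make the mechanism concrete, here is a minimal sketch of how such a pre-runtime digest could be assembled on the collator side. The engine ID, field types and function name are illustrative assumptions; the actual definitions live in cumulus and may differ:

use codec::{Decode, Encode};
use sp_runtime::DigestItem;

// Hypothetical consensus engine ID for the CoreInfo pre-runtime digest;
// the identifier actually used by this PR may differ.
const CORE_INFO_ENGINE_ID: [u8; 4] = *b"CORE";

/// Sketch of the payload the collator forwards to the runtime.
#[derive(Encode, Decode)]
struct CoreInfo {
	/// Which of the para's assigned cores was selected.
	selector: u8,
	/// Claim queue offset the block was built against.
	claim_queue_offset: u8,
	/// Total number of cores assigned to the para.
	number_of_cores: u16,
}

/// Collator side: wrap the SCALE-encoded payload in a `PreRuntime` digest
/// item so the runtime can decode it while executing the block.
fn core_info_digest(info: &CoreInfo) -> DigestItem {
	DigestItem::PreRuntime(CORE_INFO_ENGINE_ID, info.encode())
}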

Downstream users need to remove the SelectCore type from their parachain_system::Config:

- type SelectCore = ...;
+

Closes: #8893 #8906

@bkchr bkchr requested review from skunert and alindima June 26, 2025 15:41
@bkchr bkchr requested a review from a team as a code owner June 26, 2025 15:41
@bkchr bkchr added T0-node This PR/Issue is related to the topic “node”. T9-cumulus This PR/Issue is related to cumulus. labels Jun 26, 2025
@bkchr (Member, Author) commented Jun 27, 2025

/cmd fmt

@skunert (Contributor) left a comment

Overall changes look good.

I think it is worth mentioning that these changes have some impact on which component limits our block-building throughput. With the static CoreSelector we had before, we were always authoring at a fixed claim queue offset, so once our cores for that offset were used up, we would skip authoring. After this change, we are still limited by the slot_timer, which is updated based on the claim queue at offset 0. However, the main responsibility now lies with the velocity configured in the runtime: it needs to be set correctly to prevent excessive block production (see the sketch below).
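For context, the velocity limit lives in the runtime's consensus hook. A minimal sketch of the usual wiring via cumulus-pallet-aura-ext, with purely illustrative constant values (not the values of any particular chain):

use cumulus_pallet_aura_ext::FixedVelocityConsensusHook;

/// Relay chain slot duration in milliseconds.
const RELAY_CHAIN_SLOT_DURATION_MILLIS: u32 = 6000;
/// Maximum number of parachain blocks authored per relay chain block.
const BLOCK_PROCESSING_VELOCITY: u32 = 1;
/// Maximum number of para blocks awaiting inclusion at any time.
const UNINCLUDED_SEGMENT_CAPACITY: u32 = 3;

/// The hook a runtime plugs into `parachain_system::Config::ConsensusHook`;
/// it limits how many blocks may be authored per relay chain slot.
type ConsensusHook<Runtime> = FixedVelocityConsensusHook<
	Runtime,
	RELAY_CHAIN_SLOT_DURATION_MILLIS,
	BLOCK_PROCESSING_VELOCITY,
	UNINCLUDED_SEGMENT_CAPACITY,
>;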

Also, I quickly discussed with @bkchr the possibility of abusing the dynamic claim queue offset in a scenario where an elastic-scaling chain is configured for, say, 3 cores but has only 1 scheduled. In that scenario, the velocity and runtime constraints are too generous and allow block stealing from future authors.

One thing that is not yet clear to me (or I forgot) is the exact timing of backing:

  • If I build a block on claim queue offset 2, will this block be backed immediately, or only when this claim arrives at position 0 in two relay chain blocks?
  • If I build a block on claim queue offset 2, can the cores at offset 0 and 1 still be used, or are they "blocked" by the usage of offset 2? Intuitively, I would expect that we need to use the cores in order.

@alindima do you know the details of these points?

/// Determine the core for the given `para_id`.
///
/// Takes into account the `parent` core to find the next available core.
async fn determine_core<Header: HeaderT, RI: RelayChainInterface + 'static>(
Contributor: Can we have a test for this one?

relay_parent: &RelayHeader,
para_id: ParaId,
parent: &Header,
) -> Result<Option<(CoreSelector, ClaimQueueOffset, CoreIndex, u16)>, ()> {
Contributor: Would be nice to add a little doc on what the u16 is here.
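One way to make it self-documenting (a sketch: it assumes the u16 is the total number of cores assigned to the para, matching the number_of_cores forwarded in the digest, and the import path is likewise an assumption):

// Import path is an assumption for this sketch.
use cumulus_primitives_core::{ClaimQueueOffset, CoreIndex, CoreSelector};

/// Sketch: a named return type instead of the anonymous 4-tuple.
struct DeterminedCore {
	/// Selector to forward to the runtime.
	selector: CoreSelector,
	/// Claim queue offset the block will be built against.
	claim_queue_offset: ClaimQueueOffset,
	/// The concrete relay chain core the candidate targets.
	core_index: CoreIndex,
	/// Total number of cores assigned to the para (the previously
	/// undocumented `u16`).
	number_of_cores: u16,
}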

});

for (offset, cores) in offset_to_core_count {
if (offset as u32) < claim_queue_offset {
Contributor: Why bother adding items with offset < claim_queue_offset to the map in the first place?
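A sketch of that suggestion, with a stand-in iterator since the code that builds the map is not visible in this hunk:

use std::collections::BTreeMap;

/// Sketch: only count offsets at or beyond `claim_queue_offset`, so the
/// loop above no longer needs the `< claim_queue_offset` guard. `claims`
/// stands in for whatever `(offset, core)` iterator feeds the map in the
/// real code.
fn offsets_from(
	claims: impl Iterator<Item = (u8, u32)>,
	claim_queue_offset: u32,
) -> BTreeMap<u8, u32> {
	claims
		.filter(|(offset, _)| u32::from(*offset) >= claim_queue_offset)
		.fold(BTreeMap::new(), |mut map, (offset, _core)| {
			*map.entry(offset).or_default() += 1;
			map
		})
}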

Comment on lines 550 to 564
let res = if relay_parent_offset >
	core_info.as_ref().map(|ci| ci.claim_queue_offset).unwrap_or_default().0 as u32
{
	claim_queue.find_core(para_id, 0, 0)
} else {
	claim_queue.find_core(
		para_id,
		core_info.as_ref().map_or(0, |ci| ci.selector.0 as u32 + 1),
		core_info
			.as_ref()
			.map_or(0, |ci| ci.claim_queue_offset.0 as u32 - relay_parent_offset),
	)
};

Ok(res)
Contributor: I found this part a bit hard to digest. What do you think about this?

Suggested change (replacing the block above):

let (cores_claimed, queue_offset) = match core_info {
	Some(CoreInfo { selector, claim_queue_offset, .. })
		if relay_parent_offset <= claim_queue_offset.0 as u32 =>
		(selector.0 as u32 + 1, claim_queue_offset.0 as u32 - relay_parent_offset),
	_ => (0, 0),
};

Ok(claim_queue.find_core(para_id, cores_claimed, queue_offset))

?claimed_cores,
"Claimed cores.",
slot_timer.update_scheduling(
claim_queue
Contributor: Why not use number_of_cores?

@skunert (Contributor) commented Jul 9, 2025

One more thing to think about is backward compatibility. These changes are breaking, since older runtimes which use the CoreSelector runtime API are no longer compatible with this node. However, technically ES is already released and chains are able to use it.

@alindima (Contributor) commented Jul 9, 2025

> One more thing to think about is backward compatibility. These changes are breaking, since older runtimes which use the CoreSelector runtime API are no longer compatible with this node. However, technically ES is already released and chains are able to use it.

They can't yet use it, since the v2 receipts feature is not yet enabled (but will soon be). And even after it's enabled, they could only use it if they enabled the experimental-ump-signals compile feature (or implemented their own custom logic for sending UMP signals).

But it would indeed be worth thinking about the worst-case scenario if they did.

@alindima (Contributor) commented Jul 9, 2025

> If I build a block on claim queue offset 2, will this block be backed immediately, or only when this claim arrives at position 0 in two relay chain blocks?

If you also have a claim at offset 0 on the same core, it will be backed immediately.

> If I build a block on claim queue offset 2, can the cores at offset 0 and 1 still be used, or are they "blocked" by the usage of offset 2? Intuitively, I would expect that we need to use the cores in order.

You can only occupy the core if you have the full candidate chain up until the latest included candidate of the para. And you can only occupy the cores at offset 0.

Therefore, you can't occupy a core at offset 0 if it's not building on the latest included block (unless you have the full chain being backed right now at offset 0). So your intuition is right.
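A toy model of that rule, just to pin down the reasoning (hypothetical types, not the actual scheduler code):

/// Toy candidate: `parent` is the hash of the block it builds on.
struct Candidate {
	hash: u64,
	parent: u64,
}

/// A candidate can occupy a core (always at claim queue offset 0) only if
/// the pending candidates form an unbroken chain from it back to the
/// latest included block of the para.
fn can_occupy(latest_included: u64, pending: &[Candidate], candidate: &Candidate) -> bool {
	let mut parent = candidate.parent;
	// A well-formed chain is at most as long as the pending set, so bound
	// the walk to avoid looping on malformed input.
	for _ in 0..=pending.len() {
		if parent == latest_included {
			return true;
		}
		match pending.iter().find(|c| c.hash == parent) {
			Some(c) => parent = c.parent,
			None => return false,
		}
	}
	false
}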

@bkchr bkchr changed the title from "Forward CoreInfo via an inherent to the runtime" to "Forward CoreInfo via a digest to the runtime" on Aug 4, 2025
@bkchr bkchr requested a review from skunert August 5, 2025 18:14
@bkchr (Member, Author) commented Aug 5, 2025

/cmd prdoc --audience runtime_dev --bump major

@paritytech-workflow-stopper

All GitHub workflows were cancelled due to the failure of one of the required jobs.
Failed workflow url: https://github.com/paritytech/polkadot-sdk/actions/runs/16761436884
Failed job name: cargo-clippy

Successfully merging this pull request may close these issues.

CoreSelector wraparound causes some skipped blocks