Get DataContractSerializer to behave nicely with unloadable AssemblyLoadContext #88791
Conversation
```csharp
    where TValue : class?
{
    private readonly ConcurrentDictionary<TKey, TValue> _fastDictionary = new();
    private readonly ConditionalWeakTable<TKey, TValue> _collectibleTable = new();
```
`ConditionalWeakTable` is pretty fast itself. Have you run any benchmarks to see if non-unloadable assemblies show an improvement with `ConcurrentDictionary`?
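A rough micro-benchmark along these lines could answer the question; the sketch below (not from the PR, and for real numbers BenchmarkDotNet would be the right tool) just times repeated hits against each table for a single pre-populated key:

```csharp
using System;
using System.Collections.Concurrent;
using System.Diagnostics;
using System.Runtime.CompilerServices;

var cd = new ConcurrentDictionary<Type, object>();
var cwt = new ConditionalWeakTable<Type, object>();
Type key = typeof(string);
object value = new();
cd[key] = value;
cwt.Add(key, value);

const int N = 10_000_000;

// Time the ConcurrentDictionary lookup fast path.
var sw = Stopwatch.StartNew();
for (int i = 0; i < N; i++) cd.TryGetValue(key, out _);
Console.WriteLine($"ConcurrentDictionary: {sw.ElapsedMilliseconds} ms");

// Time the ConditionalWeakTable lookup for comparison.
sw.Restart();
for (int i = 0; i < N; i++) cwt.TryGetValue(key, out _);
Console.WriteLine($"ConditionalWeakTable: {sw.ElapsedMilliseconds} ms");
```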
```csharp
}

internal sealed class ContextAwareDictionary<TKey, [DynamicallyAccessedMembers(DynamicallyAccessedMemberTypes.PublicParameterlessConstructor)] TValue>
    where TKey : Type
```
Does `TKey` have to be generic? You could just use `Type`.
```csharp
{
    if (!_collectibleTable.TryGetValue(t, out ret))
    {
        ret = f(t);
```
Running the delegate inside the lock is prone to deadlocks. Can you move it outside?
The delegate isn't necessarily a small amount of work, depending on your object model design. It's something you really want to avoid executing multiple times. For example, if you instantiate a DCS on each incoming request of a REST API, 100 concurrent requests could redo this expensive work 100 times and significantly hurt your first-request time.

Can you provide me information about it being prone to deadlocks? The code run by the delegate isn't going to do anything async, so any reentrance will occur on the same thread. I don't believe this specific scenario is able to deadlock, but I'm open to learning about ways it can deadlock that I might not be aware of.
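The "redo this expensive work" concern is easy to demonstrate: `ConcurrentDictionary.GetOrAdd` documents that its value factory runs outside the dictionary's internal locks, so concurrent callers racing on the same missing key can each invoke it, even though only one result is published. A small sketch (not the PR's code):

```csharp
using System;
using System.Collections.Concurrent;
using System.Threading;
using System.Threading.Tasks;

int factoryRuns = 0;
var cache = new ConcurrentDictionary<Type, object>();

Parallel.For(0, 100, _ =>
{
    cache.GetOrAdd(typeof(string), t =>
    {
        // Stand-in for expensive DataContract creation.
        Interlocked.Increment(ref factoryRuns);
        return new object();
    });
});

// cache holds exactly one entry, but factoryRuns may be greater than 1.
Console.WriteLine($"factory ran {factoryRuns} time(s) for {cache.Count} entry");
```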
You know better if you think that creating a DCS will not execute arbitrary code. But note that `ConcurrentDictionary.GetOrAdd` will not execute the delegate inside a lock, so in the most common case of non-unloadable assemblies there is still the possibility that the delegate will run many times. There is also `ConditionalWeakTable.GetValue` (which takes a `CreateValueCallback`) that you can use like `ConcurrentDictionary.GetOrAdd`.
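For reference, a minimal sketch of that `ConditionalWeakTable` API. Note it has the same caveat as `GetOrAdd`: under contention the callback may be invoked more than once for the same key, though only one created value is ever published.

```csharp
using System;
using System.Runtime.CompilerServices;

var table = new ConditionalWeakTable<Type, object>();

// GetValue behaves like a get-or-add: the callback runs only when the key
// is missing, and the entry does not keep the key (here, the Type) alive.
object value = table.GetValue(typeof(string), t => new object());

// A second call for the same key returns the already-published value.
object same = table.GetValue(typeof(string), t => new object());
Console.WriteLine(ReferenceEquals(value, same)); // True
```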
I agree; we should place a lock around `ConcurrentDictionary.GetOrAdd`. We don't need to check a second time if we are always holding the lock when adding, as that guarantees the prior add has completed before calling `GetOrAdd`, which will then act like a plain get.
```csharp
// Common case for collectible contexts
if (_collectibleTable.TryGetValue(t, out ret))
    return ret;
```
Need to hold the lock on lookup too
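Putting the review feedback together, the pattern the thread converges on looks roughly like the sketch below (names and structure are assumptions, not the PR's exact code): a lock-free fast path for the common non-collectible case, with both lookup and add for collectible contexts done under the lock so the expensive factory runs at most once per key.

```csharp
using System;
using System.Collections.Concurrent;
using System.Runtime.CompilerServices;

internal sealed class ContextAwareCacheSketch<TValue> where TValue : class
{
    private readonly ConcurrentDictionary<Type, TValue> _fastDictionary = new();
    private readonly ConditionalWeakTable<Type, TValue> _collectibleTable = new();
    private readonly object _lock = new();

    public TValue GetOrAdd(Type t, Func<Type, TValue> factory)
    {
        // Fast path: no lock for types already cached from non-collectible contexts.
        if (_fastDictionary.TryGetValue(t, out TValue? fast))
            return fast;

        lock (_lock)
        {
            // Holding the lock on both lookup and add guarantees the factory
            // runs at most once per key; no second check is needed because any
            // prior add completed before we acquired the lock.
            if (!t.Assembly.IsCollectible)
                return _fastDictionary.GetOrAdd(t, factory);

            if (_collectibleTable.TryGetValue(t, out TValue? weak))
                return weak;

            TValue created = factory(t);
            _collectibleTable.Add(t, created);
            return created;
        }
    }
}
```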
Closing in favor of #90437
The first commit in this draft PR adds tests to verify that DCS did not work with unloadable ALCs before, and does after.

The second commit is the net effect of PR 73893. As I was working through this, it became clear that that PR would not interfere with this ALC work, and actually helps because it moved one of our collections from using TypeHandle to nint. So I started with that as a base before adding on top. I propose we accept that PR, since it's been pretty well reviewed at this point.
The third commit is taking the change above and applying the same technique to DCJS.
Everything after is the new work to fix the ALC scenario. Hopefully it's easy to cherry-pick commits and apply them and fix nits after accepting the PR referenced above.
I ran through some super simple perf testing, just on my dev machine. The simple benchmark used to explore PR 73893 shows that the full PR here is on par with the perf gains for `GetId` that come from PR 73893. There is some jitter, especially at high concurrency, but the two PRs seem to mostly take turns with who comes out fastest in that simple benchmark. Both show marked improvement over the baseline .NET 6/7/8 numbers.

I also did an even simpler comparison of the various ways to keep an "array" of either strong or weak references to items that can be indexed with an integer. This is what helped me decide to use the named ValueTuple approach in this PR for `s_dataContractCache`/`ContextAwareIndex`. Obviously using a single array of only strong references was the fastest in all cases, but both the two-array approach and the array-of-pairs approach stood out from the other options. The overhead of each is negligible in 0-concurrent/0-weak-reference scenarios and keeps relative pace with the baseline as concurrency increases. Increasing the number of weak references does start to show additional overhead, but it's a price that is only paid for unloadable contexts, and we can't avoid it.
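To make the "array of pairs" idea concrete, here is a hedged sketch of that layout (all names are illustrative, not the PR's actual `s_dataContractCache` code): each integer-indexed slot is a named ValueTuple holding either a strong reference (non-collectible context) or a weak reference (collectible context, so the ALC can unload).

```csharp
using System;

internal static class ContractCacheSketch
{
    // One slot per contract id; exactly one field of the tuple is set per slot.
    private static readonly (object? Strong, WeakReference<object>? Weak)[] s_slots =
        new (object?, WeakReference<object>?)[64];

    public static void Set(int id, object contract, bool collectible)
    {
        // Collectible contexts get a weak reference so unloading isn't blocked.
        s_slots[id] = collectible
            ? (null, new WeakReference<object>(contract))
            : (contract, null);
    }

    public static bool TryGet(int id, out object? contract)
    {
        (object? strong, WeakReference<object>? weak) = s_slots[id];
        if (strong is not null)
        {
            contract = strong;
            return true;
        }
        // Weak slot: the target may already have been collected after unload.
        if (weak is not null && weak.TryGetTarget(out object? target))
        {
            contract = target;
            return true;
        }
        contract = null;
        return false;
    }
}
```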