Fix space leaks in dependency solver logging. #2914

grayjay · 2015-11-06T07:39:47Z

This commit removes references to the solver log that prevented it from being garbage collected. It also forces evaluation of the current level and variable stack in Message.showMessages.
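As an illustration of the second point, here is a minimal sketch of a log renderer that forces its level and variable stack at every step while still producing output lazily. The `Msg` type and `render` function are simplified, hypothetical stand-ins, not the solver's actual `Message` type or the code in this PR.

```haskell
{-# LANGUAGE BangPatterns #-}

-- Hypothetical, simplified message type. The real solver's Message type
-- has more constructors and carries package and flag information.
data Msg
  = Enter        -- descend one level in the search tree
  | Leave        -- go back up one level and drop the current variable
  | Next String  -- start working on a new variable
  | Note String  -- any other log line

-- Render a (possibly very long) log lazily, one line at a time.
render :: [Msg] -> [String]
render = go [] 0
  where
    -- The bang patterns evaluate the variable stack and the level to weak
    -- head normal form on every call, so a long run of Enter/Leave messages
    -- cannot build up a chain of unevaluated (l + 1) or (drop 1 vs) thunks.
    go :: [String] -> Int -> [Msg] -> [String]
    go !_  !_ []            = []
    go !vs !l (Enter  : ms) = go vs (l + 1) ms
    go !vs !l (Leave  : ms) = go (drop 1 vs) (l - 1) ms
    go !vs !l (Next v : ms) = go (v : vs) l ms
    go !vs !l (Note s : ms) = line : go vs l ms
      where
        line = "[" ++ show l ++ "] " ++ concat (take 1 vs) ++ ": " ++ s
```

Because each output line is returned before the rest of the input is examined, a consumer that prints lines as they arrive never needs the whole log in memory, provided nothing else holds a reference to the start of the list.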

There is still a space leak in backjumping, so I profiled without backjumping enabled by merging in this branch: https://github.com/grayjay/cabal/tree/no-backjumping. I stopped each of these runs after a minute. When solving for fewer packages, the run times on my branch and master were similar.

EDIT: I compiled with GHC 7.10.2 and ran:

    cabal --ignore-sandbox install --dry-run --max-backjumps -1 hackage-server-0.4 aeson yesod --with-compiler C:/ghc-7.6.3_64/bin/ghc +RTS -p -s -h -L50

[Heap profile images for master (bbc3638), this branch (90c7772), and this branch without backjumping are not preserved.]

kosmikus · 2015-11-06T07:44:40Z

Wow. Thank you! This looks extremely promising.

You did not (intend to) change any of the actual behavior, right?

Do you think there's any way we could add some kind of regression test for this? Or even a semi-manual test? But at least something that makes it easy to test after further solver changes that the space behavior has not become worse again?

grayjay · 2015-11-06T22:57:00Z

@kosmikus Thanks! My goal was to keep the existing behavior but avoid converting the Progress to and from a list of messages paired with a result.
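For readers unfamiliar with the type mentioned above, the sketch below uses a local definition modeled on cabal-install's `Progress`: a stream of steps that ends in either a failure or a result. `mapSteps` and `toPair` are hypothetical names chosen for illustration and are not taken from this PR. The first processes the log in place. The second is the kind of round trip through a list paired with a result that the comment describes avoiding.

```haskell
-- Local definition for illustration, modeled on cabal-install's Progress.
data Progress step fail done
  = Step step (Progress step fail done)
  | Fail fail
  | Done done

-- Consume a Progress value directly, one constructor at a time.
foldProgress :: (step -> a -> a) -> (fail -> a) -> (done -> a)
             -> Progress step fail done -> a
foldProgress step failed done = go
  where
    go (Step s p) = step s (go p)
    go (Fail f)   = failed f
    go (Done r)   = done r

-- Transform every step while keeping the streaming structure intact.
-- (Hypothetical helper name.)
mapSteps :: (s -> t) -> Progress s f d -> Progress t f d
mapSteps f = foldProgress (Step . f) Fail Done

-- Split a Progress into a list of steps paired with the final outcome.
-- (Hypothetical helper name.) The lazy pattern keeps the list productive,
-- but the outcome is only known once the whole list has been walked, so
-- code that holds on to the pair can keep the log reachable.
toPair :: Progress s f d -> ([s], Either f d)
toPair = foldProgress
  (\s ~(ss, r) -> (s : ss, r))
  (\f -> ([], Left f))
  (\d -> ([], Right d))
```

With `mapSteps`, each step can be handled and dropped as soon as the consumer moves past it. Whether a given use of the pair form actually leaks depends on how the two halves are consumed, so this is only one way such a conversion can retain memory.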

I'm not familiar with regression testing for performance, especially comparing across versions. I mainly tested this change by checking that the heap profile leveled off after cabal finished reading the package index. A test program could at least isolate the solver and use a very small set of unchanging packages. Then a space leak would have a much larger effect on maximum residency, and we might be able to use a fixed cutoff. We could choose a difficult test and run it for a limited number of steps, so that run time isn't affected by other changes to the solver, like goal order. I'm not sure how to test for increases in constant memory use, though.

grayjay · 2015-11-11T07:38:41Z

I started writing a benchmark, and it seems to detect these space leaks reliably. It runs cabal on a .cabal file with a lot of flags but no actual dependencies. The cabal.config file points to an empty package index to avoid reading the index from Hackage into memory. Then the test checks the memory use after a fixed number of backjumps with +RTS -t --machine-readable. I had to use the average, instead of the maximum, because there was often a spike at the beginning of the run.
https://github.com/grayjay/cabal/tree/solver-benchmarks .
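The check described above could be sketched roughly as follows. This is not the code from the benchmark branch. It assumes the statistics text is a Haskell-syntax list of string pairs containing an `average_bytes_used` key, which is my understanding of what `+RTS -t --machine-readable` emits, so verify that against the GHC version in use. The function names and the cutoff are hypothetical.

```haskell
import Text.Read (readMaybe)

-- Parse the statistics text into key/value pairs.
-- Assumption: the text is a Haskell-syntax list of (String, String) pairs.
parseStats :: String -> Maybe [(String, String)]
parseStats = readMaybe

-- Look up the average residency in bytes, if present and numeric.
averageBytesUsed :: String -> Maybe Integer
averageBytesUsed txt =
  parseStats txt >>= lookup "average_bytes_used" >>= readMaybe

-- Hypothetical pass/fail rule: the run passes when the average residency
-- stays at or below a fixed cutoff. Unparseable input counts as a failure.
withinCutoff :: Integer -> String -> Bool
withinCutoff cutoff txt = maybe False (<= cutoff) (averageBytesUsed txt)
```

A test driver would capture the statistics from a solver run limited to a fixed number of backjumps and fail when `withinCutoff` returns `False`. Using the average rather than the maximum follows the comment above, since an early spike would otherwise dominate the measurement.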

BardurArantsson · 2016-01-29T19:07:59Z

I looked through this a few days ago and it seemed reasonable to me. That said... a lot of the code complexity/churn here seems to arise from not using a special-purpose monad for the solver (and/or logging its doings) -- of course doing that would be a huge refactoring, but would it make sense?

Regardless, I don't think that this should have to wait for a method-of-doing-automated-performance-regression-testing. You've provided ample evidence that this concrete PR fixes a leak. (Don't get me wrong... if you feel like pursuing the automated-performance-regression-testing thing, then I'd be very excited about that!)

grayjay · 2016-01-29T21:09:32Z

@BardurArantsson Thanks for the feedback. I'm not sure I understand the monad approach. The solver creates the log when it explores the tree, so it's only logging the last step. Earlier steps just leave information in the tree, such as pruned nodes, so that it can be included in the log. Then there are several log-processing steps. I think it would be nice to move some of that processing into the exploration step, where it might be easier. It would help with #2917 (2nd bullet point in #2917 (comment)). Also, we could remove the variable stack here if we could record the current variable during exploration:

cabal/cabal-install/Distribution/Client/Dependency/Modular/Message.hs, line 50 in da8caf4:

    go :: [Var QPN] -> Int -> [Message] -> [String]

Even though I tested this PR, I'd really like to add some solver performance tests. It's too easy to accidentally introduce performance problems. I have a test for space leaks, but I'm not sure how it could fit into the project. Testing speed would be even more useful. Do you have any ideas for testing?

BardurArantsson · 2016-01-29T22:07:31Z

@grayjay It really was just an off-hand "wouldn't this be nice?" comment, so REALLY don't worry about it :).

AFAIU the PR is absolutely fine. (But I stress that my understanding of the solver is limited... very limited. Just FTR)

grayjay · 2016-01-29T22:20:43Z

I was just curious because I had some ideas for refactoring that area of code, and I used a State monad in #2917. It was mostly for supporting the new feature, though.

BardurArantsson · 2016-02-04T16:36:10Z

@23Skidoo What do you think about just merging this given the arguments above? I think adding performance regression tests would be a different PR.

23Skidoo · 2016-02-04T16:45:34Z

@BardurArantsson Yes, I want to see this and #2916 in 1.24; I just need to do a review first.

> I think adding performance regression tests would be a different PR.

This is fine with me.

23Skidoo · 2016-02-20T22:06:23Z

Since this PR only touches cabal-install code, I'm postponing it for the 1.24.1 release.

kosmikus · 2016-03-03T11:49:42Z

I'm looking at this (and #2916) right now.

kosmikus · 2016-03-03T20:18:24Z

Still looking. Have been running a few performance comparisons, and they all take a long time to complete. But in general, it is all looking very good so far.

kosmikus · 2016-03-04T13:26:52Z

This one definitely looks good to me. Still looking at #2916.

BardurArantsson · 2016-03-04T20:58:37Z

Excellent to have this merged! I may revisit the solver split this weekend (but in a slightly more step-wise way)... @kosmikus Are you planning any big changes to the solver soon?

My revised approach will be to start by generalizing various types to have the package location be a type parameter. Unless I've missed some drastic recent developments this will allow breaking the dependency from the modular solver to cabal-install... which will be the second step.

kosmikus · 2016-03-04T21:12:09Z

Yes, @grayjay, let me say again that I am very very sorry for not dealing with these PRs for so long. These PRs are fantastic work. Thank you very much.

@BardurArantsson I am not generally opposed to the solver split (if we can work out the details). It's a bit difficult to say what I'm planning. It's the first time in ages I've managed to set aside a bit of time for work on this, and I will first try to look at outstanding PRs and issues. There are a few things I'd like to do myself, ultimately, but I don't think any of that will happen too soon. In any case, I'm aware of the ideas to separate out the solver, and I won't do anything major that isn't yet on the issue tracker in some form without coordinating with you.

BardurArantsson · 2016-03-04T21:14:49Z

Ok, thanks. That sounds great.

I'm very happy to discuss the details -- I think the first step should be pretty uncontroversial (and non-disruptive unlike my first attempt), so I'll try to get a PR done sometime this weekend and we can perhaps discuss it then.

EDIT: I'll make sure to /cc you on that.

kosmikus · 2016-03-04T21:43:52Z

@BardurArantsson Ok, sounds good. Thanks.

grayjay · 2016-03-04T22:09:56Z

@kosmikus No problem. Thank you for the thorough testing and review!

23Skidoo · 2016-03-04T22:45:11Z

Thanks, @grayjay!

BardurArantsson · 2016-03-05T11:56:24Z

@kosmikus First step filed as PR #3210

grayjay · 2016-03-05T23:51:52Z

I created an issue for the performance tests: #3211

kosmikus added cabal-install: other cabal-install: solver labels Nov 6, 2015

kosmikus self-assigned this Nov 6, 2015

grayjay mentioned this pull request Nov 8, 2015

Fix space leak in solver backjumping #2916

Merged

grayjay mentioned this pull request Dec 15, 2015

Use explicit export lists for modular solver #2924

Merged

grayjay force-pushed the solver-log-space-leaks branch 2 times, most recently from 5dfae2e to 3f199cb, December 22, 2015 09:41

Fix space leaks in dependency solver logging.

37f28f2

This commit removes references to the solver log that prevented it from being garbage collected. It also forces evaluation of the current level and variable stack in 'Message.showMessages'.

grayjay force-pushed the solver-log-space-leaks branch from 3f199cb to 37f28f2, January 17, 2016 22:00

23Skidoo added this to the cabal-install 1.24.1 milestone Feb 20, 2016

23Skidoo modified the milestones: cabal-install 1.24.1, cabal-install 1.24 Feb 23, 2016

23Skidoo mentioned this pull request Mar 4, 2016

Improve goal reorder heuristics. #3208

Merged

kosmikus added a commit that referenced this pull request Mar 4, 2016

Merge pull request #2914 from grayjay/solver-log-space-leaks

83425ec

Fix space leaks in dependency solver logging.

kosmikus merged commit 83425ec into haskell:master Mar 4, 2016

grayjay deleted the solver-log-space-leaks branch March 4, 2016 22:10

grayjay mentioned this pull request Mar 5, 2016

Dependency solver performance tests #3211

Open

mgsloan mentioned this pull request Mar 21, 2016

cabal-install uses a lot of memory during stack init --solver commercialhaskell/stack#1677

Closed
