rescue less #55

wfleming · 2015-12-17T22:34:45Z

The Analyzer #run loop was rescuing all exceptions, which is clearly
excessive. But I don't think we should hard-fail on all exceptions, either:
the Ruby parser gem is definitely known to barf on some valid but
esoteric Ruby code, and I wouldn't be surprised if the other parsers had
similar edge cases.

So this changes behavior to only skip files for a known set of catchable
errors. The out-of-process parsers are a little tricky since all you can
get from them is an exit code and output streams: for now wrapping any non-zero exit from them
in our own exception class seems reasonable. They're not effected by Java heap problems, and since we don't control the heap anymore, any potential memory problems should kill the whole container with OOM.

I'm still catching all exceptions, but only so that we can log a message
about which file is impacted before we re-raise. Since a raw exception
that will cause an abort may not contain helpful information about the file
that triggered it, this would be helpful for debugging cases.

Note that there is a similar case of an excessive rescue in the FileThreadPool. I'm investigating & fixing that separately.

Thoughts, @codeclimate/review ?

dblandin · 2015-12-17T22:40:34Z

lib/cc/engine/analyzers/command_line_runner.rb

+              err_output = stderr.gets
+              stderr.close
+
+              if 0 == exit_code


I noticed that there was a exit_code.to_i before the move. Is that still needed here?

Nope. Different invocation. This is already an int.

dblandin · 2015-12-17T22:40:52Z

This looks like a great approach 👍

wfleming · 2015-12-17T22:42:09Z

lib/cc/engine/analyzers/analyzer_base.rb

 module CC
  module Engine
    module Analyzers
      class Base
+        RESCUABLE_ERRORS = [


Opinion question for the team: should we also catch Timeout::Error? Flay does internally (not in a code path we're using, FWIW). I think the thinking behind it was that if the parser took too long on a file, it's probably a parser bug (or maybe an absurdly large file). We were implicitly catching it before.

I'd try to match the behavior of Flay run on the command line then. If the Flay catches the timeout error, logs a skip, and continues on, then I think that we should do the same.

@dblandin 's reasoning sounds good to me.

I agree as well. It has been added now.

Actually, I'm going to back this out: I remembered why I was hesitant about this in the first place. This series of changes was prompted by concerns of non-deterministic results from this engine (due to memory settings, but really a variety of system issues could come into play).

These timeouts are, by their nature, slightly non-deterministic. a file complex & big enough to be borderline might timeout some times & not others. Memory pressure on the system could cause execution time to vary between one run & another. Etc.

So instead I think I'm going to consider the timeout errors fatal & up the timeout limit to something absurd like 5 minutes. That should ensure it only gets triggered in pathological cases (like a parser bug leading to an infinite loop or something).

@jpignata this is relevant to your interests, so your thoughts are welcome.

The Analyzer `#run` loop was rescuing *all* exceptions, which is excessive. I don't think we should hard-fail on all exceptions, either: the Ruby parser gem is definitely known to barf on some valid but esoteric Ruby code, and I wouldn't be surprised if the other parsers had similar edge cases. So this changes behavior to only skip files for a known set of catchable errors. The out-of-process parsers are a little tricky since all you can get from them is an exit code and output streams: for now wrapping those in our own exception class seems reasonable. I'm still catching all exceptions, but only so that we can log a message about which file is impacted before we re-raise. Since a raw exception we'll likely abort on may not contain helpful information about the file that triggered it, this would be helpful for debugging cases.

Each of the out-of-process parser classes implemented its own CommandLineRunner, and they were all functionally the same (with some small differences that weren't actually used). This pulls the class up to one reused class.

Timeouts are slightly non-deterministic by their nature. So we should consider them fatal. They can still be useful, but should be for truly pathological cases since they are fatal, so I've upped the limit to 5 mintues.

rescue less

dblandin reviewed Dec 17, 2015
View reviewed changes

wfleming reviewed Dec 17, 2015
View reviewed changes

wfleming force-pushed the will/rescue-less branch 2 times, most recently from 618ac29 to 5f1fddd Compare December 18, 2015 01:28

wfleming added 3 commits December 18, 2015 10:48

refactor: extract CommandLineRunner

818ada3

Each of the out-of-process parser classes implemented its own CommandLineRunner, and they were all functionally the same (with some small differences that weren't actually used). This pulls the class up to one reused class.

Timeout errors should be fatal

724e48f

Timeouts are slightly non-deterministic by their nature. So we should consider them fatal. They can still be useful, but should be for truly pathological cases since they are fatal, so I've upped the limit to 5 mintues.

wfleming force-pushed the will/rescue-less branch from 5f1fddd to 724e48f Compare December 18, 2015 15:48

wfleming added a commit that referenced this pull request Dec 18, 2015

Merge pull request #55 from codeclimate/will/rescue-less

ae5f594

rescue less

wfleming merged commit ae5f594 into master Dec 18, 2015

wfleming deleted the will/rescue-less branch December 18, 2015 16:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rescue less #55

rescue less #55

wfleming commented Dec 17, 2015

dblandin Dec 17, 2015

wfleming Dec 17, 2015

dblandin Dec 17, 2015

dblandin commented Dec 17, 2015

wfleming Dec 17, 2015

dblandin Dec 17, 2015

pbrisbin Dec 17, 2015

wfleming Dec 17, 2015

wfleming Dec 17, 2015

rescue less #55

rescue less #55

Conversation

wfleming commented Dec 17, 2015

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dblandin commented Dec 17, 2015

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment