built-in add -p: add support for the same config settings as the Perl version #175

dscho · 2019-03-28T20:37:10Z

This is the final leg of the journey to a fully built-in git add: the git add -i and git add -p modes were re-implemented in C, but they lacked support for a couple of config settings.

The one that sticks out most is the interactive.singleKey setting: it was particularly hard to get to work, especially on Windows.

It also seems to be the setting that is incomplete already in the Perl version of the interactive add command: while the name of the config setting suggests that it applies to all of the interactive add, including the main loop of git add --interactive and to the file selections in that command, it does not. Only the git add --patch mode respects that setting.

As it is outside the purpose of the conversion of git-add--interactive.perl to C, we will leave that loose end for some future date.

Changes since v3:

Reverted that heavy-handed SIGPIPE handling.
Instead, changed the diffFilter test case to process the standard input instead of ignoring it.

(The range diff between v2 and v4 actually only shows the new patch 1/10 "t3701: adjust difffilter test".)

Changes since v2:

Fixed the SIGPIPE issue pointed out by Gábor Szeder.

Changes since v1:

Fixed the commit message where a copy/paste fail made it talk about another GIT_TEST_* variable than the GIT_TEST_ADD_I_USE_BUILTIN one.

gitgitgadget · 2020-01-13T08:29:59Z

Submitted as [email protected]

gitgitgadget · 2020-01-13T17:07:45Z

add-interactive.c

@@ -9,6 +9,7 @@
 #include "lockfile.h"


On the Git mailing list, SZEDER Gábor wrote (reply to this):

On Mon, Jan 13, 2020 at 08:29:22AM +0000, Johannes Schindelin via GitGitGadget wrote: > From: Johannes Schindelin <[email protected]> > > As noticed by Gábor Szeder, if we want to run `git add -p` with > redirected input through `test_must_fail` in the test suite, we must > expect that a SIGPIPE can happen due to `stdin` coming to its end. I don't think this issue is related to the redirected input: I modified that flaky test to send "unlimited" data to 'git add's stdin, i.e.: /usr/bin/yes | test_must_fail force_color git add -p and the test with --stress still failed with SIGPIPE all the same and just as fast. After looking into it, the issue seems to be sending data to the broken diffFilter process. So in that test the diff is "filtered" through 'echo too-short', which exits real fast, and doesn't read its standard input at all (well, apart from e.g. the usual kernel buffering that might happen on a pipe between the two processes). Making sure that the diffFilter process reads all the data before exiting, i.e. changing it to: test_config interactive.diffFilter "cat >/dev/null ; echo too-short" && made the test reliable, with over 2000 --stress repetitions, and that with only a single "y" on 'git add's stdin. Now, merely tweaking the test is clearly insufficient, because we not only want the test to be realiable, but we want 'git add' to die gracefully when users out there mess up their configuration. Ignoring SIGPIPE can surely accomplish that, but I'm not sure about the scope. I mean your patch seems to ignore SIGPIPE basically for almost the whole 'git add -(i|p)' process, but perhaps it should be limited only to the surroundings of the pipe_command() call running the diffFilter, and be done as part of the next patch adding the 'if (diff_filter)' block. Furthermore, I'm worried that by simply ignoring SIGPIPE we might just ignore a more fundamental issue in pipe_command(): shouldn't that function be smart enough not to write() to a fd that has no one on the other side to read it in the first place?! So, when the diffFilter process exits unexpectedly early, then the poll() call in pipe_command() -> pump_io() -> pump_io_round() returns with success and usually sets 'revents' for the child process' stdin to 12 (i.e. 'POLLOUT | POLLERR'; gah, how I hate unnamed constants :). Unfortunately, at that point we don't take any special action on POLLERR, but call xwrite() to try to write to the dead fd anyway, which then promptly triggers SIGPIPE. (This is what usually happens when stepping through the statements of those functions in a debugger, and the diffFilter process has all the time in the world to exit.) We could handle POLLERR with a patch like this: --- >8 --- Subject: run-command: handle POLLERR in pump_io_round() to reduce risk of SIGPIPE diff --git a/run-command.c b/run-command.c index 3449db319b..57093f0acc 100644 --- a/run-command.c +++ b/run-command.c @@ -1416,25 +1416,31 @@ static int pump_io_round(struct io_pump *slots, int nr, struct pollfd *pfd) if (poll(pfd, pollsize, -1) < 0) { if (errno == EINTR) return 1; die_errno("poll failed"); } for (i = 0; i < nr; i++) { struct io_pump *io = &slots[i]; if (io->fd < 0) continue; - if (!(io->pfd->revents & (POLLOUT|POLLIN|POLLHUP|POLLERR|POLLNVAL))) + if (io->pfd->revents & POLLERR) { + io->error = ECONNRESET; /* What should we report to the caller? */ + close(io->fd); + io->fd = -1; + continue; + } + if (!(io->pfd->revents & (POLLOUT|POLLIN|POLLHUP|POLLNVAL))) continue; if (io->type == POLLOUT) { ssize_t len = xwrite(io->fd, io->u.out.buf, io->u.out.len); if (len < 0) { io->error = errno; close(io->fd); io->fd = -1; } else { io->u.out.buf += len; io->u.out.len -= len; --- >8 --- Unfortunately #1, this changes the error 'git add -p' dies with from: error: mismatched output from interactive.diffFilter to: error: failed to run 'echo too-short' It might affect other commands as well, but FWIW the test suite doesn't catch any. Unfortunately #2, the above patch doesn't completely eliminates the SIGPIPE, but only (greatly) reduces its probability. It is possible that: - poll() returns with success and indicating a writable fd without any error, i.e. 'revents = 4'. - the bogus diffFilter exits, closing its stdin. - 'git add' attempts to xwrite() to the now closed fd, and triggers a SIGPIPE right away. This happens much rarer, 'GIT_TEST_ADD_I_USE_BUILTIN=1 ./t3701-add-interactive.sh -r 39,49 --stress-jobs=<4*nr-of-cores> --stress' tends to take over 200 repetitions. The patch below reproduces it fairly reliably by adding two strategically-placed sleep()s, with a bit of extra debug output: --- >8 --- diff --git a/add-patch.c b/add-patch.c index d8dafa8168..0fd017bbd3 100644 --- a/add-patch.c +++ b/add-patch.c @@ -421,6 +421,7 @@ static int parse_diff(struct add_p_state *s, const struct pathspec *ps) filter_cp.git_cmd = 0; filter_cp.use_shell = 1; strbuf_reset(&s->buf); + fprintf(stderr, "about to run diffFilter\n"); if (pipe_command(&filter_cp, colored->buf, colored->len, &s->buf, colored->len, diff --git a/run-command.c b/run-command.c index 57093f0acc..49ae88a922 100644 --- a/run-command.c +++ b/run-command.c @@ -1419,6 +1419,7 @@ static int pump_io_round(struct io_pump *slots, int nr, struct pollfd *pfd) die_errno("poll failed"); } + sleep(2); for (i = 0; i < nr; i++) { struct io_pump *io = &slots[i]; @@ -1435,8 +1436,11 @@ static int pump_io_round(struct io_pump *slots, int nr, struct pollfd *pfd) continue; if (io->type == POLLOUT) { - ssize_t len = xwrite(io->fd, + ssize_t len; + fprintf(stderr, "attempting to xwrite() %lu bytes to a fd with revents flags 0x%hx\n", io->u.out.len, io->pfd->revents); + len = xwrite(io->fd, io->u.out.buf, io->u.out.len); + fprintf(stderr, "after xwrite()\n"); if (len < 0) { io->error = errno; close(io->fd); diff --git a/t/t3701-add-interactive.sh b/t/t3701-add-interactive.sh index 12ee321707..acffc9af37 100755 --- a/t/t3701-add-interactive.sh +++ b/t/t3701-add-interactive.sh @@ -561,7 +561,7 @@ test_expect_success 'detect bogus diffFilter output' ' git reset --hard && echo content >test && - test_config interactive.diffFilter "echo too-short" && + test_config interactive.diffFilter "sleep 1 ; echo too-short" && printf y >y && test_must_fail force_color git add -p <y ' --- >8 --- and 'GIT_TEST_ADD_I_USE_BUILTIN=1 ./t3701-add-interactive.sh -r 39,49' fails with: + test_must_fail force_color git add -p about to run diffFilter attempting to xwrite() 224 bytes to a fd with revents flags 0x4 test_must_fail: died by signal 13: force_color git add -p I don't understand why we get SIGPIPE right away instead of some error that we can act upon (ECONNRESET?). FWIW, it fails the same way not only on my box, but on Travis CI's Linux and OSX images as well. https://travis-ci.org/szeder/git/jobs/636446843#L2937 Cc'ing Peff for all things SIGPIPE :) who also happens to be the author of both pipe_command() and that now flaky test. > The appropriate action here is to ignore that signal and treat it as a > regular end-of-file, otherwise the test will fail. In preparation for > such a test, introduce precisely this handling of SIGPIPE into the > built-in version of `git add -p`. > > For good measure, teach the built-in `git add -i` the same trick: it > _also_ runs a loop waiting for input, and can receive a SIGPIPE just the > same (and wants to treat it as end-of-file, too). > > Signed-off-by: Johannes Schindelin <[email protected]> > --- > add-interactive.c | 3 +++ > add-patch.c | 4 ++++ > 2 files changed, 7 insertions(+) > > diff --git a/add-interactive.c b/add-interactive.c > index a5bb14f2f4..3ff8400ea4 100644 > --- a/add-interactive.c > +++ b/add-interactive.c > @@ -9,6 +9,7 @@ > #include "lockfile.h" > #include "dir.h" > #include "run-command.h" > +#include "sigchain.h" > > static void init_color(struct repository *r, struct add_i_state *s, > const char *slot_name, char *dst, > @@ -1097,6 +1098,7 @@ int run_add_i(struct repository *r, const struct pathspec *ps) > ->util = util; > } > > + sigchain_push(SIGPIPE, SIG_IGN); > init_add_i_state(&s, r); > > /* > @@ -1149,6 +1151,7 @@ int run_add_i(struct repository *r, const struct pathspec *ps) > strbuf_release(&print_file_item_data.worktree); > strbuf_release(&header); > prefix_item_list_clear(&commands); > + sigchain_pop(SIGPIPE); > > return res; > } > diff --git a/add-patch.c b/add-patch.c > index 46c6c183d5..9a3beed72e 100644 > --- a/add-patch.c > +++ b/add-patch.c > @@ -6,6 +6,7 @@ > #include "pathspec.h" > #include "color.h" > #include "diff.h" > +#include "sigchain.h" > > enum prompt_mode_type { > PROMPT_MODE_CHANGE = 0, PROMPT_DELETION, PROMPT_HUNK, > @@ -1578,6 +1579,7 @@ int run_add_p(struct repository *r, enum add_p_mode mode, > }; > size_t i, binary_count = 0; > > + sigchain_push(SIGPIPE, SIG_IGN); > init_add_i_state(&s.s, r); > > if (mode == ADD_P_STASH) > @@ -1612,6 +1614,7 @@ int run_add_p(struct repository *r, enum add_p_mode mode, > parse_diff(&s, ps) < 0) { > strbuf_release(&s.plain); > strbuf_release(&s.colored); > + sigchain_pop(SIGPIPE); > return -1; > } > > @@ -1630,5 +1633,6 @@ int run_add_p(struct repository *r, enum add_p_mode mode, > strbuf_release(&s.buf); > strbuf_release(&s.plain); > strbuf_release(&s.colored); > + sigchain_pop(SIGPIPE); > return 0; > } > -- > gitgitgadget >

On the Git mailing list, Jeff King wrote (reply to this):

On Mon, Jan 13, 2020 at 06:04:17PM +0100, SZEDER Gábor wrote: > After looking into it, the issue seems to be sending data to the > broken diffFilter process. So in that test the diff is "filtered" > through 'echo too-short', which exits real fast, and doesn't read its > standard input at all (well, apart from e.g. the usual kernel > buffering that might happen on a pipe between the two processes). > Making sure that the diffFilter process reads all the data before > exiting, i.e. changing it to: > > test_config interactive.diffFilter "cat >/dev/null ; echo too-short" && > > made the test reliable, with over 2000 --stress repetitions, and that > with only a single "y" on 'git add's stdin. Yeah, I agree the test should be changed. What you wrote above was my first thought, too, but I think "sed 1d" is actually a more realistic test (and is shorter and one fewer process). > Now, merely tweaking the test is clearly insufficient, because we not > only want the test to be realiable, but we want 'git add' to die > gracefully when users out there mess up their configuration. I also agree that it would be nice to deal with this for real-world cases. I suspect it's not something that would come up a lot, though. > Ignoring SIGPIPE can surely accomplish that, but I'm not sure about > the scope. I mean your patch seems to ignore SIGPIPE basically for > almost the whole 'git add -(i|p)' process, but perhaps it should be > limited only to the surroundings of the pipe_command() call running > the diffFilter, and be done as part of the next patch adding the 'if > (diff_filter)' block. The scope there is probably OK in practice. In my opinion SIGPIPE is usually _not_ what the behavior we want. If we're carefully checking our write() return values, then we'd get EPIPE in such an instance and behave appropriately. And if we're not checking our write() return values, that's generally a bug that ought to be fixed. The big exception is when we are writing copious output to stdout (or the pager) via printf() or similar, and want to die rather than continue writing output nobody will see. But I don't think git-add really counts as generating a lot of output, where EPIPE could prevent us from doing useless work (unlike, say, git-log). > Furthermore, I'm worried that by simply ignoring SIGPIPE we might just > ignore a more fundamental issue in pipe_command(): shouldn't that > function be smart enough not to write() to a fd that has no one on the > other side to read it in the first place?! Maybe. As you noted below, checking for POLLERR is racy. Seeing that we "can" write to an fd and doing it to discover what write() returns (whether error or not) doesn't seem like the worst strategy. If the caller cares about pipe death, then it needs to be handling SIGPIPE anyway. I really wish there was a way to set a handler for SIGPIPE that tells _which_ descriptor caused it. Because I think logic like "die if it was fd 1, ignore and let write() return EPIPE otherwise" is the behavior we'd like. But I don't think there's a portable way to do so. I've been tempted to say that we should just ignore SIGPIPE everywhere, and convert even copious-output programs like git-log to just check for errors (they could probably even just check ferror(stdout) for each commit we output, if we didn't want to touch every printf call). -Peff

On the Git mailing list, Johannes Schindelin wrote (reply to this):

This message is in MIME format. The first part should be readable text, while the remaining parts are likely unreadable without MIME-aware tools. --8323328-1685775113-1579006064=:46 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Hi G=C3=A1bor, On Mon, 13 Jan 2020, SZEDER G=C3=A1bor wrote: > On Mon, Jan 13, 2020 at 08:29:22AM +0000, Johannes Schindelin via GitGit= Gadget wrote: > > From: Johannes Schindelin <[email protected]> > > > > As noticed by G=C3=A1bor Szeder, if we want to run `git add -p` with > > redirected input through `test_must_fail` in the test suite, we must > > expect that a SIGPIPE can happen due to `stdin` coming to its end. > > I don't think this issue is related to the redirected input: I > modified that flaky test to send "unlimited" data to 'git add's stdin, > i.e.: > > /usr/bin/yes | test_must_fail force_color git add -p > > and the test with --stress still failed with SIGPIPE all the same and > just as fast. > > After looking into it, the issue seems to be sending data to the > broken diffFilter process. Ouch. Thank you for investigating. For my education, how did you debug this? I could not find a way to identify *what* caused that SIGPIPE... > So in that test the diff is "filtered" through 'echo too-short', which > exits real fast, and doesn't read its standard input at all (well, apart > from e.g. the usual kernel buffering that might happen on a pipe between > the two processes). Making sure that the diffFilter process reads all > the data before exiting, i.e. changing it to: > > test_config interactive.diffFilter "cat >/dev/null ; echo too-short" &= & > > made the test reliable, with over 2000 --stress repetitions, and that > with only a single "y" on 'git add's stdin. Ah, my diff filter simply ignores the `stdin`... That's easy enough to fix, and since real-world diff filters probably won't just blatantly ignore the input, I think it is legitimate to change the test. > Now, merely tweaking the test is clearly insufficient, because we not > only want the test to be realiable, but we want 'git add' to die > gracefully when users out there mess up their configuration. I think it is sufficient to tweak the test, but I agree that a better error message might be good when users out there mess up their configuration. > Ignoring SIGPIPE can surely accomplish that, but I'm not sure about > the scope. I mean your patch seems to ignore SIGPIPE basically for > almost the whole 'git add -(i|p)' process, but perhaps it should be > limited only to the surroundings of the pipe_command() call running > the diffFilter, and be done as part of the next patch adding the 'if > (diff_filter)' block. Right. Very heavy-handed, and probably inviting unwanted side effects. > Furthermore, I'm worried that by simply ignoring SIGPIPE we might just > ignore a more fundamental issue in pipe_command(): shouldn't that > function be smart enough not to write() to a fd that has no one on the > other side to read it in the first place?! > > So, when the diffFilter process exits unexpectedly early, then the > poll() call in pipe_command() -> pump_io() -> pump_io_round() returns > with success and usually sets 'revents' for the child process' stdin > to 12 (i.e. 'POLLOUT | POLLERR'; gah, how I hate unnamed constants :). > Unfortunately, at that point we don't take any special action on > POLLERR, but call xwrite() to try to write to the dead fd anyway, > which then promptly triggers SIGPIPE. (This is what usually happens > when stepping through the statements of those functions in a debugger, > and the diffFilter process has all the time in the world to exit.) > > We could handle POLLERR with a patch like this: > > --- >8 --- > > Subject: run-command: handle POLLERR in pump_io_round() to reduce risk o= f SIGPIPE > > diff --git a/run-command.c b/run-command.c > index 3449db319b..57093f0acc 100644 > --- a/run-command.c > +++ b/run-command.c > @@ -1416,25 +1416,31 @@ static int pump_io_round(struct io_pump *slots, = int nr, struct pollfd *pfd) > if (poll(pfd, pollsize, -1) < 0) { > if (errno =3D=3D EINTR) > return 1; > die_errno("poll failed"); > } > > for (i =3D 0; i < nr; i++) { > struct io_pump *io =3D &slots[i]; > > if (io->fd < 0) > continue; > > - if (!(io->pfd->revents & (POLLOUT|POLLIN|POLLHUP|POLLERR|POLLNVAL))) > + if (io->pfd->revents & POLLERR) { > + io->error =3D ECONNRESET; /* What should we report to the caller? *= / > + close(io->fd); > + io->fd =3D -1; > + continue; > + } > + if (!(io->pfd->revents & (POLLOUT|POLLIN|POLLHUP|POLLNVAL))) > continue; > > if (io->type =3D=3D POLLOUT) { > ssize_t len =3D xwrite(io->fd, > io->u.out.buf, io->u.out.len); > if (len < 0) { > io->error =3D errno; > close(io->fd); > io->fd =3D -1; > } else { > io->u.out.buf +=3D len; > io->u.out.len -=3D len; > > --- >8 --- > > Unfortunately #1, this changes the error 'git add -p' dies with from: > > error: mismatched output from interactive.diffFilter > > to: > > error: failed to run 'echo too-short' > > It might affect other commands as well, but FWIW the test suite > doesn't catch any. Hmm. My first impression is that the error message could be a bit better, but that it is probably a good thing to have. It would have helped _me_ understand the issue at hand. > Unfortunately #2, the above patch doesn't completely eliminates the > SIGPIPE, but only (greatly) reduces its probability. It is possible > that: > > - poll() returns with success and indicating a writable fd without > any error, i.e. 'revents =3D 4'. > > - the bogus diffFilter exits, closing its stdin. > > - 'git add' attempts to xwrite() to the now closed fd, and triggers > a SIGPIPE right away. > > This happens much rarer, 'GIT_TEST_ADD_I_USE_BUILTIN=3D1 > ./t3701-add-interactive.sh -r 39,49 --stress-jobs=3D<4*nr-of-cores> > --stress' tends to take over 200 repetitions. The patch below > reproduces it fairly reliably by adding two strategically-placed > sleep()s, with a bit of extra debug output: > > --- >8 --- > > diff --git a/add-patch.c b/add-patch.c > index d8dafa8168..0fd017bbd3 100644 > --- a/add-patch.c > +++ b/add-patch.c > @@ -421,6 +421,7 @@ static int parse_diff(struct add_p_state *s, const s= truct pathspec *ps) > filter_cp.git_cmd =3D 0; > filter_cp.use_shell =3D 1; > strbuf_reset(&s->buf); > + fprintf(stderr, "about to run diffFilter\n"); > if (pipe_command(&filter_cp, > colored->buf, colored->len, > &s->buf, colored->len, > diff --git a/run-command.c b/run-command.c > index 57093f0acc..49ae88a922 100644 > --- a/run-command.c > +++ b/run-command.c > @@ -1419,6 +1419,7 @@ static int pump_io_round(struct io_pump *slots, in= t nr, struct pollfd *pfd) > die_errno("poll failed"); > } > > + sleep(2); > for (i =3D 0; i < nr; i++) { > struct io_pump *io =3D &slots[i]; > > @@ -1435,8 +1436,11 @@ static int pump_io_round(struct io_pump *slots, i= nt nr, struct pollfd *pfd) > continue; > > if (io->type =3D=3D POLLOUT) { > - ssize_t len =3D xwrite(io->fd, > + ssize_t len; > + fprintf(stderr, "attempting to xwrite() %lu bytes to a fd with reven= ts flags 0x%hx\n", io->u.out.len, io->pfd->revents); > + len =3D xwrite(io->fd, > io->u.out.buf, io->u.out.len); > + fprintf(stderr, "after xwrite()\n"); > if (len < 0) { > io->error =3D errno; > close(io->fd); > diff --git a/t/t3701-add-interactive.sh b/t/t3701-add-interactive.sh > index 12ee321707..acffc9af37 100755 > --- a/t/t3701-add-interactive.sh > +++ b/t/t3701-add-interactive.sh > @@ -561,7 +561,7 @@ test_expect_success 'detect bogus diffFilter output'= ' > git reset --hard && > > echo content >test && > - test_config interactive.diffFilter "echo too-short" && > + test_config interactive.diffFilter "sleep 1 ; echo too-short" && > printf y >y && > test_must_fail force_color git add -p <y > ' > > --- >8 --- > > and 'GIT_TEST_ADD_I_USE_BUILTIN=3D1 ./t3701-add-interactive.sh -r 39,49' > fails with: > > + test_must_fail force_color git add -p > about to run diffFilter > attempting to xwrite() 224 bytes to a fd with revents flags 0x4 > test_must_fail: died by signal 13: force_color git add -p > > I don't understand why we get SIGPIPE right away instead of some error > that we can act upon (ECONNRESET?). Isn't it buffered? In any case, I would take the above-mentioned patch, even if it makes it "only" less likely to hit `SIGPIPE`. > FWIW, it fails the same way not only on my box, but on Travis CI's Linux > and OSX images as well. > > https://travis-ci.org/szeder/git/jobs/636446843#L2937 > > > Cc'ing Peff for all things SIGPIPE :) who also happens to be the > author of both pipe_command() and that now flaky test. I'll go with Peff's suggestion to use `sed 1d` instead of `echo too-short`. Thanks, Dscho --8323328-1685775113-1579006064=:46--

On the Git mailing list, Junio C Hamano wrote (reply to this):

Jeff King <[email protected]> writes: > On Mon, Jan 13, 2020 at 06:04:17PM +0100, SZEDER GÃ¡bor wrote: > >> After looking into it, the issue seems to be sending data to the >> broken diffFilter process. So in that test the diff is "filtered" >> through 'echo too-short', which exits real fast, and doesn't read its >> standard input at all (well, apart from e.g. the usual kernel >> buffering that might happen on a pipe between the two processes). >> Making sure that the diffFilter process reads all the data before >> exiting, i.e. changing it to: >> >> test_config interactive.diffFilter "cat >/dev/null ; echo too-short" && >> >> made the test reliable, with over 2000 --stress repetitions, and that >> with only a single "y" on 'git add's stdin. > > Yeah, I agree the test should be changed. What you wrote above was my > first thought, too, but I think "sed 1d" is actually a more realistic > test (and is shorter and one fewer process). I am not sure what we are aiming for. Are we making sure the command behaves well in the hands of end users, who may write a script that consumes only early parts of the input that is needed for its use and stops reading, or are we just aiming to claim "all our tests pass"? I was hoping that we would be doing the former, and I would understand if the suggestion were "sed 1q" for that exact reason. IOW, shouldn't we be fixing the part that drives the external process, so that the test "passes" even with such a "broken" filter? >> Now, merely tweaking the test is clearly insufficient, because we not >> only want the test to be realiable, but we want 'git add' to die >> gracefully when users out there mess up their configuration. Yes, and I was hoping that we do not have to touch the test if we did the latter. > I really wish there was a way to set a handler for SIGPIPE that tells > _which_ descriptor caused it. Because I think logic like "die if it was > fd 1, ignore and let write() return EPIPE otherwise" is the behavior > we'd like. But I don't think there's a portable way to do so. > > I've been tempted to say that we should just ignore SIGPIPE everywhere, > and convert even copious-output programs like git-log to just check for > errors (they could probably even just check ferror(stdout) for each > commit we output, if we didn't want to touch every printf call). Yeah, I share that temptation.

On the Git mailing list, Jeff King wrote (reply to this):

On Wed, Jan 15, 2020 at 10:32:59AM -0800, Junio C Hamano wrote: > >> test_config interactive.diffFilter "cat >/dev/null ; echo too-short" && > >> > >> made the test reliable, with over 2000 --stress repetitions, and that > >> with only a single "y" on 'git add's stdin. > > > > Yeah, I agree the test should be changed. What you wrote above was my > > first thought, too, but I think "sed 1d" is actually a more realistic > > test (and is shorter and one fewer process). > > I am not sure what we are aiming for. Are we making sure the > command behaves well in the hands of end users, who may write a > script that consumes only early parts of the input that is needed > for its use and stops reading, or are we just aiming to claim "all > our tests pass"? I was hoping that we would be doing the former, > and I would understand if the suggestion were "sed 1q" for that > exact reason. > > IOW, shouldn't we be fixing the part that drives the external > process, so that the test "passes" even with such a "broken" filter? The original motivation for this test (and the code that fixes it) was diff-so-fancy, which read all of the input but didn't have a 1:1 line correspondence in the output (IIRC it condensed some particular lines, like rename from/to into a single line). And I think most sane filters would end up reading all of the content. Or a misconfiguration would cause them to read nothing at all. So something like "sed 1d" is more representative of a real filter. If we want to test SIGPIPE, then the current one that reads _nothing_ is the most torturous. But "sed 1q" is neither realistic (if that's what we're going for) nor the hardest thing we can throw at the code (if that's what we want). > > I've been tempted to say that we should just ignore SIGPIPE everywhere, > > and convert even copious-output programs like git-log to just check for > > errors (they could probably even just check ferror(stdout) for each > > commit we output, if we didn't want to touch every printf call). > > Yeah, I share that temptation. Hmm. My recollection was that you were more of a fan of SIGPIPE than I am. But if you agree, then maybe the time has come for action. :) -Peff

On the Git mailing list, SZEDER Gábor wrote (reply to this):

On Mon, Jan 13, 2020 at 06:04:17PM +0100, SZEDER Gábor wrote: > and 'GIT_TEST_ADD_I_USE_BUILTIN=1 ./t3701-add-interactive.sh -r 39,49' > fails with: > > + test_must_fail force_color git add -p > about to run diffFilter > attempting to xwrite() 224 bytes to a fd with revents flags 0x4 > test_must_fail: died by signal 13: force_color git add -p > > I don't understand why we get SIGPIPE right away instead of some error > that we can act upon (ECONNRESET?). Doh', because it's a pipe, not a socket, that's why. pipe(7): "If all file descriptors referring to the read end of a pipe have been closed, then a write(2) will cause a SIGPIPE signal to be generated for the calling process." So ECONNRESET is definitely not the right error to set on POLLERR, though I'm still not sure what the right one would be (perhaps EPIPE?).

On the Git mailing list, Jeff King wrote (reply to this):

On Fri, Jan 17, 2020 at 03:32:36PM +0100, SZEDER Gábor wrote: > On Mon, Jan 13, 2020 at 06:04:17PM +0100, SZEDER Gábor wrote: > > and 'GIT_TEST_ADD_I_USE_BUILTIN=1 ./t3701-add-interactive.sh -r 39,49' > > fails with: > > > > + test_must_fail force_color git add -p > > about to run diffFilter > > attempting to xwrite() 224 bytes to a fd with revents flags 0x4 > > test_must_fail: died by signal 13: force_color git add -p > > > > I don't understand why we get SIGPIPE right away instead of some error > > that we can act upon (ECONNRESET?). > > Doh', because it's a pipe, not a socket, that's why. pipe(7): > > "If all file descriptors referring to the read end of a pipe have > been closed, then a write(2) will cause a SIGPIPE signal to be > generated for the calling process." > > So ECONNRESET is definitely not the right error to set on POLLERR, > though I'm still not sure what the right one would be (perhaps > EPIPE?). Yes, if SIGPIPE is ignored, then that write() would produce EPIPE. So if you're trying to emulate it via POLLERR, that would be accurate. Of course it could fail for _other_ reasons, and I don't think we'd know what those are without actually calling write(). Practically speaking, though, if we know it's a pipe with a valid descriptor then any error is basically equivalent to EPIPE (we don't care how, but for whatever reason we couldn't write to the other end). -Peff

gitgitgadget · 2020-01-13T19:16:06Z

This patch series was integrated into pu via git@06d4496.

In 42f7d45 (add--interactive: detect bogus diffFilter output, 2018-03-03), we added a test case that verifies that the diffFilter feature complains appropriately when the output is too short. In preparation for the upcoming change where the built-in `add -p` is taught to respect that setting, let's adjust that test a little. The problem is that `echo too-short` is configured as diffFilter, and it does not read the `stdin`. When calling it through `pipe_command()`, it is therefore possible that we try to feed the `diff` to it while it is no longer listening, and we receive a `SIGPIPE`. The Perl code apparently handles this in a way similar to an end-of-file, but taking a step back, we realize that a diffFilter that does not even _look_ at its standard input is very unrealistic. The entire point of this feature is to transform the diff, not to ignore it altogether. So let's modify the test case to reflect that insight: instead of printing some bogus text, let's use a diffFilter that deletes the first line of the diff instead. This still tests for the same thing, but it does not confuse the built-in `add -p` with that `SIGPIPE`. Helped-by: SZEDER Gábor <[email protected]> Helped-by: Jeff King <[email protected]> Signed-off-by: Johannes Schindelin <[email protected]>

The Perl version supports post-processing the colored diff (that is generated in addition to the uncolored diff, intended to offer a prettier user experience) by a command configured via that config setting, and now the built-in version does that, too. Signed-off-by: Johannes Schindelin <[email protected]>

The Perl version of `git add -p` reads the config setting `diff.algorithm` and if set, uses it to generate the diff using the specified algorithm. This patch ports that functionality to the C version. Note: just like `git-add--interactive.perl`, we do _not_ respect this config setting in `git add -i`'s `diff` command, but _only_ in the `patch` command. Signed-off-by: Johannes Schindelin <[email protected]>

We are about to introduce the function `enable_non_canonical()`, which shares almost the complete code with `disable_echo()`. Let's prepare for that, by refactoring out that shared code. Signed-off-by: Johannes Schindelin <[email protected]>

Git for Windows' Git Bash runs in MinTTY by default, which does not have a Win32 Console instance, but uses MSYS2 pseudo terminals instead. This is a problem, as Git for Windows does not want to use the MSYS2 emulation layer for Git itself, and therefore has no direct way to interact with that pseudo terminal. As a workaround, use the `stty` utility (which is included in Git for Windows, and which *is* an MSYS2 program, so it knows how to deal with the pseudo terminal). Note: If Git runs in a regular CMD or PowerShell window, there *is* a regular Win32 Console to work with. This is not a problem for the MSYS2 `stty`: it copes with this scenario just fine. Also note that we introduce support for more bits than would be necessary for a mere `disable_echo()` here, in preparation for the upcoming `enable_non_canonical()` function. Signed-off-by: Johannes Schindelin <[email protected]>

Typically, input on the command-line is line-based. It is actually not really easy to get single characters (or better put: keystrokes). We provide two implementations here: - One that handles `/dev/tty` based systems as well as native Windows. The former uses the `tcsetattr()` function to put the terminal into "raw mode", which allows us to read individual keystrokes, one by one. The latter uses `stty.exe` to do the same, falling back to direct Win32 Console access. Thanks to the refactoring leading up to this commit, this is a single function, with the platform-specific details hidden away in conditionally-compiled code blocks. - A fall-back which simply punts and reads back an entire line. Note that the function writes the keystroke into an `strbuf` rather than a `char`, in preparation for reading Escape sequences (e.g. when the user hit an arrow key). This is also required for UTF-8 sequences in case the keystroke corresponds to a non-ASCII letter. Signed-off-by: Johannes Schindelin <[email protected]>

The Perl version of `git add -p` supports this config setting to allow users to input commands via single characters (as opposed to having to press the <Enter> key afterwards). This is an opt-in feature because it requires Perl packages (Term::ReadKey and Term::Cap, where it tries to handle an absence of the latter package gracefully) to work. Note that at least on Ubuntu, that Perl package is not installed by default (it needs to be installed via `sudo apt-get install libterm-readkey-perl`), so this feature is probably not used a whole lot. In C, we obviously do not have these packages available, but we just introduced `read_single_keystroke()` that is similar to what Term::ReadKey provides, and we use that here. Signed-off-by: Johannes Schindelin <[email protected]>

This recapitulates part of b5cc003 (add -i: ignore terminal escape sequences, 2011-05-17): add -i: ignore terminal escape sequences On the author's terminal, the up-arrow input sequence is ^[[A, and thus fat-fingering an up-arrow into 'git checkout -p' is quite dangerous: git-add--interactive.perl will ignore the ^[ and [ characters and happily treat A as "discard everything". As a band-aid fix, use Term::Cap to get all terminal capabilities. Then use the heuristic that any capability value that starts with ^[ (i.e., \e in perl) must be a key input sequence. Finally, given an input that starts with ^[, read more characters until we have read a full escape sequence, then return that to the caller. We use a timeout of 0.5 seconds on the subsequent reads to avoid getting stuck if the user actually input a lone ^[. Since none of the currently recognized keys start with ^[, the net result is that the sequence as a whole will be ignored and the help displayed. Note that we leave part for later which uses "Term::Cap to get all terminal capabilities", for several reasons: 1. it is actually not really necessary, as the timeout of 0.5 seconds should be plenty sufficient to catch Escape sequences, 2. it is cleaner to keep the change to special-case Escape sequences separate from the change that reads all terminal capabilities to speed things up, and 3. in practice, relying on the terminal capabilities is a bit overrated, as the information could be incomplete, or plain wrong. For example, in this developer's tmux sessions, the terminal capabilities claim that the "cursor up" sequence is ^[M, but the actual sequence produced by the "cursor up" key is ^[[A. Signed-off-by: Johannes Schindelin <[email protected]>

When `interactive.singlekey = true`, we react immediately to keystrokes, even to Escape sequences (e.g. when pressing a cursor key). The problem with Escape sequences is that we do not really know when they are done, and as a heuristic we poll standard input for half a second to make sure that we got all of it. While waiting half a second is not asking for a whole lot, it can become quite annoying over time, therefore with this patch, we read the terminal capabilities (if available) and extract known Escape sequences from there, then stop polling immediately when we detected that the user pressed a key that generated such a known sequence. This recapitulates the remaining part of b5cc003 (add -i: ignore terminal escape sequences, 2011-05-17). Note: We do *not* query the terminal capabilities directly. That would either require a lot of platform-specific code, or it would require linking to a library such as ncurses. Linking to a library in the built-ins is something we try very hard to avoid (we even kicked the libcurl dependency to a non-built-in remote helper, just to shave off a tiny fraction of a second from Git's startup time). And the platform-specific code would be a maintenance nightmare. Even worse: in Git for Windows' case, we would need to query MSYS2 pseudo terminals, which `git.exe` simply cannot do (because it is intentionally *not* an MSYS2 program). To address this, we simply spawn `infocmp -L -1` and parse its output (which works even in Git for Windows, because that helper is included in the end-user facing installations). This is done only once, as in the Perl version, but it is done only when the first Escape sequence is encountered, not upon startup of `git add -i`; This saves on startup time, yet makes reacting to the first Escape sequence slightly more sluggish. But it allows us to keep the terminal-related code encapsulated in the `compat/terminal.c` file. Signed-off-by: Johannes Schindelin <[email protected]>

This job runs the test suite twice, once in regular mode, and once with a whole slew of `GIT_TEST_*` variables set. Now that the built-in version of `git add --interactive` is feature-complete, let's also throw `GIT_TEST_ADD_I_USE_BUILTIN` into that fray. Signed-off-by: Johannes Schindelin <[email protected]>

dscho · 2020-01-14T18:42:54Z

/submit

gitgitgadget · 2020-01-14T18:44:19Z

Submitted as [email protected]

gitgitgadget · 2020-01-15T23:40:27Z

This patch series was integrated into pu via git@00de1e7.

gitgitgadget · 2020-01-17T00:19:11Z

This patch series was integrated into pu via git@b1841a2.

gitgitgadget · 2020-01-24T12:00:01Z

This patch series was integrated into pu via git@8764040.

gitgitgadget · 2020-01-27T23:25:19Z

This patch series was integrated into pu via git@b63ffa8.

gitgitgadget · 2020-01-27T23:25:21Z

This patch series was integrated into next via git@ec5021a.

gitgitgadget · 2020-01-30T23:29:09Z

This patch series was integrated into pu via git@0a75cce.

gitgitgadget · 2020-02-05T23:38:57Z

This patch series was integrated into pu via git@76c57fe.

gitgitgadget · 2020-02-05T23:38:58Z

This patch series was integrated into next via git@76c57fe.

gitgitgadget · 2020-02-05T23:39:00Z

This patch series was integrated into master via git@76c57fe.

gitgitgadget · 2020-02-05T23:39:03Z

Closed via 76c57fe.

dscho force-pushed the add-p-in-c-config-settings branch 6 times, most recently from a2dc2d1 to 209a09c Compare April 3, 2019 20:28

This was referenced Apr 10, 2019

Offer a built-in version of git add -i and git add -p git-for-windows/git#2150

Merged

git add -i: add a rudimentary version in C (supporting only status and help so far) #170

Closed

dscho force-pushed the add-p-in-c-config-settings branch from 209a09c to de2903a Compare April 10, 2019 17:42

dscho mentioned this pull request Apr 14, 2019

Get the BusyBox-based MinGit production-ready git-for-windows/git#1439

Open

dscho changed the base branch from master to js/builtin-add-i August 26, 2019 12:51

dscho force-pushed the add-p-in-c-config-settings branch 2 times, most recently from 56a9b7e to 9cdd849 Compare August 26, 2019 13:09

dscho force-pushed the js/builtin-add-i branch from f3665cf to cf308ea Compare August 30, 2019 23:13

weekly-digest bot mentioned this pull request Sep 1, 2019

Weekly Digest (25 August, 2019 - 1 September, 2019) #326

Closed

dscho mentioned this pull request Sep 24, 2019

Re-implement git add -i as built-in git-for-windows/git#1871

Closed

dscho mentioned this pull request Oct 12, 2019

git-add--interactive.perl: Add progress counter in the prompt #349

Closed

dscho force-pushed the js/builtin-add-i branch 2 times, most recently from b8f458c to 2b5d5c1 Compare November 14, 2019 03:27

dscho force-pushed the add-p-in-c-config-settings branch 2 times, most recently from b43bdf3 to b3f5c67 Compare November 14, 2019 14:31

dscho force-pushed the js/builtin-add-i branch from 2b5d5c1 to 8c15904 Compare November 18, 2019 06:21

dscho force-pushed the add-p-in-c-config-settings branch 4 times, most recently from c06e42d to 79a735b Compare November 25, 2019 18:27

dscho changed the base branch from js/builtin-add-i to js/builtin-add-i-cmds November 25, 2019 18:58

dscho force-pushed the add-p-in-c-config-settings branch from 79a735b to 04f4971 Compare December 1, 2019 19:40

dscho force-pushed the js/builtin-add-i-cmds branch from 0de7b03 to 2e697ce Compare December 2, 2019 16:56

dscho force-pushed the add-p-in-c-config-settings branch from 04f4971 to 5f0a6d7 Compare December 3, 2019 16:05

gitgitgadget bot reviewed Jan 13, 2020

View reviewed changes

dscho added 10 commits January 14, 2020 13:41

dscho force-pushed the add-p-in-c-config-settings branch from c419596 to 7ab7ec6 Compare January 14, 2020 12:46

weekly-digest bot mentioned this pull request Feb 2, 2020

Weekly Digest (26 January, 2020 - 2 February, 2020) #543

Closed

gitgitgadget bot added the master label Feb 5, 2020

gitgitgadget bot closed this Feb 5, 2020

dscho deleted the add-p-in-c-config-settings branch February 5, 2020 23:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

built-in add -p: add support for the same config settings as the Perl version #175

built-in add -p: add support for the same config settings as the Perl version #175

dscho commented Mar 28, 2019 •

edited

Loading

gitgitgadget bot commented Jan 13, 2020

gitgitgadget bot Jan 13, 2020

gitgitgadget bot Jan 13, 2020

gitgitgadget bot Jan 14, 2020

gitgitgadget bot Jan 15, 2020

gitgitgadget bot Jan 15, 2020

gitgitgadget bot Jan 17, 2020

gitgitgadget bot Jan 17, 2020

gitgitgadget bot commented Jan 13, 2020

dscho commented Jan 14, 2020

gitgitgadget bot commented Jan 14, 2020

gitgitgadget bot commented Jan 15, 2020

gitgitgadget bot commented Jan 17, 2020

gitgitgadget bot commented Jan 24, 2020

gitgitgadget bot commented Jan 27, 2020

gitgitgadget bot commented Jan 27, 2020

gitgitgadget bot commented Jan 30, 2020

gitgitgadget bot commented Feb 5, 2020

gitgitgadget bot commented Feb 5, 2020

gitgitgadget bot commented Feb 5, 2020

gitgitgadget bot commented Feb 5, 2020

built-in add -p: add support for the same config settings as the Perl version #175

built-in add -p: add support for the same config settings as the Perl version #175

Conversation

dscho commented Mar 28, 2019 • edited Loading

gitgitgadget bot commented Jan 13, 2020

gitgitgadget bot Jan 13, 2020

Choose a reason for hiding this comment

gitgitgadget bot Jan 13, 2020

Choose a reason for hiding this comment

gitgitgadget bot Jan 14, 2020

Choose a reason for hiding this comment

gitgitgadget bot Jan 15, 2020

Choose a reason for hiding this comment

gitgitgadget bot Jan 15, 2020

Choose a reason for hiding this comment

gitgitgadget bot Jan 17, 2020

Choose a reason for hiding this comment

gitgitgadget bot Jan 17, 2020

Choose a reason for hiding this comment

gitgitgadget bot commented Jan 13, 2020

dscho commented Jan 14, 2020

gitgitgadget bot commented Jan 14, 2020

gitgitgadget bot commented Jan 15, 2020

gitgitgadget bot commented Jan 17, 2020

gitgitgadget bot commented Jan 24, 2020

gitgitgadget bot commented Jan 27, 2020

gitgitgadget bot commented Jan 27, 2020

gitgitgadget bot commented Jan 30, 2020

gitgitgadget bot commented Feb 5, 2020

gitgitgadget bot commented Feb 5, 2020

gitgitgadget bot commented Feb 5, 2020

gitgitgadget bot commented Feb 5, 2020

dscho commented Mar 28, 2019 •

edited

Loading