Make sure err isn't nil when returning failure #2861

jandubois · 2024-11-07T05:09:53Z

jandubois · 2024-11-07T05:11:19Z

A separate issue is if the pseudoLoopbackForwarder should accept connections from [::1] in addition to 127.0.0.1.

jandubois · 2024-11-07T05:23:32Z

@AkihiroSuda This fixes the crash, but doesn't actually forward to 127.0.0.1:

$ curl http://127.0.0.1:80
curl: (7) Failed to connect to 127.0.0.1 port 80 after 0 ms: Couldn't connect to server

jandubois · 2024-11-07T05:31:45Z

@AkihiroSuda This fixes the crash, but doesn't actually forward to 127.0.0.1:

Actually, it only stops accepting connections from 127.0.0.1 after it has rejected a connection from [::1] once.

This also reminds of something @Nino-K told me: when the guest agent sends a duplicate port to add, then the host agent removes the existing port forwarding. Will see if I can find this in his WIP PR.

AkihiroSuda · 2024-11-07T06:00:41Z

Thanks, can we have a test?

diff --git a/hack/test-templates.sh b/hack/test-templates.sh
index 57ff37a4..0d689312 100755
--- a/hack/test-templates.sh
+++ b/hack/test-templates.sh
@@ -307,9 +307,11 @@ if [[ -n ${CHECKS["port-forwards"]} ]]; then
                        fi
                        limactl shell "$NAME" $sudo $CONTAINER_ENGINE info
                        limactl shell "$NAME" $sudo $CONTAINER_ENGINE pull --quiet ${nginx_image}
-                       limactl shell "$NAME" $sudo $CONTAINER_ENGINE run -d --name nginx -p 8888:80 ${nginx_image}
-
-                       timeout 3m bash -euxc "until curl -f --retry 30 --retry-connrefused http://${hostip}:8888; do sleep 3; done"
+                       for hostport in 8888 80; do
+                               limactl shell "$NAME" $sudo $CONTAINER_ENGINE run -d --name nginx -p ${hostrport}:80 ${nginx_image}
+                               timeout 3m bash -euxc "until curl -f --retry 30 --retry-connrefused http://${hostip}:${hostport}; do sleep 3; done"
+                               limactl shell "$NAME" $sudo $CONTAINER_ENGINE rm -f nginx
+                       done
                fi
        fi
        set +x

jandubois · 2024-11-07T06:02:10Z

This PR now fixes the crash and also avoids closing the forwarder the first time a non-local address tries to connect to it.

$ curl 127.0.0.1
<!DOCTYPE html>
…

$ curl localhost
curl: (56) Recv failure: Connection reset by peer

$ curl 127.0.0.1
<!DOCTYPE html>
…

I guess we could use a custom error type instead of matching on the string of the error message to make it a bit more robust.

I also think we should accept connections from [::1], but that should be a separate PR.

jandubois · 2024-11-07T06:18:25Z

Fixes #2859

I don't actually know if this PR fixes the hanging problem; I only know it fixes the crash @AkihiroSuda has shown in the same issue.

And it makes the forwarding listeners a bit more robust in general against accidental removal.

I don't plan to make any more changes tonight; @rfay maybe you can test this PR (or wait until it has been merged into master so you also get my other fix in #2860)? Just so we know if this fixes all known issues, or if there is still something else.

AkihiroSuda · 2024-11-07T06:28:55Z

CI is failing

jandubois · 2024-11-07T06:34:35Z

CI is failing

Yes, because we cannot bind to ${hostip}:80 I think. I'll see if I can still fix it tonight.

jandubois · 2024-11-07T06:58:58Z

It is still failing in CI, but works for me locally. I'm out of time now and will look into this tomorrow. Unless somebody beats me to it...

jandubois · 2024-11-07T16:56:58Z

Converting back to draft since we'll switch the default back to SSH in #2864.

And I just tested curl localhost with the SSH forwarder, so it needs to work with gRPC too.

hack/test-templates.sh

jandubois · 2024-11-08T00:36:47Z

Yes, because we cannot bind to ${hostip}:80 I think. I'll see if I can still fix it tonight.

I've restricted the test binding to port 80 to only macOS, which is also the only platform where the lower port makes a difference due to the pseudoloopback forwarders.

However, since we have reverted to the SSH forwarder, this test will not currently test the gRPC implementation.

Should we run at least one of the VZ tests (default or fedora) with LIMA_SSH_PORT_FORWARDER=false?

pkg/portfwd/listener_darwin.go

.github/workflows/test.yml

Signed-off-by: Jan Dubois <[email protected]>

AkihiroSuda · 2024-11-08T02:02:50Z

Fixes #2859

Does this PR fix the hang too? Or just fixes the panic?

AkihiroSuda

Thanks

jandubois · 2024-11-08T02:05:46Z

Does this PR fix the hang too?

I have not been able to reproduce the hangs. Do you have any repro steps?

Or just fixes the panic?

It fixes the panic, adds support for IPv6, and makes sure the forwarder isn't removed when a connection is attempted from a rejected (non-loopback) address.

AkihiroSuda · 2024-11-08T02:13:52Z

I have not been able to reproduce the hangs. Do you have any repro steps?

Just running docker.yaml on the CI seems enough.
And I don’t know other way to reproduce the hang.

jandubois · 2024-11-08T02:18:31Z

Just running docker.yaml on the CI seems enough.

It seems almost all the PRs merged for v1.0.1 have been running docker.yaml without hanging, otherwise they wouldn't be green: https://github.com/lima-vm/lima/pulls?q=is%3Apr+is%3Aclosed+milestone%3Av1.0.1

Anyways, I can't remember seeing the docker test hang in this PR, but also not in any of the others I did after v1.0.0.

AkihiroSuda · 2024-11-08T02:23:55Z

I think I have restarted several failing jobs, so they are marked green

jandubois · 2024-11-08T02:25:36Z

I think I have restarted several failing jobs, so they are marked green

Ok, let's see if this is going to stop now.

jandubois force-pushed the not-localhost branch from dc8ae6d to 9be7f54 Compare November 7, 2024 05:18

jandubois force-pushed the not-localhost branch 2 times, most recently from ed5c6da to e539fd3 Compare November 7, 2024 05:58

jandubois marked this pull request as ready for review November 7, 2024 05:59

jandubois force-pushed the not-localhost branch from e539fd3 to 3cda370 Compare November 7, 2024 06:06

jandubois force-pushed the not-localhost branch from 3cda370 to bb995b9 Compare November 7, 2024 06:37

jandubois marked this pull request as draft November 7, 2024 16:52

jandubois force-pushed the not-localhost branch 4 times, most recently from 8a8eefb to a744670 Compare November 8, 2024 00:33

AkihiroSuda reviewed Nov 8, 2024

View reviewed changes

hack/test-templates.sh Outdated Show resolved Hide resolved

AkihiroSuda reviewed Nov 8, 2024

View reviewed changes

hack/test-templates.sh Show resolved Hide resolved

jandubois force-pushed the not-localhost branch 2 times, most recently from 8a08a76 to af3b7d6 Compare November 8, 2024 00:59

jandubois mentioned this pull request Nov 8, 2024

v1.0.1 planning #2872

Closed

jandubois marked this pull request as ready for review November 8, 2024 01:36

jandubois requested a review from AkihiroSuda November 8, 2024 01:36

AkihiroSuda reviewed Nov 8, 2024

View reviewed changes

pkg/portfwd/listener_darwin.go Show resolved Hide resolved

AkihiroSuda reviewed Nov 8, 2024

View reviewed changes

.github/workflows/test.yml Outdated Show resolved Hide resolved

AkihiroSuda added this to the v1.0.1 milestone Nov 8, 2024

AkihiroSuda added the area/portfwd label Nov 8, 2024

Make sure err isn't nil when returning failure

b8b400c

Signed-off-by: Jan Dubois <[email protected]>

jandubois force-pushed the not-localhost branch from af3b7d6 to b8b400c Compare November 8, 2024 01:58

AkihiroSuda approved these changes Nov 8, 2024

View reviewed changes

jandubois merged commit 0e93110 into lima-vm:master Nov 8, 2024
29 checks passed

jandubois deleted the not-localhost branch November 8, 2024 02:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make sure err isn't nil when returning failure #2861

Make sure err isn't nil when returning failure #2861

jandubois commented Nov 7, 2024

jandubois commented Nov 7, 2024

jandubois commented Nov 7, 2024

jandubois commented Nov 7, 2024 •

edited

Loading

AkihiroSuda commented Nov 7, 2024

jandubois commented Nov 7, 2024

jandubois commented Nov 7, 2024 •

edited

Loading

AkihiroSuda commented Nov 7, 2024

jandubois commented Nov 7, 2024

jandubois commented Nov 7, 2024

jandubois commented Nov 7, 2024

jandubois commented Nov 8, 2024

AkihiroSuda commented Nov 8, 2024

AkihiroSuda left a comment

jandubois commented Nov 8, 2024

AkihiroSuda commented Nov 8, 2024

jandubois commented Nov 8, 2024

AkihiroSuda commented Nov 8, 2024

jandubois commented Nov 8, 2024

Make sure err isn't nil when returning failure #2861

Make sure err isn't nil when returning failure #2861

Conversation

jandubois commented Nov 7, 2024

jandubois commented Nov 7, 2024

jandubois commented Nov 7, 2024

jandubois commented Nov 7, 2024 • edited Loading

AkihiroSuda commented Nov 7, 2024

jandubois commented Nov 7, 2024

jandubois commented Nov 7, 2024 • edited Loading

AkihiroSuda commented Nov 7, 2024

jandubois commented Nov 7, 2024

jandubois commented Nov 7, 2024

jandubois commented Nov 7, 2024

jandubois commented Nov 8, 2024

AkihiroSuda commented Nov 8, 2024

AkihiroSuda left a comment

Choose a reason for hiding this comment

jandubois commented Nov 8, 2024

AkihiroSuda commented Nov 8, 2024

jandubois commented Nov 8, 2024

AkihiroSuda commented Nov 8, 2024

jandubois commented Nov 8, 2024

jandubois commented Nov 7, 2024 •

edited

Loading

jandubois commented Nov 7, 2024 •

edited

Loading