Isolate docker interactions into module and simplify test docker containers #52

ImogenBits · 2022-06-26T13:37:32Z

Following your request of splitting up the changes in #49 (here) into multiple smaller PRs, this is the first of them where I tried to isolate all the interactions with docker into its own module and simplify the docker containers used for testing.
I also made some very minor modifications to related parts of the code, such as fixing up type annotations, passing the currently fighting teams as an argument to FightHandler.fight() instead of them being part of its state, and adding the flake8 config file.

Benezivas · 2022-06-28T13:58:02Z

Thank you very much for taking the time to split up the pull request. These changes look a lot more manageable. I will add replies to this comment to collect more general questions and possible remaining issues that I find while checking the code that cannot be linked to a specific part of a file.

First observations:

Do you assume a changed format of the passed problems or is this pull request still compatible with the problem format of version 3.0.2? I get an error when trying to pass the biclique problem as a parameter (see below).
If the docker daemon is not running, it seems that this is communicated, but the program seems to try to continue its run. The run should stop instead.

I will continue to review the code in iterations, but I am confident that we can apply most of the suggested changes as-is.

battle ../algobattle-problems/problems/biclique
You can find the log files for this run in /home/henri/.algobattle_logs/2022-06-28_15:41:45.log
Running a benchmark to determine your machines I/O overhead to start and stop docker containers...
Maximal measured runtime overhead is at 1.91 seconds. Adding this amount to the configured runtime.
####################  Running Battle 1/5  ####################
==================== Iterative Battle, Instanze Size Cap: 50000 ====================
=============== Instance Size: 5/50000 ===============
Traceback (most recent call last):
  File "/home/henri/.local/bin/battle", line 164, in <module>
    results = match.run()
  File "/home/henri/.local/lib/python3.10/site-packages/algobattle/match.py", line 43, in run
    self.battle_wrapper.run_round(self.fight_handler, matchup)
  File "/home/henri/.local/lib/python3.10/site-packages/algobattle/battle_wrapper.py", line 35, in wrapper
    return function(self, *args, **kwargs)
  File "/home/henri/.local/lib/python3.10/site-packages/algobattle/battle_wrappers/iterated.py", line 80, in run_round
    approx_ratio = fight_handler.fight(matchup, n)
  File "/home/henri/.local/lib/python3.10/site-packages/algobattle/fight_handler.py", line 40, in fight
    instance, generator_solution = self._run_generator(matchup.generator, instance_size)
  File "/home/henri/.local/lib/python3.10/site-packages/algobattle/fight_handler.py", line 83, in _run_generator
    encoded_output = team.generator.run(str(instance_size), self.timeout_generator, scaled_memory, self.cpus)
AttributeError: 'str' object has no attribute 'run'

ImogenBits · 2022-06-28T20:43:25Z

The format of the problems remains the same as in the 4.0 version, which afaik is the same as in 3.0.2, the bug you saw was caused by the input paths not being actual Path objects and me incorrectly assuming they were. the commit i just pushed should fix it.

I can't observe the same behaviour regarding docker, it always exits right away for me. I also can't see why the code might behave that way, it raises SystemExit which is not caught anywhere. can you describe the error more?

ImogenBits · 2022-08-28T22:49:52Z

After way more research and testing than I expected, I implemented the changes we discussed last week. The gist is that the docker module now uses the official docker api instead of shell commands. In addition I also moved the setup.py based build to a pyproject.toml one, this is now much more user friendly, future proof and installs on windows without any issues or additional user actions.

There are two more or less breaking changes here though:

I renamed the cli command from battle to algobattle. I always found it unintuitive that the command is different from the project name and changing this now is a single line change in the pyproject.toml so I took the liberty. If you want to stick with battle I have no issues with changing it back.
Because of limitations in the docker api lib I changed the way the containers interact with the program! Instead of using STDIN and STDOUT, it now creates a file /input in the root directory of the container and reads from /output there. I think this is a great change for the students too, when I was taking the course we basically always had our executed shell file be cat > input; run_program; cat output since interacting with files like that was much easier. All of the example docker containers you've provided also follow this structure, so migrating everything should be very easy.

There also are a couple further things of note:
A docker container that times out now no longer automatically gets treated as having produced no output. Instead we still check the output file and read from it. During the course we often had the problem that we were incrementally generating solutions, but because going even slightly over the timeout would mean a total fail, we had to create our own timeout a couple seconds shorter and kill our program then. With this change the students can now arbitrarily rewrite their output and the battle will always use the last solution generated within the timeout.
Because we now don't use shell commands to interface with docker the timeouts are much more precise since we can start and stop them with more control. On my machine the total overhead used to be roughly 1.5 to 2 seconds, it now is only 0.5-1.5. But more importantly the overhead the timeouts can't account for is only about 0.06 seconds! Because of this I removed the measure_runtime_overhead function and it's associated cli interface, it's just not really needed anymore and the values it measures vastly overestimate the actual time we need to give to the containers in addition to the timeout.
As you can see in the pyproject.toml this project uses a fork of the docker library instead of the official PyPI version. This is because it needs to implements a named pipe socket api to communicate with the docker daemon, but on windows this implementation is pretty broken. I've implemented a working version myself with all the features we need for this project and I'll try to make a PR to get it into the official library but idk how long that'll take.

Benezivas · 2022-08-29T11:35:13Z

Thank you for commiting these changes, I will review them this week in detail and comment further here.

A short test shows that the CI pipeline currently seems to fail, could you look into it? (I have not figured out yet why the automated CI tests are currently not launched in this PR)

I do not mind the two breaking changes, they seem reasonable and since we are aiming for the next full release, such changes are to be expected. Please have an eye on not straying too far from the purpose of this specific pull request, I really appreciate your ideas and time to implement them, but it becomes much more complicated to properly review them if one pull request wants to do too many different things beyond its scope.

ImogenBits · 2022-08-31T20:17:16Z

CI runs here now (reason was that the on field of the github actions yaml specified it to only run in the main and develop branches) and passes without issues

ImogenBits · 2022-09-01T12:48:11Z

I made #53 and #54 to split out the unrelated changes in here, if you merge them I can then either just rebase this branch to be based on them or also split out the other small and somewhat unrelated changes to the battle script (having it handle signals using the python error handling rather than the signal handler and having the functions definitions outside of the main function) and the teams objects (though that would be kinda messy since they are mainly about changing the way the battle script interacts with the docker containers)

Benezivas · 2022-09-01T13:28:10Z

I have merged #53 and #54 into the release candidate. I would suggest not splitting out the changes to the teams object, as this is closely correlated to the docker changes and thus arguably in scope of this PR. If the required effort is reasonable I would advocate for splitting out the changes to the battle script (that are not related to the docker changes) to keep the PR clean.

ImogenBits · 2022-09-02T10:55:41Z

rebased the branch on the new 4.0 branch, file diffs are now properly only the docker changes.
the commit history is a bit messed up because of that but I tried to preserve it as best I could and the old branch is still at the corresponding tag in my repo.

…xecuting the battle script from the algobattle folder directly

Benezivas · 2022-09-12T08:58:34Z

Thank you for taking the time to do these additional changes. I have reviewed the changes and think that they are helpful.

For clarification - since I have not worked with python's docker module so far: How do we ensure that running docker containers are stopped upon early program termination, e.g. when receiving a SIGTERM from the os? Previously, this was done with the sigh module calling the kill_spawned_docker_containers function, which is removed in this PR.
I assume they will be spawned as subprocesses of the algobattle process and killed when the parent process dies?

Once this is cleared up, I am ready to merge these changes.

ImogenBits · 2022-09-12T10:28:07Z

The python docker lib is basically a very thin wrapper around the docker engine api (which largely mirrors the docker cli arguments but with some oddities such as different defaults and some added/removed higher level commands). In particular here we essentially invoke docker rm -f CONTAINER, which will both kill the container and then remove it.
This code will run when the algobattle script is killed early because the default python signal handler raises a KeyboardInterrupt when that happens, which will be propagated from that loop, through the finally clause until we catch it in the battle module. This does mean that if you circumvent the python error handling process the spawned containers will not be stopped, but afaik the only sensible way for that to happen is if you step through it in debugging and cancel it early.

ImogenBits force-pushed the easy_changes branch 2 times, most recently from a402a9b to ee5ad7b Compare August 28, 2022 22:22

Benezivas self-assigned this Aug 29, 2022

ImogenBits mentioned this pull request Sep 1, 2022

Minor cleanup to solve typing issues #56

Merged

ImogenBits added 19 commits September 2, 2022 12:21

simplify test problems

c282759

add docker module

3754759

use new team module

4de08f5

use new measure_runtime_overhead

d3ec5b2

use docker module instead of run_subprocess

9188eca

remove unneeded call docker_running

36c6dab

remove unneeded build_docker_container

24dd936

Team constructor can take already built images

2f8b89d

fix fight handler tests

a8310cc

remove signal handler and cleanup imports

2a71fbc

fix util tests

acd58e8

move docker tests into own module

8e6fb9e

fix docker tests

053a7d0

raise an error when container times out

297ac12

move runtime test to docker tests

da83ac0

change fight_handler to use proper exception handling

4a90bf1

autoformat

d5d3024

specify matchup as argument instead of state

247290e

change parser to use str and not bytes

845f204

ImogenBits added 21 commits September 2, 2022 12:32

automatically detect OS when installing

fd14164

use docker api to run containers

2465ae8

update included docker containers to use the new io model

a1b32a7

use docker api in Image.remove()

fc96083

return partial output when container times out

77b2c9e

remove intermediary containers

7f0b30c

minor bugfixes

4356ea5

improve docker tests

a1c3899

improved DockerError messaging

3bb852b

create empty input file when empty input is specified

1316692

remove delaytest as it's not needed anymore

f5938b6

fix debug formatting

b1c6a6d

put tar file interaction into their own functions

298c445

cleanup docker_wrapper.py

1b71fc7

fix test docker containers

46c92c9

properly install python 3.10 in CI

018804f

slightly change formatting and flake8 spec

dd4a2b2

install dependencies when running tests in CI

80e8de8

prepare rebase

acc2c03

minor cleanup

434b106

fixup rebase

9086998

ImogenBits force-pushed the easy_changes branch from 0630628 to 9086998 Compare September 2, 2022 10:43

properly cast all input paths to Path objects

99a25b7

rename docker to docker_util to avoid shadowing the docker lib when e…

df1e3ff

…xecuting the battle script from the algobattle folder directly

Benezivas merged commit 69a62b2 into Algorithmic-Battle:4.0.0-rc Sep 12, 2022

ImogenBits deleted the easy_changes branch September 13, 2022 12:19

ImogenBits mentioned this pull request Jan 5, 2023

Refactoring and Maintainability improvements #49

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Isolate docker interactions into module and simplify test docker containers #52

Isolate docker interactions into module and simplify test docker containers #52

Uh oh!

ImogenBits commented Jun 26, 2022

Uh oh!

Benezivas commented Jun 28, 2022

Uh oh!

ImogenBits commented Jun 28, 2022

Uh oh!

ImogenBits commented Aug 28, 2022

Uh oh!

Benezivas commented Aug 29, 2022

Uh oh!

ImogenBits commented Aug 31, 2022

Uh oh!

ImogenBits commented Sep 1, 2022

Uh oh!

Benezivas commented Sep 1, 2022

Uh oh!

ImogenBits commented Sep 2, 2022

Uh oh!

Benezivas commented Sep 12, 2022

Uh oh!

ImogenBits commented Sep 12, 2022

Uh oh!

Uh oh!

Isolate docker interactions into module and simplify test docker containers #52

Isolate docker interactions into module and simplify test docker containers #52

Uh oh!

Conversation

ImogenBits commented Jun 26, 2022

Uh oh!

Benezivas commented Jun 28, 2022

Uh oh!

ImogenBits commented Jun 28, 2022

Uh oh!

ImogenBits commented Aug 28, 2022

Uh oh!

Benezivas commented Aug 29, 2022

Uh oh!

ImogenBits commented Aug 31, 2022

Uh oh!

ImogenBits commented Sep 1, 2022

Uh oh!

Benezivas commented Sep 1, 2022

Uh oh!

ImogenBits commented Sep 2, 2022

Uh oh!

Benezivas commented Sep 12, 2022

Uh oh!

ImogenBits commented Sep 12, 2022

Uh oh!

Uh oh!