-
Notifications
You must be signed in to change notification settings - Fork 27
Automated testing of testing tools #469
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
I tested this on three interactive problems, Slot Machine (WF'2025 I), Where Am I Now? (WF'2024 L) and Lateral Damage (NWERC'2023 L). It works pretty well already (in particular it was quite easy to rediscover the bugs in the testing tools for the former two), but there are a few things that could/should be improved:
I did not yet test this on any multipass problems. |
if its identical, you can use the
hmmm any name recommendation? |
i think you can already put it inside a directory and add a |
After the most recent change, I now receive The input files in
Ah yes,
But you wouldn't want that stuff in the To give some examples for why I think more flexibility would be nice:
Of course one can always rewrite testing tools to fit the given format, but if one was instead able to specify where the testing tool is located and how it should be run, then you could perhaps do something like this inside the
|
Though thinking about it a bit more, for newly developed interactive problems it should always be possible to choose the input format such that any valid input for the testing tool is also valid input for the interactor, even if the latter features additional behaviour such as adaptiveness or different strategies of playing a game. |
ahrg... thats because jvm does not handle memory restrictions and our exec call does not see the submission but only the |
yes and I think the same is true for the there is still the issue with the name... |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Didn't have a close look at testing_tool.py:run
but otherwise lgtm.
You should add some docs though to explain precisely what is run when doing bt check_testing_tool
, and what is the expected invocation of the testing tool.
Addresses #460
This adds the new command
bt check_testing_tool
which uses the downloadable samples as well as the files found in the new directorydata/testing_tool_test
.This is a bit hacky but should work in most cases?
testing_tool<.ext>
-f <test file> <submission run command>
@paul-wild want to try this?