-
Notifications
You must be signed in to change notification settings - Fork 60
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CI: Improve test.summary rule #462
CI: Improve test.summary rule #462
Conversation
Codecov Report
@@ Coverage Diff @@
## dev/gfdl #462 +/- ##
=========================================
Coverage 38.03% 38.03%
=========================================
Files 269 269
Lines 77554 77554
Branches 14319 14319
=========================================
+ Hits 29494 29496 +2
+ Misses 42709 42707 -2
Partials 5351 5351 see 1 file with indirect coverage changes 📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more |
I should say that while I think this is an overall improvement, I am still unsure if it fixes the original problem, which I have yet to actually replicate. The issue is that
and then also raises an error. I believe this is because (for example) Despite the supposed portability of this method, I think |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
These changes should be helpful for providing more information, and hopefully they will reduce the rates of lustre-based failures as intended.
The test.summary rule was causing errors on our lustre filesystem, even when all tests were successful. While the reason is still not yet clear, I suspect it is related to our `if ls results/*` tests, which may not behave as intended on all filesystems. To somewhat secure us against this, I have replaced the top-level check with `if [ -d results ]` to check if there are any output from failed tests. The subdirectory tests still remain, but at least these will only happen if any actual results exists. Other minor changes: - The script to generate the summary was moved out of the Makefile and into a separate script. - Unrelated to these changes, error output was extended from 20 to 40 lines, to provide more readable backtrace output.
a259dc2
to
95ca617
Compare
These tests failed in the CI; the I'll leave this open a bit longer to investigate the problem, but if the problem gets too deep then I may need to retract it. |
I'm going to close this for now, since I may need to make a lot of changes and don't want to spam the channel. The good news is that whatever is going on is now reproducible. |
I believe at least part of the problem here is that GNU and Intel are sharing the runner work directory on Gaea (configured with I don't see any benefit to even using |
The test.summary rule was causing errors on our lustre filesystem, even when all tests were successful. While the reason is still not yet clear, I suspect it is related to our
if ls results/*/*
tests, which may not behave as intended on all filesystems.To somewhat secure us against this, I have replaced the top-level check with
if [ -d results ]
to check if there are any output from failed tests. The subdirectory tests still remain, but at least these will only happen if any actual results exists.Other minor changes:
The script to generate the summary was moved out of the Makefile and into a separate script.
Unrelated to these changes, error output was extended from 20 to 40 lines, to provide more readable backtrace output.