Gracious kill hook #914

hejung · 2020-05-04T11:29:21Z

This builds on #911 and #755. It should only be merged after them.

This adds a GraciousKillHook that writes the current simulation state to storage and ends the simulation loop by raising a GraciousKillError when a given maximum walltime is reached.
You can also pass a custom callable as final_call at hook construction, in case you want to do anything else than closing the storage when the simulation is terminated.

Example usage (assuming your steps are not superfast the code below should end the simulation after something sligthly smaller than 3 min 34 s):

kill_hook = GraciousKillHook("3 minutes 34 seconds")
# sampler is a `PathSimulator`
sampler.attach_hook(kill_hook)
try:
   sampler.run(2000)
except GraciousKillError:
   print("Simulation ended due to maximum walltime reached")

dwhswenson · 2021-07-03T21:20:46Z

With #911 in, checking on this one: it looks like the code is largely completed. It will need some conflict resolution (which is hopefully easy to accomplish -- probably some are cases of "added two functions in the same location of the old code").

Main question: Is this close enough (and do you, @hejung, have time enough to finish it up) that it is worth holding off on the 1.5 release until this is included? Thematically, it would be nice to include it with the other work on hooks in 1.5, but if it will take more than a week to get done, it might be better to include it in 1.6.

…s reached

hejung · 2021-07-04T10:23:14Z

With #911 in, checking on this one: it looks like the code is largely completed. It will need some conflict resolution (which is hopefully easy to accomplish -- probably some are cases of "added two functions in the same location of the old code").

I cherry-picked the commit that introduces the GraciousKillHook onto the current master and changed the tests to use a patched time.time (as for the PathSamplingOutputHook).

Main question: Is this close enough (and do you, @hejung, have time enough to finish it up) that it is worth holding off on the 1.5 release until this is included? Thematically, it would be nice to include it with the other work on hooks in 1.5, but if it will take more than a week to get done, it might be better to include it in 1.6.

I would say it is done.
Only thing I noticed that we might want to consider, is that the GraciousKillHook is the only hook that uses logging. I did that to see in the ops main simulation log if the hook ended the simulation.
However I realized now that the reason the other hooks do not use logging might be that logging and dask do not play nicely together (similar to logging and multiprocessing), but I have no experience with logging from dask. @dwhswenson Should I rather convert the calls to logging to print statements? (Then they would/should go to std-out and would normally be captured by the queuing system in my intended use-case)

codecov · 2021-07-04T10:35:08Z

Codecov Report

Merging #914 (b437d85) into master (08fb2bc) will increase coverage by 0.04%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master     #914      +/-   ##
==========================================
+ Coverage   81.47%   81.51%   +0.04%     
==========================================
  Files         140      140              
  Lines       15362    15399      +37     
==========================================
+ Hits        12516    12553      +37     
  Misses       2846     2846

Impacted Files	Coverage Δ
openpathsampling/beta/hooks.py	`100.00% <100.00%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 08fb2bc...b437d85. Read the comment docs.

dwhswenson

Looks good. Very tiny change to keep the line lengths down -- you may want the link listed in my comment instead of the one in the single-click suggestion -- and some unused as clauses in tests.

As to logging vs. writing to an output stream:

The general advantage of logging is that it allows the user to control where/whether/how the output is reported. We limit user options on that in progress reporting because it might mess up our tools for refreshing the output (although users can still silence/redirect output by changing the output_stream, giving at least some choice). This use case seems reasonable for logging in serial.
I think that, in order for Dask to report task logging, you need to set up the logging inside the task, and do so in a way that gets the information back to you (i.e., use a unique file name). (I remember investigating this in detail at one point, but don't remember exactly what I could/couldn't do -- in any case, logs from remote processes are at best a headache to process). However, GraciousKill probably won't be running inside a remote Dask process, since each Dask process may be running on a different job from your queuing system (so it's hard for GraciousKill to track time remaining).

In other words, this probably isn't an issue: the GraciousKillHook will be very useful for some workflows/simulation types (e.g., running TPS or other simulations where we can't do multiple shooting moves as once), but it probably won't be used in combination with Dask-based workflows.

openpathsampling/beta/hooks.py

openpathsampling/tests/pathsimulators/test_hooks.py

hejung · 2021-07-05T11:45:19Z

Done! (I also removed the unnecessary as from the PathSamplingOutputHook...I seem to always forget that you can use with without as....)

dwhswenson · 2021-07-05T14:00:05Z

LGTM. Merging! (And I will start the 1.5.0 release process soon.)

hejung force-pushed the GraciousKillHook branch from b8c8c37 to 2306fa7 Compare September 4, 2020 08:51

dwhswenson added the feature label Sep 26, 2020

hejung force-pushed the GraciousKillHook branch from 2306fa7 to 9ebe509 Compare November 12, 2020 14:04

dwhswenson added this to the 1.5 milestone Dec 23, 2020

hejung added 2 commits July 4, 2021 11:57

Added 'GraciousKillHook' to end simulations once a maximum walltime i…

da4cd30

…s reached

Tests for GraciousKillHook now also patch time

fae89f8

hejung force-pushed the GraciousKillHook branch from 9ebe509 to fae89f8 Compare July 4, 2021 10:07

hejung changed the title ~~[WIP] Gracious kill hook~~ Gracious kill hook Jul 4, 2021

dwhswenson requested changes Jul 4, 2021

View reviewed changes

openpathsampling/beta/hooks.py Outdated Show resolved Hide resolved

openpathsampling/tests/pathsimulators/test_hooks.py Outdated Show resolved Hide resolved

review suggestions

b437d85

dwhswenson approved these changes Jul 5, 2021

View reviewed changes

dwhswenson merged commit 9aff946 into openpathsampling:master Jul 5, 2021

hejung deleted the GraciousKillHook branch July 5, 2021 15:45

dwhswenson mentioned this pull request Jul 5, 2021

OpenPathSampling 1.5 #1031

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Gracious kill hook #914

Gracious kill hook #914

Uh oh!

hejung commented May 4, 2020

Uh oh!

dwhswenson commented Jul 3, 2021

Uh oh!

hejung commented Jul 4, 2021

Uh oh!

codecov bot commented Jul 4, 2021 •

edited

Loading

Uh oh!

dwhswenson left a comment

Uh oh!

Uh oh!

Uh oh!

hejung commented Jul 5, 2021

Uh oh!

dwhswenson commented Jul 5, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Gracious kill hook #914

Gracious kill hook #914

Uh oh!

Conversation

hejung commented May 4, 2020

Uh oh!

dwhswenson commented Jul 3, 2021

Uh oh!

hejung commented Jul 4, 2021

Uh oh!

codecov bot commented Jul 4, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

dwhswenson left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

hejung commented Jul 5, 2021

Uh oh!

dwhswenson commented Jul 5, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

codecov bot commented Jul 4, 2021 •

edited

Loading