At the moment, it is not possible to automate retrying of work orders, perhaps this might be possible once Update RunController and JSON api for GovStack compliance · Issue #1656 · OpenFn/lightning · GitHub is done
In addition to the logs, could you also share the lightning and ws-worker versions that you’re using?
While at it, you can try and go through the Managing Lost Runs guide and let us know if it helps.