Many orchestrator failures are simple to fix, we should add a system to the pusher that picks up failed profiles and tries to recover them based on their last error.
For example, a database connection error can simply be retried a period of time later.
Basically we should try to automate the manual recovery steps that we're having to do every day based on eagle alerts