Add support to checkpoint and restore processes. Checkpointing processes can be used for fault tolerance and/or load balancing.
Checkpointing a process in regular intervals can help to restart a process if it might crash to resume/restart/restore the calculation without too much data lost. Providing this ability transparent at the OS level removes the need to implement this functionality for all processes manually.
Checkpointing and restoring a process to another system can be used to migrate a process, process tree or container to another system to distribute the load during the runtime and also for maintenance without service interruption like it is possible with virtual machines.
- Name: Adrian Reber
- Email: <email@example.com>
- Targeted release: Fedora 19
- Last updated: 2012-10-24
- Percentage of completion: 0%
Checkpointing/restore, as mentioned above, can be used for fault tolerance and load distribution.