Details
- Feature
- Status: Dismissed
- Minor
- Resolution: Fixed
- None
Description
Current Situation
- The current behavior on failure of a Master (server) and on connection loss between the Master and an Agent is described in the attached test case matrix (see JS-1518_Handling of Agent tasks in case of failure and connection loss.xlsx).
Desired Behavior for Use Cases
- Network Connection Loss (covered by JS-1524)
  - Agent
    - Tasks continue to run with the Agent.
    - The Agent stores log output and the execution history of tasks in local files (see JS-1521).
  - Master
    - The Master repeatedly attempts to re-connect to the Agent.
    - In case of successful re-connect:
      - The Agent reports the log information of running and completed tasks to the Master.
      - The Agent reports the execution history of running and completed tasks to the Master.
      - The Master adds the information received from re-connected Agents to its history.
      - The Master reports the running tasks of an Agent after re-connect.
    - In case of unsuccessful re-connect:
      - Tasks are killed by the Agent (see JS-1523).
- Master Service Failure
- Database Service Failure
  - Same as in case of Master Service Failure.
  - The Master retries connecting to the database every minute.
    - For a JobScheduler Active Cluster the re-connect has to take place within 120s (see JS-1032).
    - For a JobScheduler Passive Cluster or single instance the re-connect attempts can be configured for an unlimited number of times.
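The database re-connect rules above can be sketched as a small retry policy. This is only an illustration in Python, not JobScheduler code: the names `ReconnectPolicy`, `policy_for` and `reconnect`, the mode strings and the `max_attempts` parameter are all hypothetical. Only the one-minute interval and the 120s limit for an Active Cluster come from the issue text. The sketch assumes that an attempt made exactly at the 120s mark still counts as "within 120s".

```python
import time
from dataclasses import dataclass
from typing import Callable, Optional

RETRY_INTERVAL_S = 60         # "every minute" (from the issue text)
ACTIVE_CLUSTER_LIMIT_S = 120  # Active Cluster limit (from the issue text, see JS-1032)


@dataclass
class ReconnectPolicy:
    """Hypothetical container for the re-connect rules of one operating mode."""
    interval_s: int
    time_limit_s: Optional[int]  # None means no time limit
    max_attempts: Optional[int]  # None means an unlimited number of attempts


def policy_for(mode: str, max_attempts: Optional[int] = None) -> ReconnectPolicy:
    """Map an (assumed) mode name to the rules stated in the issue."""
    if mode == "active_cluster":
        return ReconnectPolicy(RETRY_INTERVAL_S, ACTIVE_CLUSTER_LIMIT_S, None)
    if mode in ("passive_cluster", "single"):
        return ReconnectPolicy(RETRY_INTERVAL_S, None, max_attempts)
    raise ValueError(f"unknown mode: {mode}")


def reconnect(policy: ReconnectPolicy,
              try_connect: Callable[[], bool],
              sleep: Callable[[float], None] = time.sleep) -> bool:
    """Retry try_connect() until it succeeds or the policy is exhausted."""
    elapsed = 0
    attempts = 0
    while True:
        attempts += 1
        if try_connect():
            return True
        # Give up once the configured number of attempts is used up.
        if policy.max_attempts is not None and attempts >= policy.max_attempts:
            return False
        # Give up if the next attempt would fall outside the time limit.
        if policy.time_limit_s is not None and elapsed + policy.interval_s > policy.time_limit_s:
            return False
        sleep(policy.interval_s)
        elapsed += policy.interval_s
```

Passing `sleep` as a parameter keeps the loop testable without waiting: a caller could, for example, run `reconnect(policy_for("active_cluster"), my_connect)` in production and substitute a no-op sleep in tests, where `my_connect` stands for whatever function opens the database connection.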
Maintainer Notes
- This feature is dismissed as the resilience capabilities will be completely reworked with JobScheduler 2.0.
- Agents as of release 2.0 will include semi-autonomous behavior that keeps working during connection loss and during an outage of a Master.
- We will therefore not invest in adding this functionality to a Master that will be in use for a limited time only (release 1.12 will be the last minor release before the major release 2.0).
Attachments
Issue Links
- requires
  - JS-1521 Universal Agent stores transient log files (Released)
  - JS-1522 Master starts in paused mode after failure (Released)
  - JS-1524 Universal Agent supports reconciliation after connection loss (Released)
  - JS-1523 Universal Agent kills task if connection to Master gets lost (Released)