Joseph Marques - 13/Sep/08 01:12 AM
rev1449 - this commit changes the alert cache loading from server-start time to agent-connection time;
rev1452 (mazz) - when the agent successfully registers, it should also tell the server it wants to "connect" to it.
I'm still seeing the same issue running on rev1452 code. When I start the agent normally (without --clean), I do not see the connectAgent method being called. This NEEDS to be done BEFORE any queued commands get sent up to the server. If a queued command is sent before the server is ready, the cache might not be loaded yet, which means that one or more alerts that should have fired will be missed.
the fix I plan on implementing for
This is fixed - we now send the connect-agent command to the server whenever we restart the agent. This, in addition to sending the connect-agent command after we register, should get the agent to properly connect to the server under all conditions.
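To make the ordering requirement concrete, here is a minimal sketch in Java. It is not the actual RHQ code: the class names, the `start()` and `queue()` methods, and the recording server are all hypothetical stand-ins. It only shows the idea that the connect call goes out on every start, before any queued command is flushed.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.List;
import java.util.Queue;

// Hypothetical sketch only: none of these types are taken from the RHQ code base.
public class ConnectBeforeFlush {

    /** Stand-in for the server endpoint the agent talks to. */
    interface Server {
        void connectAgent(String agentName);
        void receive(String command);
    }

    /** Records calls in arrival order and rejects commands that arrive too early. */
    static class RecordingServer implements Server {
        final List<String> calls = new ArrayList<>();
        private boolean cacheLoaded = false;

        @Override
        public void connectAgent(String agentName) {
            cacheLoaded = true; // stands in for loading the alert condition cache
            calls.add("connect:" + agentName);
        }

        @Override
        public void receive(String command) {
            if (!cacheLoaded) {
                throw new IllegalStateException("cache not loaded yet; alerts could be missed");
            }
            calls.add("command:" + command);
        }
    }

    /** Agent that always connects before flushing its queued commands. */
    static class Agent {
        private final String name;
        private final Server server;
        private final Queue<String> queued = new ArrayDeque<>();

        Agent(String name, Server server) {
            this.name = name;
            this.server = server;
        }

        void queue(String command) {
            queued.add(command);
        }

        /** Called on every start or restart, whether or not registration happened. */
        void start() {
            server.connectAgent(name);       // connect first, unconditionally
            while (!queued.isEmpty()) {
                server.receive(queued.poll()); // only then send the queued commands
            }
        }
    }

    public static void main(String[] args) {
        RecordingServer server = new RecordingServer();
        Agent agent = new Agent("agent-1", server);
        agent.queue("availability-report");
        agent.start();
        System.out.println(server.calls);
    }
}
```

If `start()` flushed the queue before calling `connectAgent`, the recording server in this sketch would throw, which mirrors the missed-alert case described above.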
rev1547 - now see this in the log after the last step of the procedure:
2008-09-18 13:18:12,156 INFO [org.rhq.enterprise.server.core.AgentManagerBean] Agent with name [dev16.qa.atl2.redhat.com] just went down
2008-09-18 13:18:24,843 INFO [org.rhq.enterprise.server.alert.engine.AlertConditionCache] Loading Alert Condition Caches for agent[id=500050]...
2008-09-18 13:18:24,843 INFO [org.rhq.enterprise.server.alert.engine.AlertConditionCache] Loading Alert Condition Composites of type 'Resource Availability'
2008-09-18 13:18:24,843 INFO [org.rhq.enterprise.server.alert.AlertConditionManagerBean] Found 0 elements of type 'Resource Availability', list was size 0
2008-09-18 13:18:24,843 INFO [org.rhq.enterprise.server.alert.engine.AlertConditionCache] Loading Alert Condition Composites of type 'Measurement Threshold'
2008-09-18 13:18:24,859 INFO [org.rhq.enterprise.server.alert.AlertConditionManagerBean] Found 0 elements of type 'Measurement Threshold', list was size 0
2008-09-18 13:18:24,859 INFO [org.rhq.enterprise.server.alert.engine.AlertConditionCache] Loading Alert Condition Composites of type 'Measurement Baseline'
2008-09-18 13:18:24,859 INFO [org.rhq.enterprise.server.alert.AlertConditionManagerBean] Found 0 elements of type 'Measurement Baseline', list was size 0
2008-09-18 13:18:24,859 INFO [org.rhq.enterprise.server.alert.engine.AlertConditionCache] Loading Alert Condition Composites of type 'Measurement Value Change'
2008-09-18 13:18:24,875 INFO [org.rhq.enterprise.server.alert.AlertConditionManagerBean] Found 0 elements of type 'Measurement Value Change', list was size 0
2008-09-18 13:18:24,875 INFO [org.rhq.enterprise.server.alert.engine.AlertConditionCache] Loading Alert Condition Composites of type 'Measurement Trait'
2008-09-18 13:18:24,875 INFO [org.rhq.enterprise.server.alert.AlertConditionManagerBean] Found 0 elements of type 'Measurement Trait', list was size 0
2008-09-18 13:18:24,875 INFO [org.rhq.enterprise.server.alert.engine.AlertConditionCache] Loading Alert Condition Composites of type 'Control Action'
2008-09-18 13:18:24,875 INFO [org.rhq.enterprise.server.alert.AlertConditionManagerBean] Found 0 elements of type 'Control Action', list was size 0
2008-09-18 13:18:24,890 INFO [org.rhq.enterprise.server.alert.engine.AlertConditionCache] Loading Alert Condition Composites of type 'Log Event'
2008-09-18 13:18:24,890 INFO [org.rhq.enterprise.server.alert.AlertConditionManagerBean] Found 0 elements of type 'Log Event', list was size 0
2008-09-18 13:18:24,906 INFO [org.rhq.enterprise.server.alert.engine.AlertConditionCache] Loaded Alert Condition Caches for agent[id=500050]
2008-09-18 13:18:24,906 INFO [org.rhq.enterprise.server.alert.engine.AlertConditionCache] UnloadStats for agent[id=500050]: AlertConditionCacheStats[ created=0, updated=0, deleted=0, matched=0, age=63ms ]
2008-09-18 13:18:24,906 INFO [org.rhq.enterprise.server.alert.engine.AlertConditionCache] ReloadStats for agent[id=500050]: AlertConditionCacheStats[ created=0, updated=0, deleted=0, matched=0, age=63ms ]
2008-09-18 13:18:24,937 INFO [org.rhq.enterprise.server.core.CoreServerServiceImpl] Agent [dev16.qa.atl2.redhat.com] has connected to this server.
2008-09-18 13:18:25,875 INFO [org.rhq.enterprise.server.core.CoreServerServiceImpl] Got agent registration request for existing agent: dev16.qa.atl2.redhat.com[10.18.0.79:16163] - Will not regenerate a new token
2008-09-18 13:18:26,125 INFO [org.rhq.enterprise.server.alert.engine.AlertConditionCache] Loading Alert Condition Caches for agent[id=500050]...
2008-09-18 13:18:26,125 INFO [org.rhq.enterprise.server.alert.engine.AlertConditionCache] Loading Alert Condition Composites of type 'Resource Availability'
2008-09-18 13:18:26,125 INFO [org.rhq.enterprise.server.alert.AlertConditionManagerBean] Found 0 elements of type 'Resource Availability', list was size 0
2008-09-18 13:18:26,125 INFO [org.rhq.enterprise.server.alert.engine.AlertConditionCache] Loading Alert Condition Composites of type 'Measurement Threshold'
2008-09-18 13:18:26,125 INFO [org.rhq.enterprise.server.alert.AlertConditionManagerBean] Found 0 elements of type 'Measurement Threshold', list was size 0
2008-09-18 13:18:26,125 INFO [org.rhq.enterprise.server.alert.engine.AlertConditionCache] Loading Alert Condition Composites of type 'Measurement Baseline'
2008-09-18 13:18:26,140 INFO [org.rhq.enterprise.server.alert.AlertConditionManagerBean] Found 0 elements of type 'Measurement Baseline', list was size 0
2008-09-18 13:18:26,140 INFO [org.rhq.enterprise.server.alert.engine.AlertConditionCache] Loading Alert Condition Composites of type 'Measurement Value Change'
2008-09-18 13:18:26,140 INFO [org.rhq.enterprise.server.alert.AlertConditionManagerBean] Found 0 elements of type 'Measurement Value Change', list was size 0
2008-09-18 13:18:26,140 INFO [org.rhq.enterprise.server.alert.engine.AlertConditionCache] Loading Alert Condition Composites of type 'Measurement Trait'
2008-09-18 13:18:26,156 INFO [org.rhq.enterprise.server.alert.AlertConditionManagerBean] Found 0 elements of type 'Measurement Trait', list was size 0
2008-09-18 13:18:26,156 INFO [org.rhq.enterprise.server.alert.engine.AlertConditionCache] Loading Alert Condition Composites of type 'Control Action'
2008-09-18 13:18:26,156 INFO [org.rhq.enterprise.server.alert.AlertConditionManagerBean] Found 0 elements of type 'Control Action', list was size 0
2008-09-18 13:18:26,156 INFO [org.rhq.enterprise.server.alert.engine.AlertConditionCache] Loading Alert Condition Composites of type 'Log Event'
2008-09-18 13:18:26,172 INFO [org.rhq.enterprise.server.alert.AlertConditionManagerBean] Found 0 elements of type 'Log Event', list was size 0
2008-09-18 13:18:26,172 INFO [org.rhq.enterprise.server.alert.engine.AlertConditionCache] Loaded Alert Condition Caches for agent[id=500050]
2008-09-18 13:18:26,172 INFO [org.rhq.enterprise.server.alert.engine.AlertConditionCache] UnloadStats for agent[id=500050]: AlertConditionCacheStats[ created=0, updated=0, deleted=0, matched=0, age=47ms ]
2008-09-18 13:18:26,172 INFO [org.rhq.enterprise.server.alert.engine.AlertConditionCache] ReloadStats for agent[id=500050]: AlertConditionCacheStats[ created=0, updated=0, deleted=0, matched=0, age=47ms ]
2008-09-18 13:18:26,187 INFO [org.rhq.enterprise.server.core.CoreServerServiceImpl] Agent [dev16.qa.atl2.redhat.com] has connected to this server.