OPS SAM Tests

Atlas SAM Tests

Alice SAM Tests

CMS SAM Tests

LHCb SAM Tests

Notices

DateAdded bySubjectDescription
09/02/2010 09:38 John Kelly network outage for router work Our schedules downtime for router work overran this morning. We expect all services to be working now.
08/02/2010 15:23 Gareth Smith Status Update. Power issues that led to At Risk over weekend now stable. We await more information on root cause.

Reminder of site network intervention tomorrow morning (Tuesday 9th) as declared in GOC DB.
08/02/2010 14:28 Tiju Idiculla Status No known issues.
05/02/2010 17:40 John Kelly update: partial power failure at RAL Hi,
We have re-routed power to the air-conditioning in the LPD (low power density) room. So the air conditioning is now working normally. We are putting the site in an 'at risk state' until Monday when we expect this to be fully investigated and understood.
There has been no loss of service and the RAL tier1 is running normally.

regards,

John Kelly
05/02/2010 16:40 Tiju Idiculla partial power failure at RAL HI,
We have just experienced a partial power failure here at RAL. Apparently we have lost one phase. None of the machine supplies are affected but we have lost air conditioning in our LPD (low power density) machine room.
Operation have been informed and we are working to restote power. However unless we restore the aircon soon, we will have to power off machines. The LPD room contains castor core nodes and the tape robot. So RAL should be considered 'ar risk' until this is resolved.

I will update this as developments occur.

regards,

John Kelly

[Click here to add]

Disk servers in Intervention

Machine VO DiskPool dxtx
gdss282CMScmsFarmReadd0t1
gdss294CMScmsFarmReadd0t1
gdss364CMScmsFarmReadd0t1


Downtimes

No ongoing downtime

Downtimes during next 2 weeks

Downtime IDHostsStart timeEnd timeSeverityDescription


Castormon

Migration Queues

Drive Usage

Disk LSF

This version will not display properly in Internet Explorer. Please try Firefox or Safari.