original task entry
OCG has been decommissioned already as a service, the hosts have been migrated to role::spare::system and all decommission steps up to Steps for DC-OPS (with network switch access) have been completed already.
The servers are out of warranty, so there is no point in returning them to the spares pool.
decom checklist for each system
ocg1001:
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommmission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/heira/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPPTABLE STEPS
- - disable puppet on host
- - power down host
- - disable switch port
- - switch port assignment noted on this task (for later removal) ocg1001:asw-c-eqiad:ge-7/0/9
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/#/c/424148/
- - remove production dns entries https://gerrit.wikimedia.org/r/#/c/424149/
- - puppet node clean, puppet node deactivate
END NON-INTERRUPPTABLE STEPS
- - system disks wiped (by onsite)
- - IF DECOM: system unracked and decommissioned (by onsite), update racktables with result
- - IF DECOM: switch port configration removed from switch once system is unracked.
- - IF DECOM: add system to decommission tracking google sheet
- - IF DECOM: mgmt dns entries removed.
ocg1002:
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommmission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/heira/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPPTABLE STEPS
- - disable puppet on host
- - power down host
- - disable switch port
- - switch port assignment noted on this task (for later removal) ocg1002:asw-d-eqiad:ge-3/0/4
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/#/c/424148/
- - remove production dns entries https://gerrit.wikimedia.org/r/#/c/424149/
- - puppet node clean, puppet node deactivate
END NON-INTERRUPPTABLE STEPS
- - system disks wiped (by onsite)
- - IF DECOM: system unracked and decommissioned (by onsite), update racktables with result
- - IF DECOM: switch port configration removed from switch once system is unracked.
- - IF DECOM: add system to decommission tracking google sheet
- - IF DECOM: mgmt dns entries removed.
ocg1003:
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommmission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/heira/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPPTABLE STEPS
- - disable puppet on host
- - power down host
- - disable switch port
- - switch port assignment noted on this task (for later removal) ocg1003:asw-d-eqiad:ge-3/0/5
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/#/c/424148/
- - remove production dns entries https://gerrit.wikimedia.org/r/#/c/424149/
- - puppet node clean, puppet node deactivate
END NON-INTERRUPPTABLE STEPS
- - system disks wiped (by onsite)
- - IF DECOM: system unracked and decommissioned (by onsite), update racktables with result
- - IF DECOM: switch port configration removed from switch once system is unracked.
- - IF DECOM: add system to decommission tracking google sheet
- - IF DECOM: mgmt dns entries removed.