After an evaluation, GNOME has moved from Bugzilla to GitLab. Learn more about GitLab.
No new issues can be reported in GNOME Bugzilla anymore.
To report an issue in a GNOME project, go to GNOME GitLab.
Do not go to GNOME Gitlab for: Bluefish, Doxygen, GnuCash, GStreamer, java-gnome, LDTP, NetworkManager, Tomboy.
Bug 592079 - Procedure: reboots for new kernels
Procedure: reboots for new kernels
Status: RESOLVED FIXED
Product: sysadmin
Classification: Infrastructure
Component: Packages
unspecified
Other Linux
: Normal normal
: ---
Assigned To: GNOME Sysadmins
GNOME Sysadmins
Depends on:
Blocks:
 
 
Reported: 2009-08-17 13:28 UTC by Owen Taylor
Modified: 2010-10-04 13:51 UTC
See Also:
GNOME target: ---
GNOME version: ---



Description Owen Taylor 2009-08-17 13:28:58 UTC
Application of security fixes for the kernel does not typically cause the machine to be automatically rebooted. For optimum security, we should try to ensure that all machines are rebooted into the latest installed kernel on a regular basis. It's not worth trying to figure out what kernel security fixes are important or not.

There may be some exceptions to this - for example, a virtualization container machine like VBox will typically have:
 
 - No non-sysadmin login access
 - No open ports other than ssh

And is considerably more of a nuisance to reboot, so we may want to consider not rebooting such machines except for identified critical kernel fixes.

Our record recently in having machines come up properly after a reboot is very good. Even so, reboots should typically happen during working hours at the relevant colo unless the GNOME sysadmin team has remote console/power access itself.

Possible procedure:

 - Automate collection of running kernel and latest installed kernel across all machines into a single report
 - Send a daily nag-mail if there are any systems that need reboots
 - Check-off at the monthly team meeting that the report is currently clean, and if necessary assign a volunteer to deal with scheduling reboots.

The low tech-form of this is to send per-system nag mails; I'm not quite sure how the collection of data across all systems would work, though we need similar things for a lot of montoring (e.g., disk usage). The disadvantage of per-system nag messages is that it contributes to the general white-noise that gets ignored.
Comment 1 Christer Edwards 2010-10-04 13:51:27 UTC
I have documented a SOP for kernel reboots to get us started with this.

http://live.gnome.org/Sysadmin/SOP/Reboot

I don't like the idea of nag emails (I know how the team enjoys the noise as it is).  I think it'll simply require someone (me?) taking the initiative to publicize a maintenance schedule for 48hrs out (minimum) once we see kernel updates have been applied.

Feedback / discussion on this topic can be added on the SOP.