Server down

ByAnonymous Web Angel(GFZ)

Feb 21, 2024

On Wed, Feb 21, the site experienced downtime. The cause was an infrastructure update that caused certain resources to go offline.

At no time was there a risk to the contents of the site.

The resources (ceph cluster) has been brought back online and the site, obviously, is now functioning properly.

Geek speak:

The base infrastructure is K8S. A ceph cluster provides backing storage for the RDBMS and for the assets. At around 0800, the K8S cluster was forcibly upgraded because of EOL issues.

This caused NAS volumes to become detached from the K8S ceph nodes. This is expected. Once the volumes were attached to the new K8S ceph nodes, the OSD processes had to be properly restarted.

Once this was completed, the ceph volumes became available to all the pods that needed them and the site was brought back up.

5 thoughts on “Server down”

sota says:

2024-02-21 at 11:04

… I.T. is hard.
Birdog357 says:

2024-02-21 at 12:50

Those are certainly all words, I think…
Rob Crawford says:

2024-02-21 at 14:58

That’s what she said!
it's just Boris says:

2024-02-21 at 15:24

Reading that ….I know how Jayne feels.

1
B.Zh says:

2024-02-23 at 10:21

Is that Klingon?

1