As a Site Reliability Engineer - OpenStack Private Cloud (Storage), you will work with other SREs, Engineers, Developers and our support & operations teams to ensure maximum performance, reliability and automation of our Private Cloud deployments and infrastructure.
We recognize that manual approaches to operations do not scale, and are launching a new team in Private Cloud Engineering to tackle the significant problems of managing many, discreet Private Cloud installations with multiple offerings and form-factors at scale world-wide. Our Site Reliability Engineer is someone who is familiar with both software and systems engineering with a desire not to just resolve the problem but prevent it in the future. You should have excellent written and verbal communication skills and you should be comfortable operating in fast paced environment.
In this role, you will be focused on our private cloud storage offerings including CEPH, Swift, and Hummingbird. This is a mix of block, filesystem and object storage systems offered as part of the private cloud product offering. This includes understanding how Openstack integrates these technologies, operational expertise and debugging.
In addition to resolving and automating issues internally and downstream if a problem, or issue is better served by fixing the issue in the upstream Open Source code, you will be submitting patches to improve the operational and reliability aspects of the upstream projects.