• Manager/ Sr. Manager, Site Reliability Engineering

    Location(s) US-TX-San Antonio
    Req #
    40065
    Category
    Leadership, Networking, System Administration / Engineering
  • About Rackspace

    Rackspace is modernizing IT in today’s multi-cloud world. We have been honored by Fortune, Forbes, Glassdoor and others as one of the best places to work. We serve over 50% of the Fortune 100 companies & customers in 120 countries around the globe. Our achievements are powered by our people – we call them Rackers.  We grow & thrive through world-class development opportunities, learning & selling bleeding-edge technologies & solutions, and most importantly, connecting with each other (the best & brightest in the industry). Are you a Racker? Join us!

     

    More on Rackspace

     

    Rackers aren’t all alike. We look different. We think uniquely. We are from many places and our beliefs & backgrounds vary. But, being a Racker — a valued member of a winning team on an inspiring mission – is what connects us all. Rackers are encouraged to bring their whole self to work every day, as we know that unique perspectives fuel innovation and enable us to best serve our customers & communities around the globe. We welcome you to apply today and want you to know that we are committed to offering equal employment opportunity without regard to age, color, disability, gender, gender reassignment or identity or expression, genetic information, marital or civil partner status, pregnancy or maternity status, military or veteran status, nationality, ethnic or national origin, race, religion or belief, sexual orientation, or any legally protected characteristic. If you have a disability or special need that requires accommodation, please let us know.

    Overview & Responsibilities

     

    As the Senior Manager for TES Site Reliability Engineering, you’ll lead two teams comprised of Systems and Network Engineers that build solutions to enhance availability, performance and stability of our internal platforms for teams across Rackspace. You'll define processes and lead your team to respond effectively to alerts, tickets, calls and in addition manage ongoing project work. Your team will be working in both production and non-production environments focusing on the SRE core tenants. The best person for this role is someone who has strong leadership experience, understands in depth engineering and networking concepts and has a very collaborative spirit. Your able to manage a project from vision to implementation with little to no guidance. You take a proactive approach to resolve issues before they even exist, ensuring that your internal customers and partners can do their job without issue. You love partnering with developers, engineers and operations teams to drive solutions for your customers.

     

     

    In this role you will:

    • Work with an awesomely talented passionate group of Rackers
    • Create and meet roadmap deliverables for designing and building an internal Platform as a Service solution to support internal Rackspace development and application teams using technologies such as containers, Kubernetes, application pods, VMware, etc.
    • Drive the vision for a state of the art multi cloud platform solution and provide monthly costs and investment ROI data
    • Support internal teams on multiple levels that will utilize your platforms including VMware, AWS and OpenShift.
    • Drive your teams to identify opportunities to automate infrastructure and application deployment processes for internal Rackspace developers
    • Drive whiteboarding sessions to lead the team in architecting and developing full stack solutions, from whiteboard to green SLA’s
    • Own end-to-end availability and performance of mission critical services and plan / prioritize building automation to prevent problem recurrence; automate response to all non-exceptional service conditions.
    • Keep in close contact with your customers and partners to ensure your roadmap objectives provide solutions, align to the business objectives and most of all are improving their abilities to provide custom tools for our customers.
    • Educate on best practices in terms of redundant architecture and application deployment workflows
    • Lead by example, care for your team and establish credibility with the quality of your and your team's technical execution.
    • Work closely with your peers that oversee other technical teams ensuring that cross training and knowledge sharing is happening to eliminate single point of failures and silo’s
    • Manage employees around the globe including on-call rotations, running incidents, 1:1’s, team meetings, quarterly review, etc.  

    Qualifications

    • BA/BS degree in Computer Science or related technical field, or equivalent practical experience
    • 7-10 years Technical leadership experience. Includes understanding of SDLC and systems infrastructure principles and how they interrelate
    • Strong driving and collaboration/coordination skills. Experience facilitating across large diverse cross-functional teams
    • Strong facilitative leadership skills; able to effectively sell your ideas and convince others to follow based on persuasion rather than authority
    • Strong analytical skills to understand issues and work collaboratively to identify root cause
    • Effective communication & liaising across a wide range of audiences from engineers to executives
    • Proven track record of managing large, complex, multidisciplinary programs
    • Strong organizational skills, planning, and attention to detail are also required
    • Internally motivated, self-starter with ability to plan, organize and establish priorities to meet goals and achieve results
    • Must work well under pressure, balancing multiple priorities and objectives. Handles conflict well
    • Demonstrated leadership working in a broad cross-functional environment
    • Experience working with SaaS/cloud applications and enterprise technology Preferred qualifications
    • Hands-on technical experience combined with strong management and communication skills
    • Capable of technical deep-dives into code, deployment architecture, networking, operating systems and storage
    • Demonstrated expertise in recruiting and managing a team of bright, experienced engineers/project managers/analysts on large scale projects
    • Expertise in problem solving and analyzing global scale distributed systems Key Skills/Competencies
    • Complete Ownership and Accountability Mindset
    • Experience in running complex large scale distributed systems
    • Passionate about uptime and resiliency and operational excellence
    • Thought leadership, Problem Solving and creative thinker
    • Ability to make smart trade-offs, say no when relevant
    • Grow talent/raise the bar, talent optimization, lead strong engineers/personalities
    • Communication, ability to run effective interference/framing with Directors and above