    Engineering

    Senior Site Reliability Engineer

    Summary

    Join us as we pursue our disruptive new vision to make machine data accessible, usable and valuable to everyone. We are a company filled with people who are passionate about solving problems using data and seek to deliver the best experience for customers. At Splunk, we’re committed to our work, our customers, having fun, and most importantly to each other’s success.

    We are looking for a Site Reliability Engineer focussing on the SignalFx and APM product lines. Site Reliability Engineers at Splunk are hybrid software/systems engineers whose overarching goal is to ensure that Production Services are always up and running reliably. They are also responsible for improving Operational Efficiency, Utilization and System Resiliency of the Platform. They own Critical Open Source Software that our platform relies on, and are core participants in every significant engineering effort underway in the platform.

    Responsibilities:

    • Automate & operationalize engineering tasks on Backend Services - data migrations, performance tuning, capacity changes, etc.
    • Monitor Capacity & Utilization and work closely with the Infrastructure team to orchestrate scale-up/down of Backend Services.
    • Own & operate critical back-end Open Source Services like Cassandra, Kafka, Zookeeper, Elasticsearch, Druid, etc.
    • Build tools and design processes that help improve observability and system resiliency of the SignalFx Platform.
    • Triage Site Availability Incidents and proactively work towards reducing MTTR for customer impacting incidents.
    • Partner with Service owners to implement Service Level Metrics & Service Level Objectives that act as service level health indicators.
    • Establish design patterns for monitoring, benchmarking and deploying new features for the backend services.

    Requirements:

    • BS degree in Computer Science or a related technical field, or equivalent practical experience.
    • 5+ years of experience as a Site Reliability Engineer, Production Engineer or Backend Software Engineer for web-scale or similar platforms.
    • Coding experience in one or more of Python, Bash, Go or Java.
    • Experience building or operating high performance distributed systems.
    • Experience with one or more OSS technologies like Kafka, Cassandra, Zookeeper or Elasticsearch.
    • Understanding of Unix/Linux systems from kernel to shell and beyond, taking in system libraries, file systems, and client-server protocols along the way.
    Splunk's Hiring Practices
    Splunk turns machine data into answers. Organizations use market-leading Splunk solutions with machine learning to solve their toughest IT, Internet of Things and security challenges.
    Individuals seeking employment at Splunk are considered without regard to race, religion, color, national origin, ancestry, sex, gender, gender identity, gender expression, sexual orientation, marital status, age, physical or mental disability or medical condition (except where physical fitness is a valid occupational qualification), genetic information, veteran status, or any other consideration made unlawful by federal, state or local laws. Click here to review the US Department of Labor’s "EEO is the Law" notice. Please click here to review Splunk’s Affirmative Action Policy Statement.
    Splunk also has policies in place to protect the personal information candidates disclose to us as part of the application process. Please click here to review Splunk’s Career Site Privacy Policy.

    Splunk does not discriminate against employees or applicants because they have inquired about, discussed, or disclosed their own pay or the pay of another employee or applicant. Please click here to review Splunk’s Pay Transparency Nondiscrimination Provision.

    Splunk is also committed to providing access to all individuals who are seeking information from our website. Any individual using assistive technology (such as a screen reader, Braille reader, etc.) who experiences difficulty accessing information on any part of Splunk’s website should send comments to accessiblecareers@splunk.com. Please include the nature of the accessibility problem and your e-mail or contact address. If the accessibility problem involves a particular page, the message should include the URL of that page.

    Splunk doesn't accept unsolicited agency resumes and won't pay fees to any third-party agency or firm that doesn't have a signed agreement with Splunk.

    To check on your application click here.

    DIVE DEEPER

    Find out what makes Splunk such a great place to work

    Our Values

    Splunkers are encouraged and empowered to be Innovative, Passionate, Disruptive, Open and Fun.

    Our Locations

    From San Francisco to Shanghai, Splunkers work in 25+ offices across the globe.

    University Recruiting Program

    Intern with people you want to hang out with, even outside the office.


    Our Blog

    Hear from Splunkers on the latest.

    Diversity & Inclusion

    Culture of Inclusion: Splunkers Share Their Stories

    LinkedIn

    Follow Splunk on LinkedIn for job announcements, company news, and more.
