More About the Role:
Leidos currently has an opening on the Service Management Integration and Transport (SMIT) Contract for a Site Reliability Engineering (SRE) Product Owner. This is an exciting opportunity to use your experience and leadership skills to successfully execute the mission of the Navy’s largest IT services program. Under the SMIT Contract, the Leidos team is responsible for the core backbone for the Navy-Marine Corps Intranet, including cybersecurity services, network operations, network engineering, service desk, seat support services, and data transport.
The SRE Product Owner/Team Lead works closely with the designated SRE team to prioritize and guide the development and implementation of reliability-focused features, tooling, and processes. This role ensures that SRE practices align with business goals and improve the resilience, scalability, and performance of critical systems. The PO advocates for automation, proactive monitoring, and incident management best practices to enhance the end-user experience.
The Product Owner will collaborate with other product owners, architects, and the Director to continue to improve and mature our site reliability engineering program processes and procedures. They will have technical leadership over their 6-8 employees and work with SRE Resource Managers.
What You'll Get to Do:
Product Strategy & Roadmap:
•Develop and maintain a product vision and roadmap for SRE initiatives, in alignment with organizational objectives.
•Translate business and operational requirements into product features and technical requirements for SRE teams.
•Provide backlog management, iteration planning, and decomposition of the user stories. Create and groom short, medium and long-term product roadmaps in agreement with internal and external stakeholders.
•Assesses business value and prioritize all stories to ensure work focuses on those with maximum value that align with product strategy.
•Work with the SRE stakeholders on establishing an infrastructure automation vision and a roadmap that aligns with the platform's strategic mission and objectives.
Stakeholder Engagement:
•Serve as the primary point of contact between the SRE team and business stakeholders for all SRE product requirements.
•Creates User Stories and Acceptance Criteria, ensuring stories clearly communicate customer/Stakeholder needs to the development team; work with team to clarify stories as necessary.
•Participates in team demo, retro, and Inspect and Adapt.
Incident and Problem Management:
•Act as a key partner in incident response, working with SREs to address high-impact incidents.
•Analyze incidents to identify root causes and drive the development of features that prevent recurrence.
Reliability & Performance Optimization:
•Work with SRE and engineering teams to drive automation, monitoring, and alerting initiatives that enhance system resilience.
Documentation & Communication:
•Ensure clear documentation and communication of product requirements, progress, and updates to stakeholders.
•Embrace and champion Agile development processes and adoption to modern Site Reliability Engineering workflows and practices while providing technical guidance to team members and coworkers on best practices.
•Create and publish strategies, implementation, maintenance, and administration guides for SRE platforms.
•Review new work proposals that may have an impact with your product(s) and roadmap.
•Work with software developers and operations engineers to improve the software delivery process.
•Strive to provide internal and external customers with excellent customer service and world-class service.
You'll Bring These Qualifications:
•Requires B.S. Degree and 8–12 years of prior relevant experience or Masters with 6–10 years of prior relevant experience. May possess a Doctorate in technical domain.
•Must be a US Citizen and possess an active DoD Secret Security Clearance.
•Minimum of DoD 8570.01 IAT Level II Certification required prior to onboarding and must maintain certification while supporting the SMIT Contract.
•Must be able to support program execution in classified environments and access SIPRNet from an NMCI location on short notice (local travel).
•Ability to travel up to 10% including the potential for OCONUS travel.
•Exceptional written and oral communication skills including producing technical analysis/reports, presentations and executive level briefings with internal and external stakeholders.
•Ability to review requirements, comprehend, and solution capabilities that satisfy customer requirements.
•Ability to work in a highly collaborative, forward thinking, and innovation-driven environment.
•Experience with Agile and DevSecOps/SRE concepts and best practices.
•Hand-on experience with Atlassian products (Jira, Confluence, Bitbucket, etc.).
•Hands-on experience administrating/maintaining SRE platform via Ansible playbooks.
•Strong experience in automating tasks with scripting languages like PowerShell, or Python.
•Working knowledge of the Risk Management Framework (RMF), DISA STIGs.
These Qualifications Would be Nice to Have:
•Previous work experience providing support to the NGEN-NMCI program is highly desired.
•Previous people leadership and mentoring experience.
•Certified Scrum Product Owner (CSPO).
•Advanced/Professional level vendor certifications (Azure).
•ITILv4 and Agile SAFe certifications or applicable experience.
While subject to change based on business needs, Leidos reasonably anticipates that this job requisition will remain open for at least 3 days with an anticipated close date of no earlier than 3 days after the original posting date as listed above.
Pay Range:Pay Range $101,400.00 - $183,300.00The Leidos pay range for this job level is a general guideline only and not a guarantee of compensation or salary. Additional factors considered in extending an offer include (but are not limited to) responsibilities of the job, education, experience, knowledge, skills, and abilities, as well as internal equity, alignment with market data, applicable bargaining agreement (if any), or other law.