Staff Software Engineer focused on incident management to improve system reliability at Insulet. Collaborating with Incident Managers and teams to automate detection and response processes.
Responsibilities
Driving the incident management process and coordinating efforts with all teams involved, including SRE, R&D, IT, vendors, and stakeholders, to resolve the incident
Responding to incidents and initiating the incident management process
Prioritizing incidents according to their urgency and business impact
Coordinating response efforts and collaborating with the incident response team to ensure that all protocols are diligently followed
Communicating with internal stakeholders on major incidents and impacts
Producing documents that outline incident timelines and actions taken during the incident
Coordinating post-incident root cause analyses (RCAs) with responders and SMEs and communicating the findings to stakeholders
Designing and implementing automation for incident detection, triage, and resolution
Developing and maintaining runbooks, playbooks, and tooling to streamline incident response
Collaborating with Incident Managers to improve processes and reduce Mean Time to Recovery (MTTR)
Participating in major incident response efforts, providing technical leadership during high-severity events
Leading post-incident reviews and implementing preventive measures to avoid recurrence
Requirements
Bachelor’s degree required (preferred field of study: Computer Science, Engineering, or related field)
7+ years of experience in software engineering, operations, or reliability roles
3+ years focused on incident management or operational resilience
Proven track record of improving incident response processes and reducing MTTR
Proven experience architecting and managing highly available, scalable, and fault-tolerant systems
Strong understanding of cloud computing platforms (e.g., AWS, Azure, GCP) and container orchestration technologies (e.g., Kubernetes)
Strong understanding of incident management principles and frameworks (e.g., ITIL)
Hands-on experience with incident response in complex, distributed systems
Proficiency in scripting or automation (Python, Bash, or similar) for operational tasks
Familiarity with monitoring and alerting tools (e.g., Datadog, Prometheus, Grafana)