Manager leading observability platform operations across logging, metrics, telemetry, and analytics in Canada. Focused on reliability, scalability, and compliance in partnership with Agile product teams.
Responsibilities
Own platform operations and roadmap (Elastic, Dynatrace, Micro Focus, Grafana)
Manage capacity, cost, performance, and security
Govern logging, telemetry, tracing, topology, and data lifecycle/quality
Publish standards and guardrails; ensure compliance via gating and maturity checks
Align governance with enterprise architecture
Manage vendor relationships in collaboration with the Director
Build partnerships across IT, application owners, infrastructure, and SDLC stakeholders
Coach on instrumentation, alert hygiene, dashboards, tracing, and topology
Lead the observability community and deliver shared trainings and templates
Communicate platform health, adoption, coverage, and outcomes
Enrich signals with app, infra, and network data; apply anomaly detection and AI to reduce noise
Provide reusable dashboards, alert policies, runbooks, and instrumentation patterns
Strengthen incident response and major-incident support; contribute to post-mortems
Implement enhancements that lower detection time and MTTR
Automate provisioning, config-as-code, data onboarding, alerting, and visualization
Embed observability in CI/CD and pre-release checks; promote “observability by default”
Support SRE goals by enabling SLIs/SLOs/SLAs and improving reporting
Manage two Agile product teams and a 24/7 Operations Center
Develop talent and a culture of automation, reliability, and customer service
Maintain backlog and roadmap; prioritize features and cost; drive continuous improvement and report outcomes
Requirements
6+ years in observability/platform engineering, with 2+ years leading ops/platform teams
Hands-on expertise with observability tooling (Elastic Stack, Dynatrace, Grafana, Micro Focus) and pipelines for logging, metrics, tracing, and topology
Experience building automation and self-service for observability (IaC, CI/CD, config-as-code) and integrating observability into the SDLC
Familiarity with multi-cloud (AWS, Azure, GCP) and on-prem environments, including visibility across hybrid infrastructure in Canada
Background in 24/7 operations and service management
Strong communication, stakeholder partnership, coaching, and vendor management skills
Experience with AI/ML anomaly detection and analytics in observability contexts
Familiarity with SAFe/Agile product management and platform roadmaps
Exposure to performance engineering and enterprise architecture standards