Site Reliability Engineer Lead Analyst at Citi overseeing application systems analysis and reliability. Leading monitoring, automation, and collaborative initiatives to enhance system performance.
Responsibilities
Monitor, Measure and analyze the system's performance and availability
Resolve variety of high impact problems/projects through in-depth evaluation of complex business processes, system processes, and industry standards
Provide expertise in area and advanced knowledge of applications programming and ensure application design adheres to the overall architecture blueprint
Utilize advanced knowledge of system flow and develop standards for coding, testing, debugging, and implementation
Develop comprehensive knowledge of how areas of business, such as architecture and infrastructure, integrate to accomplish business goals
Provide in-depth analysis with interpretive thinking to define issues and develop innovative solutions
Serve as advisor or coach to junior SRE engineers, allocating work as necessary
Develop and maintain automated tools and systems to manage and monitor the infrastructure
Reduce manual intervention, human errors and the time it takes to perform routine tasks
Periodically assess the capacity of needs of services and work on scaling them to handle the increased usage
Plan for resource allocation, manage load balancing and ensure the system can handle demand fluctuations
Work to detect, diagnose and resolve issues quickly to minimize the impact on users and business
Conduct post-incident reviews to learn and improve system's reliability
Work with different development teams, product owners and other stakeholders to ensure seamless deliveries and aligning to a common goal
Requirements
6+ years of relevant experience in Apps Development or systems analysis role
Extensive experience system analysis and in programming of software applications
Extensive experience in automated pipelines, automated testing and automated security controls
Extensive experience in the use of logging tools/systems (splunk, appDynamics, etc...)
Experience in managing and implementing successful projects
Subject Matter Expert (SME) in at least one area of Applications Development
Ability to adjust priorities quickly as circumstances dictate
Demonstrated leadership and project management skills
Consistently demonstrates clear and concise written and verbal communication
Benefits
medical, dental & vision coverage
401(k)
life, accident, and disability insurance
wellness programs
paid time off packages, including planned time off (vacation), unplanned time off (sick leave), and paid holidays
Job title
Site Reliability Engineer, Lead Analyst, Vice President
Safety and Reliability Engineer focusing on safety assessments and reliability evaluations at Collins Aerospace. Lead analyses and ensure designs meet certification standards.
Deployment Engineer responsible for client solution deployment and integration at ng - voice. Work includes planning, configuration, and operational efficiency tasks.
DevOps Engineer participating in structuring Terraform practices at EOLEN, a consulting firm in engineering and IT. Focused on Cloud, Data, Cybersécurité, software development and IT infrastructure.
DevOps Developer coordinating IT support and developing pipelines and delivery processes for Saab. Focused on collaboration, technical solutions, and communication to achieve high - quality results.
Senior Infrastructure Engineer focused on design automation and software infrastructure at Intel Foundry. Collaborating with development teams to improve reliability and velocities in engineering processes.
Site Reliability Engineer at Personio focusing on automated infrastructure and collaboration across engineering teams. Shape the future of HR technology with meaningful impact and ownership.
Site Reliability Engineering Senior Manager leading multiple SRE teams at Netwealth. Shaping strategy and operational practices in a collaborative environment.
DevOps Engineer automating software development lifecycle in multi - cloud Kubernetes environments. Building and maintaining DevSecOps pipeline using Infrastructure as Code and modern tools.