Systems and Infrastructure Engineer managing technology infrastructure and providing DevOps support for system reliability. Collaborating with development teams to implement solutions and enhance system performance.
Responsibilities
Coordinate system support and performance by ensuring support queue issues are resolved.
Identify the root causes of issues.
Drive suppliers to resolve issues related to products in accordance with service level agreements.
Document the resolution of issues and perform escalation procedures.
Responsible for core DevOps team duties which includes on-call support to ensure system reliability, Oversight of new launch reviews and onboarding, effective incident management along with troubleshooting.
Proactive management of Dashboards and alert systems, providing consultation support to expedite development team processes during releases and post deployment phases.
Track alert analysis trends and identify weekly focal points and collect data driven evidence for specific issues.
Collaborate with development teams to understand and address problem context.
Implement permanent solutions to prevent recurring alerts and communicate.
Implement fixes transparently to stakeholders for enhanced system reliability.
Contribute to automation/ efficiency gain with reduction in manual efforts.
Demonstrate up-to-date expertise in Information Systems Division (ISD) infrastructure and applies this to the development, execution, and improvement of action plans by providing expert advice and guidance to others in the application of information and best practices; supporting and aligning efforts to meet Customer and business needs; and building commitment for perspectives and rationales.
Requirements
Master's degree or equivalent in computer science, computer engineering, information systems, information technology, or related area; OR Bachelor's degree or equivalent in computer science, computer engineering, information systems, information technology, or related area and 2 years of experience in technology infrastructure engineering across areas with compute, storage, network, mobility or virtualization-related technologies.
Experience automating tasks and managing system configurations using Python BASH to streamline operations and reduce manual intervention.
Experience overseeing containerized technologies including Docker and orchestrating complex applications using Kubernetes, ensuring optimal deployment, scaling, and high availability in infrastructure management environment.
Experience designing, implementing, and automating distribute systems solutions.
Experience managing Linux environments which includes tasks such as system configuration, maintenance, troubleshooting and security management.
Experience creating, maintaining and updating Monitoring Dashboard systems for Infrastructure along with alerting mechanism using Grafana, ELK stack, Dynatrace, Spotlight, Prometheus, and X-matter.
Experience troubleshooting Network Issues, identifying the Root cause and fix for a permanent solution using Wireshark, Dynatrace, and Networking protocols.
Experience maintaining and administering Version control source code, Branching Strategies, along with creating Governance rules for Git Organizations using Git, GitHub, Bitbucket, and GitHub Actions.
Experience maintaining streaming application flows including Apache Kafka and AWS Kinesis.
Demonstrated knowledge of SSL/certificates at Various root levels and Certificate authorities with certificate renewal management using ServiceNow, Venafi, OpenSSL, DigiCert, Global sign, and Rapid SSL.
Experience with Code Analysis and Quality gate control measures using SonarQube, Jenkins, Looper, and Concord.
Experience developing APIs for various applications and usage of Databases including Redshift, Cassandra, MySQL, Flask, Docker, and Python.
Benefits
Health benefits include medical, vision and dental coverage.
Financial benefits include 401(k), stock purchase and company-paid life insurance.
Paid time off benefits include PTO (including sick leave), parental leave, family care leave, bereavement, jury duty and voting.
Other benefits include short-term and long-term disability, education assistance with 100% company paid college degrees, company discounts, military service pay, adoption expense reimbursement, and more.
Infrastructure Engineer designing and building workflows, internal tools, and services at MUBI. Collaborating in a hybrid London setting, connecting systems with AI - powered automation.
Infrastructure Engineer at Push Gaming, focused on building scalable backend systems for online casino games. Collaborating with teams on CI/CD pipelines and automation processes in a hybrid work environment.
Cloud Infrastructure Specialist handling Azure operations and vendor coordination. Driving resilient infrastructure projects with a collaborative, impact - driven team in Warsaw.
Windows Server Infrastructure Engineer maintaining critical enterprise Windows Server environments. Supporting DoD security compliance and infrastructure management for Federal clients in multiple locations.
Infrastructure Engineer contributing to AWS architecture and automation at Oddin.gg. Collaborating with teams to optimize performance and support developer experience.
Cloud Platform Infrastructure Engineer optimizing and managing cloud - native systems in Austin, TX. Collaborating with global teams and participating in agile development processes.
Senior Specialist Infrastructure Architect at Baker Hughes focusing on digital transformation and cybersecurity. Responsible for infrastructure architecture and mentoring team members within the organization.
ML Infrastructure Engineer developing Cloud Data Infrastructure to support Assured AI for Autonomy. Designing and developing infrastructure to enhance Bluespace's APNT capabilities.
Senior Data Infrastructure Engineer responsible for modernizing the data platform while optimizing for cost - efficiency and ensuring scalability. Joining a team focused on user - friendly solutions and data accessibility.
Lead Infrastructure Engineer managing endpoint vulnerabilities and configuration compliance at Truist. Collaborating with engineering and security teams to drive risk reduction and governance.