Senior engineer delivering critical solutions to mobile network operators and public safety agencies. Managing infrastructure automation and ensuring system scalability, performance, and security.
Responsibilities
Build, deploy, and manage infrastructure automation using Terraform and Ansible.
Design, implement, and support highly available, fault-tolerant systems across hybrid environments (on-premises and cloud).
Manage and optimize AWS infrastructure and services with a focus on scalability, performance, cost efficiency, and security.
Integrate automation into CI/CD pipelines to streamline deployments and reduce manual intervention.
Develop and maintain automation frameworks, reusable modules, and deployment templates.
Establish and enforce standards for monitoring, alerting, logging, and observability.
Collaborate with development and operations teams to align infrastructure automation with broader technology roadmaps.
Review and enhance system architecture to meet best practices in security, resiliency, and reliability.
Document and maintain standard operating procedures, including deployment workflows, incident response, and disaster recovery.
Provide tier 4 support: troubleshoot and resolve infrastructure and automation-related issues.
Act as the after-hours escalation point for systems, engaged only when tier 3 on-call rotation personnel are unable to resolve customer service impairments.
Requirements
15+ years of applicable experience
Automation & IaC Expertise: Advanced proficiency in Ansible and Terraform for infrastructure provisioning, configuration management, and automation at scale.
Infrastructure as Code / GitOps: Strong knowledge of IaC, Configuration as Code, and GitOps principles and best practices.
Cloud (AWS): Hands-on experience with a broad range of AWS services, including EC2, VPC, IAM, S3, CloudWatch, CloudTrail, Route53, ELB, Transit Gateway, Direct Connect, EKS, ECS, and KMS.
Deep understanding of AWS networking, monitoring, and security best practices.
DevOps & CI/CD: Proficiency with CI/CD pipelines and tooling (e.g., GitLab CI/CD, Packer) and with Python or a similar scripting language.
Containers & Orchestration: Solid understanding of Docker and Kubernetes for application deployment and management.
Security & Compliance: Working knowledge of OS-level hardening, vulnerability remediation, and secure infrastructure design.
Networking: Strong foundation in networking fundamentals (Layer 2/3, IPv4; IPv6 a plus).
Monitoring & Observability: Familiarity with tools such as Prometheus, Grafana, ELK/EFK stacks, CloudWatch, Datadog, or equivalent platforms.
Problem-Solving & Troubleshooting: Proven ability to diagnose and resolve complex issues across automation, infrastructure, cloud, and networking layers.
Collaboration & Communication: Excellent communication skills with experience working in cross-functional DevOps/Engineering teams.
Leadership & Initiative: Self-starter with adaptability, able to thrive in fast-paced, mission-critical environments.
Qualifications: Bachelor’s degree in engineering or business. MBA preferred.
Certifications: AWS certifications (e.g., Solutions Architect, SysOps, or DevOps Engineer) are highly desirable.
Structural Systems Engineer specializing in structural analysis of aerospace vehicle pressurized systems. Involving design, development, and execution of test programs for launch and space structures.
Systems Engineer at Quevera collaborating with experts to deliver innovative solutions. Join our dynamic team recognized as a top employer in the Baltimore/DC area.
Staff Systems Engineer working on delivering complex software applications into operations with a talented team at CACI. Supporting development and verification of mission capabilities while ensuring operational efficiency.
Senior Systems Engineer supporting mission-critical software and AI/ML product development. Collaborating within an Agile team to transition complex systems to operational use.
IT Support Specialist ensuring installation, support, and maintenance of IT systems in healthcare settings. Focusing on efficiency, stability, and customer service with a team-oriented approach.
RF Systems Engineer III developing spacecraft communication systems for civil, commercial, and National Security Space programs. Collaborating with cross-functional teams to enhance RF communications technology.
Systems Engineer supporting deployment and operational reliability in a cloud-based healthcare platform. Collaborating with engineering and QA teams to manage cloud environments and troubleshoot issues.
Business Systems Analyst participating in daily support and enhancement of systems for health care. Involved in development and configuration to support Cambia's mission in health care.
Systems Analyst for Connecticut Children’s health improving computer systems and supporting colleagues. Utilizing data gathering techniques for effective solutions in a healthcare environment.
Epic Systems Analyst supporting pharmacy IT systems for Connecticut Children’s. Utilizing expertise in complex application and systems enhancements or replacements.