Cloud DevOps Engineer developing AWS cloud-based solutions at S&P Global. Collaborating on innovative projects using advanced AWS and AI technologies in an Agile environment.
Responsibilities
Develop, deploy, and maintain scalable, secure, and fault-tolerant AWS cloud-based DevOps solutions using tools such as Terraform, AWS CloudFormation, and AWS CLI, directly impacting the reliability and performance of business-critical applications.
Collaborate with global development teams to design and implement best practice cloud-native solutions, gaining exposure to innovative technologies and market leaders.
Lead the configuration and management of AWS environments, focusing on automation and containerization with AWS ECS/Fargate, Docker, and integration with CI/CD pipelines using Github/GitLab/Teamcity.
Contribute to application architecture and design, leveraging deep expertise in AWS services (ECS, ECR, S3, CloudFront, Lambda, VPC, Route 53, RDS, CloudWatch) and infrastructure as code practices.
Utilize Agentic AI and AIOps tools for predictive monitoring, anomaly detection, automated remediation, and AI-driven incident management to enhance operational efficiency and service continuity.
Promote a culture of teamwork and continuous improvement, mentoring peers and fostering cooperation to achieve shared goals in an Agile environment, utilizing collaboration tools such as Jira and Confluence.
Requirements
Bachelor's degree in Computer Science, Information Systems, Information Technology, or a similar major or Certified Development Program
Minimum 3 years of experience managing AWS application environments and deployments, with strong hands-on expertise in AWS services (e.g., ECS/Fargate, ECR, S3, CloudFront, Lambda, VPC, Route 53, RDS, CloudWatch).
Proficiency in infrastructure as code (Terraform, CloudFormation), automation, CI/CD pipelines, container platforms (Docker, ECS), and familiarity with Agentic AI and AIOps tools.
Solid understanding of networking, security, load balancers, DNS, and operational tools in cloud environments; experience with software design fundamentals and DevOps principles.
Excellent written and verbal communication and presentation skills.
Strong problem-solving abilities and passion for tackling challenging issues.
Collaborative mindset, promoting teamwork and shared goals in Agile, global environments.
Commitment to continuous learning, innovation, and the adoption of best practices.
Benefits
Health & Wellness: Health care coverage designed for the mind and body.
Flexible Downtime: Generous time off helps keep you energized for your time on.
Continuous Learning: Access a wealth of resources to grow your career and learn valuable new skills.
Invest in Your Future: Secure your financial future through competitive pay, retirement planning, a continuing education program with a company-matched student loan contribution, and financial wellness programs.
Family Friendly Perks: It’s not just about you. S&P Global has perks for your partners and little ones, too, with some best-in-class benefits for families.
Beyond the Basics: From retail discounts to referral incentive awards—small perks can make a big difference.
Senior Site Reliability Engineer at Diligent leading reliability, automation, and observability across cloud infrastructure. Build tools for incident response and enhance performance in fast - paced environments.
Perception Deployment Engineer deploying deep learning models on embedded systems at Caterpillar. Collaborating with cross - functional teams for integration and optimization of perception modules in vehicles.
Principal Site Reliability Engineer at AT&T required to design scalable solutions for critical operations with minimal downtime. Collaborating with teams to monitor and improve system performance in cloud environments.
DevOps Engineer managing AI SaaS infrastructure at a high - growth European company. Supporting AI model deployment and ensuring platform security and compliance with multiple systems integration.
Engineering Manager leading teams for observability platforms at LexisNexis. Owns operational excellence across software delivery lifecycle in Raleigh, NC.
Reliability Engineer optimizing site facility infrastructure and utility systems at Roche. Conducting root cause analyses and developing maintenance plans to enhance reliability and efficiency.
DevOps SME designing, implementing, and operating multi - cloud platforms for The Missing Link. Collaborating with engineering, security, and operations teams while embedding DevOps best practices.
Site Reliability Engineer improving reliability of cloud infrastructure for an AI - specialized company. Taking ownership of monitoring and incident response processes in hybrid - working style.
DevOps Engineer leading automation for sophisticated release/deployment pipelines at Securonix. Focused on Python, Ansible, and cloud services to enhance security operations.
Senior Analyst on Data Platform DevOps at AIMCo, responsible for building data operations and collaborating with teams on innovative solutions. Focused on ensuring data quality and integrity across technologies.