Systems Engineer managing AWS/Cloud production environments at Ticketek Entertainment Group. Proactively mitigating issues and collaborating with engineering teams for optimal performance.
Responsibilities
Proactively monitoring and managing our AWS/Cloud production environments and reacting swiftly to prevent or reduce customer visible impact
Escalation and communication of production issues to key stakeholders
Troubleshooting, reproducing, and mitigating complex system and infrastructure issues within the AWS environment
Incident management of high severity issues impacting our sites and services 24x7
Developing and implementing automation and tooling (e.g., leveraging CloudFormation, Ansible, or Terraform) in collaboration with the Site Reliability Engineering team to improve cloud management processes
Working on the engineering team backlog
Supporting service prior to go-live through pre-launch reviews
Providing technical support for internal products, requiring strong investigation, analysis, and resolution skills
Monitoring and checking of systems
Execution of daily system operations tasks, including maintenance and optimisation of the cloud infrastructure
Deployment, configuration, and management of cloud-native or updated solutions, utilizing Gitlab CICD
Requirements
Strong troubleshooting, problem-solving, and investigative skills applied across diverse operating systems and networked environments
Extensive experience operating and managing critical cloud infrastructure and production environments
Experience of working in an agile environment to deliver software
Knowledge and practical experience with scripting (including Shell scripting and Python) for automation and system management
Experience working in a Microsoft stack environment including Windows Server Operating system, Internet Information Services (IIS), Active Directory (AD) and database servers such as Microsoft SQL Server.
Operating knowledge of UNIX or Linux, including proficiency in Shell scripting and Python scripting for automated task scheduling and infrastructure management
Demonstrated experience with Amazon Web Services (AWS), specifically in managing core services (e.g., EC2, VPC, S3)
Sound knowledge of basic networking such as IPs, TCP/IP and Firewall
Proficient in quickly learning new technologies and ability to analyze business needs and recommend effective solutions
Excellent verbal and written communication skills
Proven experience in Incident Management for high-severity issues, preferably within a large-scale production environment
Experience working within GCP
Experience utilizing CI/CD tools (such as Gitlab CICD or Jenkins) to streamline cloud deployments
Experience managing or administering specialized databases, particularly cloud-native services like DynamoDB and Snowflake
Experience working effectively in a demanding and fast-paced production environment
Expertise with logging, monitoring, and Application Performance Monitoring (APM) tools used for proactive system health checks
Previous experience working a shift pattern or managing substantial weekend work / on call responsibilities
Part - time Systems Architect/SME at AMERICAN SYSTEMS, supporting multi - level security systems for the Exodus Transport Network. Requires deep expertise and active Top Secret clearance.
HBM System Engineer at Micron developing innovative memory solutions for customers. Leading technical validation and testing efforts to ensure successful product launches.
Architectural leadership for next - generation DRAM products at Micron. Define DRAM system architectures and collaborate across technology domains for high - performance memory solutions.
IT Network System Engineer focusing on LAN/WLAN projects for customer locations at DATAGROUP. Responsible for network configuration, operation, troubleshooting, and support.
Design System Engineer managing PermitFlow's component library and collaborating with product designers on accessibility and quality standards. Building and maintaining components that drive product development and consistency.
Computer Systems Analyst at Northrop Grumman coalescing and documenting user requirements. Collaborative role developing solutions and providing system training for business operations.
Senior Instructional Systems Designer developing training solutions aligning with Boeing's organizational goals and workforce needs. Collaborating with business leaders to enhance employee performance through impactful learning experiences.
Systems Architect Engineer focusing on B - 1 Weapons programs for Boeing's Bombers Mission Systems. Collaborating with software teams and program management on engineering solutions.
Warehouse Management Systems Analyst at GXO Logistics responsible for liaising between Operations and IT. Evaluate WMS and support systems enhancements for improved operational efficiency.