Maintenance Engineer ensuring reliability of production AI systems at Luminai. Monitoring, diagnosing, and improving AI workflows for critical organizational processes.
Responsibilities
Monitor, maintain, and improve the reliability of production AI systems and workflow infrastructure
Proactively identify, diagnose, and resolve system issues across application, integration, and cloud infrastructure layers
Own incident response processes, including root cause analysis and long-term remediation
Implement monitoring, alerting, and observability tooling to ensure system health and uptime
Collaborate with Engineering to harden deployments and improve system architecture for resilience and scalability
Support customer-facing teams by troubleshooting and resolving technical issues in live environments
Document system configurations, operational procedures, and recovery protocols
Continuously improve reliability standards, deployment practices, and operational safeguards
Requirements
3+ years of experience in support engineering, site reliability engineering, or infrastructure maintenance
Strong proficiency in Python or scripting languages
Experience managing cloud infrastructure (AWS, GCP, or Azure)
Strong problem-solving skills and a proactive, preventative mindset
Clear communication skills and ability to collaborate across engineering and customer-facing teams
High ownership and accountability in high-reliability environments
Software Engineer responsible for designing, developing, and maintaining software applications in financial services. Collaborating across teams for requirements analysis and engaging in the entire development lifecycle.
Senior Logistics Engineer at Saab Australia leading logistics engineering activities for defence acquisition projects. Collaborating with teams to manage and execute logistic engineering and obsolescence analysis.
Project Engineer responsible for executing hardware design projects in industrial automation. Ensuring on - time delivery and customer satisfaction while upholding engineering standards.
Senior Analog Layout Engineer executing custom analog layouts for critical circuit blocks. Collaborating with design teams and supporting silicon bring - up and debugging processes.
Formal Verification Engineer crafting and optimising verification flows for CPU/GPU projects at NVIDIA. Collaborating with design teams and ensuring design correctness using advanced formal techniques.
Senior Packaging Development Engineer managing packaging design and vendor collaboration for product lifecycle. Driving packaging automation and improvement in a Taiwan - based environment.
Engineer, Test Manufacturing (ICT) at Celestica providing tester support and improving test solutions in Thailand. Collaborating with production and internal customers for robust testing outcomes.
Associate Engineer responsible for automatic test equipment and support in manufacturing at Celestica. Focus on optimizing testing processes for product quality and efficiency.
Reliability Test Engineer at Celestica maintaining and improving automatic test equipment for product validation. Conducting reliability testing and collaboration with cross - functional teams to ensure quality.