Production Engineer managing database operations at Palantir, ensuring reliability and availability of data systems. Involved in architecture, design, and maintenance of production databases in various environments.
Responsibilities
Build and maintain software that automates the routine maintenance tasks involved in deploying and running production databases
Participate in regular on-call rotations
Troubleshoot, diagnose, and remediate issues that arise in the stability and reliability of production database systems
Participate in post-incident reviews and take ownership of follow-up actions
Manage and execute large-scale migrations of the fleet of databases we run
Collaborate closely with customer-facing teams when responding to incidents or when assisting them in setting up new installations of databases
Work with database engineering teams and infrastructure teams to build resilient and highly available database systems
Continuously invest in documentation, metrics, monitors and other troubleshooting tools
Build deep expertise and experience in production systems (Kubernetes, cloud environments, Cassandra, Elasticsearch, etc.) and share that knowledge amongst the team
Requirements
Engineering background in Computer Science, Mathematics, Software Engineering, Physics or similar field.
Experience writing code in a variety of programming languages such as Java, Python, and Go, as part of a past role or personal projects.
Experience with cloud and deployment technologies such as Kubernetes, Helm, AWS, GCP, Azure, and OpenShift.
Experience with database technologies such as Cassandra, Elasticsearch and Kafka.
A solid foundation in Linux and web service fundamentals.
Familiarity with observability tools such as Grafana and Prometheus.
Strong written and verbal communication and the ability to iterate quickly with teammates and incorporate feedback.
Benefits
Promoting health and well-being across all areas of Palantirians’ lives
Reasonable accommodation for those living with a disability