Platform Engineer developing cloud-native microservice components and internal tools at Furiosa. Involved in building monitoring platforms and managing machine learning lifecycles.
Responsibilities
Designs and develops cloud-native microservice components on Kubernetes
Builds monitoring and logging platforms for cloud-native applications
Designs and develops Software Development Kit (SDK) of Furiosa NPUs
Develops high-performance runtime system handling DNN inference requests
Develops ML production software to manage machine learning lifecycles
Develops internal tools and services that improve teams' productivity
Writes API references and development documents
HW operation tools for hyper scale datacenter, e.g. monitoring, dashboard, workbench, job scheduling server, HW resource management system, and others.
Work on various operation tools, such as chip monitoring systems and multi-machine resource management and scheduling systems.
Requirements
Bachelor’s degree in Computer Science or equivalent work experience
Excellent communication skills for requirement gathering and clarification
3+ years strong programming skills in one or more of the following languages - Rust, Python, Golang, C++
2+ years experience developing cloud-native applications with Kubernetes
2+ years of experience building and managing microservices in AWS, Azure, GCP or Kubernetes
Experience with ML/DNN frameworks, e.g., Tensorflow, Pytorch, Apache MXNet
Knowledge of testing and CI/CD pipeline, e.g., Jenkins, and others
Experience developing high-performance and highly concurrent server applications
Experience with RDBMS, NoSQL systems and message queuing systems, such as Kafka, AWS SQS
Experience developing production-grade software for customers
Experiences in authentication & authorization methodologies, e.g., OpenID, JWT, OAuth, 802.1X, and others
Software engineer at Uncountable focusing on Generative AI deployment in software. Building AI - powered search tools and developing LLM stack for scientific research.
Lead Platform Engineer at TD Securities, developing a high performing Trading Risk Warehouse platform. Responsible for ensuring stability and scalability, while managing underlying infrastructure and supporting development teams.
Lead Platform Engineer at Capital One driving transformation in technology and solutions with Agile practices and DevOps tools. Collaborating on complex technical problems in a fast - paced environment.
Data Platform Engineer managing daily operations of data platforms for a global cybersecurity company. Collaborating with teams to ensure platform reliability and performance.
Senior Platform Engineer focused on building internal platform capabilities for developer tooling and experience at MONY Group. Collaborating with teams to enhance platform engineering and software delivery.
Databricks Platform Engineer working on AWS ecosystem design, build, and optimization. Responsible for implementing scalable pipeline solutions across data platforms.
Senior Data & Platform Support Engineer supporting Oracle databases at the Federal Reserve Bank. Collaborating with teams to ensure operability of payment systems and enhance business outcomes.
IT Project Manager involved in managing diverse projects at Fidelity focusing on architecture and data solutions. Lead delivery teams in technology initiatives enhancing existing systems.