EDA Infrastructure Engineer at Marvell designing and maintaining HPC clusters for EDA workloads. Collaborating on infrastructure optimization and deployment automation to enhance chip verification processes.
Responsibilities
Design, implement, and maintain large-scale HPC clusters for EDA workloads, ensuring high availability, fault tolerance, and efficient resource utilization.
Manage, configure, and optimize LSF job scheduling systems to support diverse verification workflows.
Develop, automate, and monitor deployment, configuration, and operational processes for EDA infrastructure.
Collaborate with EDA engineers and designers to refine verification flows to run optimally on the grid.
Implement and advance CI/CD pipelines to streamline the deployment, testing, and monitoring of infrastructure and EDA flows.
Provide troubleshooting and support for users and the infrastructure.
Monitor infrastructure health, performance, and usage; proactively identify, resolve, and document issues.
Ensure compliance with security best practices, license management, and data protection requirements.
Contribute to architectural innovation and process improvement for future scalability and efficiency.
Participate in incident management teams for prompt issue resolution.
Requirements
Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, or related field.
2 -4 yrs of industry experience.
Proficiency with Programming or scripting in languages such as Python, Bash, or Perl for automation and workflow development.
Working knowledge of Linux system administration and cluster troubleshooting.
Familiarity with infrastructure-as-code, configuration management, and monitoring, DevOps and SRE concepts.
Strong communication and collaboration skills; ability to work in cross-functional teams.
Track record of identifying and implementing infrastructure optimizations for efficiency, throughput, and reliability.
Benefits
Competitive compensation and great benefits including health insurance
Professional development opportunities
Workstyle within an environment of shared collaboration, transparency, and inclusivity
AI Infrastructure Engineer designing and implementing AI solutions for Xsolla's infrastructure tasks across GCP and multi - cloud environments. Collaborating with senior engineers to execute AI strategy.
Data Transport Infrastructure Engineer at Leidos supporting U.S. Air Force Cloud One Architecture. Involves developing scalable cloud - native solutions and mentorship roles in a hybrid remote setting.
Principal Software Engineer on Walmart's AI Security team analyzing threats and implementing robust security architectures. Collaborate across domains and mentor on AI safety and secure engineering practices.
Data Center Infrastructure Architect designing scalable and resilient optical cabling for hyper - scale data centers. Implementing physical solutions and automating fiber mapping for efficiency.
Systems and Infrastructure Engineer managing technology infrastructure and providing DevOps support for system reliability. Collaborating with development teams to implement solutions and enhance system performance.
Infrastructure Engineer managing IT infrastructure projects and operational tasks for the MHRA. Collaborating with teams to ensure service stability and performance in the Digital and Technology group.
AI Infrastructure Engineer designing and implementing AI/ML solutions for infrastructure use cases at Xsolla. Collaborating with teams to enhance the security posture of infrastructure systems.
AI Infrastructure Engineer at Xsolla designing AI/ML solutions for multi - cloud infrastructure. Collaborating on automation workflows and observability systems for improved infrastructure management.
Cloud Infrastructure Engineer managing Azure environments and supporting cloud infrastructure processes in a credit market servicing organization. Collaborating with DevOps teams and ensuring compliance with security standards.
Cloud Infrastructure Architect managing AWS and Azure environments for fintech clients. Leading architectural governance and security compliance in a hybrid infrastructure setup.