Solution Architect developing comprehensive AI infrastructure solutions for deployment at d-Matrix. Collaborating with clients to enable successful integration of d-Matrix based solutions.
Responsibilities
Develop end-to-end AI infrastructure reference solutions optimized for d-Matrix servers, covering the compute, networking, storage, and orchestration layers, in collaboration with internal teams.
Create reference blueprints that integrate smoothly into cloud-native and on-prem environments.
Develop infrastructure-as-code templates and examples using Ansible, Terraform, and Helm for provisioning d-Matrix-based nodes and clusters.
Integrate with Kubernetes-based systems to enable model deployment, auto-scaling, and fault-tolerant execution.
Design and deploy telemetry and monitoring frameworks to support real-time visibility into d-Matrix cluster health, job status, and system performance.
Integrate with industry-standard observability stacks (e.g., Prometheus, Grafana, OpenTelemetry) for data collection, visualization, and alerting.
Develop dashboards, health check systems, and metric pipelines that track performance, availability, and operational KPIs.
Collaborate with performance and software teams to validate infrastructure using real-world workloads and benchmarks.
Incorporate telemetry hooks for benchmark reporting and feedback-driven tuning.
Create and publish detailed infrastructure deployment guides, monitoring configuration templates, and operational best practices.
Collaborate with customers and the OEM/ISV ecosystem to enable them to adopt and customize reference solutions for their specific datacenter environments and/or software stacks.
Requirements
Bachelor's or Master's degree in Computer Science or a related technical field.
10+ years of experience in infrastructure solution architecture, systems management, DevOps, or platform engineering roles.
Experience working with GPUs, custom AI accelerators, or heterogeneous compute environments.
Proven expertise in building, managing, and monitoring full-stack AI infrastructure at scale.
Pre Sales Solution Architect at Integrity360 focusing on enhancing clients' cybersecurity maturity and architecting security solutions. Collaborating with Account Management and Product Management teams.
Pre Sales Solution Architect at Integrity360 advising clients on cyber security solutions. Collaborating with technical teams and engaging with C-level customers in Belgium.
Senior Solutions Engineer at TeamViewer demonstrating the value of software solutions to enterprise customers. Collaborating with sales and product teams in complex digital workplace environments.
Client Success Manager managing major accounts for Cox Automotive in the Atlanta area. Fostering relationships with dealership contacts to drive revenue and ensure client satisfaction.
Solution Architect responsible for designing architectural solutions in R&D at Hempel. Overseeing digital platforms and ensuring compliance with enterprise standards.
Process Integration Engineer developing and advising on new semiconductor packaging modules at Applied Materials. Working in Albany, NY or Santa Clara, CA with diverse global teams.
Data Transport Systems Integration Engineer modernizing a multi-cloud environment for the USAF. Managing IBM MQ applications and mentoring junior staff while ensuring security and delivery.
Solution Architect analyzing logistics and production processes for a German family-run company. Focusing on digital transformation and effective software solutions in Hungary.
Manager of Solutions Engineering at Kantata leading pre-sales efforts for strategic clients across EMEA. Responsible for team leadership, mentoring, and enhancing sales performance.
Medior Systems Integration Engineer at Arcadis working on impactful projects. Leading systems integration and supporting multidisciplinary teams in the Netherlands.