Open-source Software Engineer at Mistral AI developing and maintaining state-of-the-art AI models and libraries. Collaborating with the open-source community to drive innovation.
Responsibilities
You will be in charge of open-sourcing state-of-the-art models, whilst maintaining and improving Mistral’s publicly available libraries.
Your work is critical in helping turn research breakthroughs into tangible solutions and improve Mistral's open-source ecosystem.
Releasing our models to open-source platforms and libraries, e.g., vLLM, GitHub, Hugging Face.
Create and maintain tooling and services: both internal facing (internal research) and external facing (open-source libraries).
Implement and optimize open-source and internal libraries for performance and accuracy, ensuring production readiness and employing cutting-edge technology and innovative approaches.
Collaborate with the open-source community (PyTorch, vLLM, Hugging Face).
Requirements
Master’s degree in Computer Science, Machine Learning, Data Science, or a related field
Experience contributing to popular open-source libraries such as PyTorch, Tensorflow, JAX, vLLM, Transformers, Llama.cpp, ...
Passion for contributing to the open-source software ecosystem
Expert programming skills in Python, PyTorch, MLOps
Adaptable, proactive, and autonomous
Attention to detail and a drive to go the last mile to build almost perfect tools
Deep understanding of machine learning approaches, especially LLMs and algorithms
Low-ego, collaborative and have a real team player mindset
Experience with training and fine-tuning large language models (e.g., distillation, supervised fine-tuning, policy optimization) - ideal
Experience working with Slurm - ideal
Worked with research teams before - ideal
Experience as a core-maintainer of a popular ML open-source library - ideal
Senior Machine Learning Engineer at Bumble developing scalable AI systems for personalized user interactions. Leading machine learning model development and deployment from exploration to production.
Lead Machine Learning Engineer at Bumble shaping user connections through machine learning. Driving end - to - end AI solutions while mentoring engineers in a hybrid work environment.
Designing and operating cloud - based MLOps capabilities supporting analytical and generative AI models. Collaborating with data science and business teams for high - impact AI solutions.
Machine Learning Engineer analyzing data structures and developing ML models for customer profiling in Azerbaijan. Collaborating on probabilistic modeling and data quality improvement.
Machine Learning Engineer at HackerRank working on integrity systems to improve model quality. Collaborating on strategies for new signals like audio analysis and behavioral anomalies.
Machine Learning Engineer developing integrity systems for assessing model quality at HackerRank. Collaborating on multimodal signal processing and improving model performance.
Architect designing enterprise - grade AI/ML architectures for Quantiphi. Leading AI applications and ML strategy with a focus on scalability, security, and integration.
Software Engineer for ML Infrastructure at Slack, architecting systems to support large scale AI deployment and reliability. Engage in deep systems engineering focusing on ML lifecycle and infrastructure scalability.
Machine Learning Engineer at Winnow developing AI solutions for food waste reduction. Collaborate with cross - functional teams and leverage cutting - edge technologies in food recognition.
Senior Engineer developing AI/ML solutions to enhance patient care at Edwards Lifesciences. Collaborating with cross - functional teams to deliver impactful technologies in healthcare.