AI Research Scientist developing advanced technologies related to multimodal models at Mercari's R4D team. Collaborating on machine learning and computer vision projects that impact e-commerce platforms.
Responsibilities
Research and develop new technologies related to multimodal foundation models and generative AI, including computer vision and natural language processing (NLP).
Design and execute large-scale model training and conduct performance evaluations across multiple benchmarks.
Develop prototypes for new algorithms and present research results both internally and externally.
Collaborate with researchers, engineers, and business teams to research and implement advanced technologies for next-generation AI services.
Requirements
Ph.D. in Computer Science or a related field, or equivalent professional experience.
Practical experience in computer vision or multimodal learning within the field of machine learning or generative models.
Record of publications in top-tier conferences or journals such as CVPR, ICCV, ECCV, or NeurIPS.
Development experience using general-purpose programming languages such as Python or C++.
Experience using machine learning frameworks such as PyTorch or TensorFlow.
Familiarity with modern development tools and cloud environments, including Git, Docker, AWS, or GCP.
3+ years of professional experience in software or research and development fields.
Experience training Large Language Models (LLMs) or Large Vision-Language Models.
Implementation experience in supervised learning, self-supervised learning, reinforcement learning, or graph learning, including pre-training and fine-tuning techniques.
Experience in building large-scale datasets.
Experience with Retrieval-Augmented Generation (RAG) and large-scale model deployment.
Experience in designing and operating distributed learning environments or training pipelines.
Business-level Japanese proficiency, with smooth reading, writing, and listening skills.
Advanced AI Scientist at HP responsible for architecture and leadership of AI ecosystems. Leading projects in data mining, modeling techniques, and automation systems to drive business innovation.
Graduate Machine Learning Researcher at Longshot Systems designing and improving predictive models for sports betting analytics with a focus on innovation and R&D.
AI Research Intern at Toyota Research Institute exploring AI applications in enhancing wellbeing. Collaborate in developing innovative approaches within the Human - Centered AI Division.
Machine Learning Researcher at Astera Institute focusing on data - efficient and general model induction. Collaborating on innovative architectures with a focus on performance and throughput.
AI Research Intern focusing on advancing deep learning techniques in financial products at TD. Collaborating on large - scale datasets and representing the team at ML conferences.
AI Research Engineer designing downstream AI models operationalizing clinical endpoints in breast cancer care. Collaborating with clinical experts to enhance healthcare innovation in medical imaging.
Machine Learning Engineer designing, implementing, and deploying ML solutions for GEICO. Collaborating with cross - functional teams to integrate ML models and ensure business impact.
Intern AI Researcher at Analog Devices exploring AI models for efficient edge computing. Collaborating with experts to drive breakthroughs in model optimization and compression.
Senior AI Researcher at Dolby developing audio and video technologies with a focus on deep learning. Partnering with experts to innovate in multimedia analysis, processing, and rendering.