Estágio em Ciência de Dados apoiando equipe na criação de pipelines de análise e modelos de linguagem. Envolvimento com inteligência artificial e colaboração em projetos de inovação.
Responsibilities
Support descriptive and predictive analyses in the context of large language models (LLMs);
Participate in building and evaluating pipelines involving fine-tuning, instruction tuning, and LLM integration;
Support the development and testing of intelligent agents, including the use of external tools and knowledge bases;
Assist in defining performance metrics and interpreting model results;
Contribute to methodologies, assessment of model limitations, and best practices for model use.
Requirements
Currently enrolled in a degree program in Computer Engineering, Computer Science, or a related field;
Proficiency in Python;
Familiarity with machine learning and NLP libraries and frameworks such as Hugging Face Transformers, LangChain, OpenAI API, TensorFlow and/or PyTorch, and basic knowledge of fine-tuning and RAG (Retrieval-Augmented Generation) techniques;
Basic knowledge of Git and SQL, and familiarity with shell scripting and Linux environments;
Advanced English (for reading documentation and technical articles).
Software Development role at GDIT requiring a TS/SCI clearance with Polygraph. Engaging in government projects for software innovation and development in Virginia.
Coordenador de Ciência de Dados leading the data team in developing analytical solutions for Caixa Vida e Previdência. Focused on data science and project coordination in a collaborative environment.
Lead and deliver AI initiatives focusing on traditional machine learning and generative AI for Mashreq Bank. Build, scale, and productionize models for business transformation and operational efficiency.
Data Scientist designing and implementing analytical solutions using Python and AI technologies. Collaborating cross - functionally to deliver Generative AI solutions for business needs in a hybrid setup.
Data Scientist developing machine learning models and analytics solutions to improve decisions in the mortgage lifecycle from acquisition to servicing. Collaborating with teams on predictive modeling and automation workflows.
Data Scientist developing predictive models and automation workflows for the mortgage lifecycle. Collaborating with cross - functional teams to enhance operational efficiency and customer outcomes.
Senior Manager in Data Science and Analytics leading statistical modeling for mortgage and consumer lending. Collaborating with teams to deliver data - driven insights and improve operational efficiency.
Internship for AI in document processing at ArianeGroup, focusing on natural language processing and data analysis tasks in a collaborative environment.
Senior Data Scientist focused on Generative AI and LLM at Manulife. Develop and implement machine learning models to solve business problems and mentor peers.