About this role

Our client’s mission is to empower people, build community, and bring the world closer together. Through their apps and services, they are building a different kind of company that connects people worldwide and provides meaningful ways to share what matters most.

We are looking for a Software Engineer, LLM Evaluation, to join their team in the Netherlands on a remote basis.

The successful candidate will work closely with researchers and engineers to evaluate model performance, conduct data-driven analyses, and contribute to research initiatives related to LLM pretraining and evaluation.

Job Profile for Software Engineer, LLM Evaluation
Responsibilities will include, but not be limited to:

Analyse and evaluate large language models and their performance across various tasks and benchmarks
Execute benchmark evaluations and generate performance metrics and insights
Conduct quantitative and qualitative data analysis to support research objectives
Contribute to the design, implementation, and validation of new evaluation methodologies
Support research initiatives related to LLM pretraining and model evaluation
Develop and maintain machine learning and deep learning systems and tooling
Build research workflows and experimental frameworks using Python and PyTorch
Enable rapid experimentation and support the execution of research initiatives
Collaborate with researchers and engineers within a multidisciplinary team environment

Candidate Profile for Software Engineer, LLM Evaluation

Must be fluent in English, both written and spoken
Bachelor's degree in Computer Science, Artificial Intelligence, Machine Learning, or a related technical discipline. Master's degree in Computer Science, Machine Learning, AI, or a related discipline is desirable; a PhD in a relevant field would be considered a strong advantage
3–5 years of experience working with large language models, including pretraining and evaluation
Strong hands-on programming experience in Python
Proven experience building and maintaining ML/DL systems and research infrastructure
Experience developing machine learning and deep learning solutions using PyTorch
Publications in machine learning, natural language processing, artificial intelligence, or related fields are a plus
Experience working with transformer architectures, large language models, and/or multimodal models is an advantage
Ability to write clean, efficient, and production-quality code
Strong interest in model evaluation, experimentation, and data analysis
Demonstrated scientific curiosity and problem-solving skills

What Our Client Offers

25 days of holiday per annum
Pension plan
Opportunity to work alongside experienced researchers and engineers on cutting-edge AI initiatives
Exposure to state-of-the-art methodologies and technologies within the LLM ecosystem
Collaborative and research-driven environment in a technologically advanced office
