Evaluation Scenario Writer - Madrid
3 weeks ago

Job summary
Please submit your CV in English and indicate your level of English proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems.
Responsibilities
- Create structured test cases that simulate complex human workflows
- Define gold-standard behavior and scoring logic to evaluate agent actions
- Analyze agent logs, failure modes, and decision paths
- Work with code repositories and test frameworks to validate your scenarios
Similar jobs
We're looking for someone who can design realistic and structured evaluation scenarios for LLM-based agents. You'll create test cases that simulate human-performed tasks and define gold-standard behavior to compare agent actions against. · ...
1 month ago
This opportunity involves creating test cases that simulate complex human workflows and defining gold-standard behavior to evaluate agent actions. · ...
3 weeks ago
Mindrift connects specialists with project-based AI opportunities for leading tech companies. · ...
1 day ago
Mindrift connects specialists with project-based AI opportunities for leading tech companies. · ...
1 day ago
Please submit your CV in English and indicate your level of English proficiency. · Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent ...
1 day ago
Please submit your CV in English and indicate your level of English proficiency. · Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. · Create structured test cases that simulate ...
1 week ago
We're looking for someone who can design realistic and structured evaluation scenarios for LLM-based agents. You'll create test cases that simulate human-performed tasks and define gold-standard behavior to compare agent actions against. · At Mindrift, innovation meets opportunit ...
1 month ago
Mindrift connects specialists with project-based AI opportunities for leading tech companies. · Mindrift is looking for Evaluation Scenario Writers to work on projects focused on testing and evaluating AI systems. ...
2 weeks ago
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. · At Mindrift, innovation meets opportunity. · ...
1 month ago
MCP & Tools Python Developer - Agent Evaluation Infrastructure
Registered members only
We're on the hunt for hands-on Python engineers for a new project focused on developing Model Context Protocol (MCP) servers and internal tools for running and evaluating agent behavior. · You'll implement base methods for agent action verification, integrate with internal and cl ...
1 month ago
Join our creative team as a YouTube Script Writer and help bring engaging stories to life for a global audience. · ...
4 days ago
You will work on the new generation of reactors like the AP300 SMR and eVinci microreactor, the new fleet of AP1000 and the operating fleet from all over the world. · You will acquire knowledge about writing operating procedures to define the way control room operators run pl ...
2 weeks ago
Welcome to the future of nuclear energy, where Westinghouse Electric Company is leading the field with expertise and progress to shape the power of tomorrow. · Innovation is in our DNA. We are creative. We think differently. We reimagine the possible across the nuclear industry e ...
1 month ago
+ Job summary: As an SRO Principal Engineer you will work on the new generation of reactors like the AP300 SMR and eVinci microreactor, the new fleet of AP1000 and the operating fleet from all over the world. · + Qualifications: Reactor Operator (RO) or Senior Reactor Operator (S ...
3 weeks ago
Welcome to the future of nuclear energy, where Westinghouse Electric Company is leading the field with expertise and progress to shape the power of tomorrow. · ...
1 month ago
We are creative. We think differently. We reimagine the possible across the nuclear industry every day. · You will acquire knowledge about writing operating procedures to define the way control room operators run plants in normal, abnormal and emergency conditions. · You will develop ...
3 weeks ago