Freelance Agent Evaluation Engineer - Barcelona - Mindrift

    Mindrift
    Mindrift Barcelona

    1 week ago

    temporary
    Description
    Overview


    Please read the following job description carefully to make sure you fit the profile before submitting your application.

    Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.

    What This Opportunity Involves

    While each project involves unique tasks, contributors may:
    Create structured test cases that simulate complex human workflows
    Define gold-standard behavior and scoring logic to evaluate agent actions
    Analyze agent logs, failure modes, and decision paths
    Work with code repositories and test frameworks to validate your scenarios
    Iterate on prompts, instructions, and test cases to improve clarity and difficulty
    Ensure that scenarios are production-ready, easy to run, and reusable

    What We Look For

    This opportunity is a good fit for software engineers open to part-time, non-permanent projects.


    Ideally, contributors will have:
    3+ years of software development experience with a strong Python focus
    Experience with Git and code repositories
    Comfort with structured formats like JSON/YAML for scenario description
    Understanding of core LLM limitations (hallucinations, bias, context limits) and how these affect evaluation design
    Familiarity with Docker
    English proficiency - B2

    How It Works

    Apply → Pass qualification(s) → Join a project → Complete tasks → Get paid

    Project time expectations

    Tasks for this project are estimated to take 6-10 hours to complete, depending on complexity. This is an estimate and not a schedule requirement; you choose when and how to work. Tasks must be submitted by the deadline and meet the listed acceptance criteria to be accepted.

    Payment

    Paid contributions, with rates up to $30/hour*
    Fixed project rate or individual rates, depending on the project
    Some projects include incentive payments

    Note:
    Rates vary based on expertise, skills assessment, location, project needs, and other factors. Higher rates may be offered to highly specialized experts. Lower rates may apply during onboarding or non-core project phases. Payment details are shared per project.


