- Identification of open/public data sources: Proactively identify and evaluate open and public data sources for the creation of extensive corpora in Spanish and co-official languages. This includes scouting for datasets that are relevant to the group's research focus on language models, including translation, audio processing, and large language models (LLMs).
- Engagement with data providers: Act as the primary contact point for negotiations and communications with external data providers, including public entities, companies, and other research institutions. Establish and maintain relationships to secure access to valuable data resources.
- Data acquisition strategy design: Develop and implement strategies for the efficient acquisition of external data. This includes outlining procedures for data requests, licensing negotiations, and ensuring compliance with data privacy regulations.
- Data management and governance: Collaborate on data management protocols to ensure the integrity, confidentiality, and availability of data.
- Dissemination and engagement activities: Lead the dissemination of findings and datasets within the scientific community and beyond. This includes publishing data reports, contributing to academic papers, and presenting at conferences. Also, engage with the broader research community to foster collaborations and share best practices in data management.
- Manage corpora and language data according to the requirements specified in the Unit's data management.
- Monitor the application of data protection, licensing, and security rules.
- Control the quality of collected data and metadata.
- Compliance and ethics oversight: Ensure all data management activities comply with relevant laws, ethical standards, and best practices in data handling. This includes overseeing the ethical review of data sources and uses, as well as managing any data protection implications.
- Education
- Essential Knowledge and Professional Experience
- Competences
Data Engineer For Language Technologies - Barcelona, Spain - Barcelona Supercomputing Center
Description
Context And Mission
The Language Technologies (LT) Unit at BSC has a consolidated experience in several NLP areas, such as massive language model building, biomedical text mining, machine translation and unsupervised learning.
It has been entrusted by the Spanish and Catalan governments with the mission to develop essential open-source resources and technologies for Spanish and Catalan.
In connection with this, the LT Unit is currently in charge of two flagship projects at the national and regional levels: the Spanish National Plan for the Advancement of Language Technology, funded by the Spanish Secretariat of Digitalisation and Artificial Intelligence, and the AINA project, aimed at developing AI resources for Catalan, funded by the Catalan Digitalisation Department.
In addition, the Unit participates in various EU-funded international projects.
The Language Technologies Unit at BSC is seeking a Data Manager with experience in language technologies to lead the development of the largest curated Spanish language corpus.
This corpus will be used to train reference foundational LLMs.
The successful candidate will work in a highly sophisticated HPC environment, have access to state-of-the-art systems and computational infrastructures, and establish collaborations with experts in different areas at the local and international levels.
Key Duties
Strong understanding of data acquisition strategies, including licensing negotiations and compliance with data privacy regulations.
Knowledge of open/public data sources relevant to language models, translation, audio processing, and large language models (LLMs).
Familiarity with data governance principles, including data integrity, confidentiality, and availability.
Excellent communication and negotiation skills for engaging with external data providers and stakeholders.
Experience in disseminating findings and datasets within the scientific community through reports, academic papers, and conference presentations.
Strong attention to detail and ability to control the quality of collected data and metadata.
Knowledge of compliance requirements and ethical standards in data management.
Excellent understanding of data administration and management functions (governance, transfer, storage, analysis, distribution, exploration, etc.).
Understanding of data privacy laws, ethical considerations in data handling, and best practices in data governance.
Experience in establishing and maintaining partnerships with data providers, research institutions, and other relevant organizations.
Fluent in written and spoken Catalan
Willingness to stay abreast of new data sources, technologies, and methodologies in the rapidly evolving field of language technologies.
Strong organizational skills, with the ability to manage multiple tasks simultaneously and meet deadlines.
Ability to work independently and in a team to complete tasks on schedule.
Ability to work under set deadlines.
Conditions
The position will be located at BSC within the Life Sciences Department.
We offer a full-time contract (37.5h/week), a good and highly stimulating working environment with state-of-the-art infrastructure, flexible working hours, an extensive training plan, restaurant tickets, private health insurance, and support with relocation procedures.
Duration: Open-ended contract due to technical and scientific activities linked to the project and budget duration
Holidays: 23 paid vacation days plus 24th and 31st of December per our collective agreement
Salary: We offer a competitive salary commensurate with the qualifications and experience of the candidate and according to the cost of living in Barcelona
Starting date: As soon as possible
Application procedure and process
All applications must be made through the BSC website and contain:
A full CV in English including contact details
A Cover Letter with a statement of interest in English, including two contacts for further references - Applications without this document will not be considered
In accordance with the OTM-R principles, a gender-balanced recruitment panel is formed for every vacancy at the beginning of the process.
After reviewing the content of the applications, the panel will start the interviews, with at least one technical and one administrative interview.
A profile questionnaire as well as a technical exercise may be required during the process.
The panel will make a final decision, and all candidates who had contact with the panel will receive feedback with details on the acceptance or rejection of their profile.
At BSC we are seeking continuous improvement in our recruitment processes. For any suggestions, feedback, or complaints about our recruitment processes, please contact
.