Unbabel’s “Translation as a Service” platform allows modern enterprises to understand and be understood by their customers in dozens of languages.

Powered by AI and refined by a global community of tens of thousands of human linguists, Unbabel delivers professional-grade content at the scale required by modern enterprises like Facebook, Microsoft, Under Armour, Pinterest and Expedia.

Backed by Scale Venture Partners, Notion, Microsoft Ventures, Salesforce Ventures, Samsung NEXT and Y Combinator, Unbabel is accelerating the shift to a world without language barriers.
We are a diverse team, working every day to build an outstanding organisational culture, based on strong values of transparency, team spirit and continuous learning, with a fast-paced Silicon Valley atmosphere in the beautiful city of Lisbon, Portugal.

To strengthen and augment our Applied AI team and the Data tribe, we are looking for a creative and experienced data acquisition and management specialist (aka “data czar(itza)”) to help design and develop systems and processes for efficient data acquisition, processing and management. We are primarily focusing on the acquisition and management of the training data for our internal AI systems, creation of data derivatives and estimation of data efficiency when integrated with the in-house AI systems. You’ll be working closely with other AI researchers, data engineers, software engineers, product managers, and data analysts in developing truly disruptive products that are changing the way the world communicates. This multidisciplinary position requires a combination of technical, project management and business skills that makes it a great role to grow both technically and managerially.

Replicate and implement latest scientific and engineering works in the field of data crawling, alignment, filtering and storage at Unbabel
Design, plan, implement, test and analyse parallel and comparable data crawler
Manage the process of data acquisition from free and commercial repositories
Assess ROI of commercial data sets and create business justification and documentation for acquisition of commercial data
Design, plan, implement, test and analyse parallel and comparable data crawler
Investigate and implement SOTA data cleaning, filtering and selection methods for Unbabel’s use-cases
Help develop and maintain Unbabel’s data storage and retrieval infrastructure.
Have a lot of fun being part of the Unbabel team




MS degree in Informatics, Mathematics, Computer Science, Machine Learning or Statistics, major is preferred
A solid foundation in Computer Science
1+ years of hands-on experience with text processing techniques and tools
2+ years of experience programming in Python, C++ or Java
1+ experience in business environment in a project manager, product manager or other non-technical role
Fluency in Bash scripting
Knowledge of web crawling libraries and tools, as well as will be a big plus
Another advantage will be practical experience with major parallel corpora (EuroParl, United Nations, etc.)

Language requirements:

Strong verbal and written communication skills in fluent English (C1)

Educational level:

Master Degree

Level of experience (years):

Mid Career (2+ years of experience)

Tagged as: , , ,

You can apply to this job and others using your online resume. Click the link below to submit your online resume and email your application to this employer.

About Unbabel

online translation service