Do you have solid experience in computer programming, machine learning and large scale data processing using Spark or Hadoop/ MapReduce and do you have foremost appetite to learn new technologies? This might be your next challenge!
In line with the Elsevier corporate strategy of greater content volume, types and sophistication, the services that Elsevier provides are becoming increasingly dependent on Smart Content. We are looking for a Software Engineer with Machine Learning capabilities for our Core Capabilities team in executing that strategy.
As a Software Engineer Machine Learning you will be working with our business units on developing our content and information offering to end customers. These services may rely on existing text and data mining and classification processes that require improvements, but we also like to challenge you to be creative and envision new processes and procedures based upon your Machine Learning expertise.
Both a publisher and information solutions provider, Elsevier is looking for someone that is able to work on information from internal and external sources and using different (or no) data standards. Our search solutions depend heavily on concept indexing or annotation (for example using ontologies in the medical domain), relationship extraction or extracting data from images and tables.
You will be working in Elsevier Operations within a cross-functional team of IT and product colleagues to pilot and develop new methods of extracting and surfacing information relevant to our customers for new product development. When successful, you will support the implementation of industry-scale high-quality production systems.
This position is based in Amsterdam and you will report directly to the Head of Core Capabilities.
Text and data mining
Bring active experience in to the organization on extraction text and data information from structured and unstructured data
You are well-versed in machine learning and bring new processes into the organization in order to improve (in cost and time-efficiency) the data excerption processes that Elsevier is engaged with
Contribute to the content strategy
Actively contribute to product and operational content strategies by identifying and ingesting new technical capabilities to forward Elsevier mission of leading the way in advancing science, technology and medicine
Using the available base data and actively promoting new ideas of using this data to enhance our competitive offerings
Data analytics to support business and products
Analyze extracted information to drive such processes as automated and manual data cleansing. Data analytics can also be used to identify research trends, or drive decision for our content acquisition strategy
Using visualizations tools to present the extracted data to be ready for consumption will be another key ability
Serve as internal and external specialist on data extraction and Machine Learning matters
Serve as a Machine Learning expert for the Core Capabilities team in the Content and Innovation group
Actively contribute to a culture of product and process innovation
Being a trusted resource in new development projects in Elsevier
Acting as a liaison between IT developers and (content) subject matters experts, translating information needs into software development
Coaching junior members on a need basis
What you should bring
University graduate (Master or PhD level) in Computer Science, Artificial Intelligence, Computational Linguistics or an associated area
3 – 5 years of experience in Machine Learning or a similar role
Solid software engineering skills and experience including coding, testing, troubleshooting and deployment
Experience using key languages like JVM-based languages (Java, Clojure), C++ and Python
Large scale data processing experience using Spark or Hadoop/MapReduce
Solid Experience in machine learning including supervised or unsupervised learning techniques and algorithms (e.g. k-NN, SVM, RVM, Naïve Bayes, Decision trees, etc.)
Familiarity with cloud computing (AWS)
Experience and/or interest in Scala is a plus
Relevant certificates (Spark, Hadoop/Cloudera or CBIP) is a plus
Experience with Git or a similarly distributed revision control system
You think a working proof-of-concept is the best way to make a point
Experience working with a variety of stakeholders at the mid and senior management level and ability to coach junior members
Level of experience (years):
Mid Career (2+ years of experience)
How to apply:
Please mention NLP People as a source when applying
A leading provider of science and health information, Elsevier partners with experts around the globe to develop world-class content, delivering it in ways that fuel discovery, drive innovation and improve health care. Our global community comprises over 7,000 journal editors, 70,000 editorial board members, 300,000 reviewers and 600,000 authors. They are scientists and clinicians; authors and editors, professors and students; information professionals and decision makers.
We are a global company headquartered in Amsterdam, employing more than 7,000 people in 24 countries. Elsevier's roots are in journal and book publishing, where we have fostered the peer-review process for more than 130 years. Today we are driving innovation by delivering authoritative content with cutting-edge technology, allowing our customers to find the answers they need quickly.