The NLP expert is responsible for designing and developing state-of-the-art natural language processing capabilities in the web application framework of Uni3T. He or she will design new ways to process, model, and analyze vast quantities of unstructured data from the life science domain, based on the needs of the company. The NLP expert possesses advanced analytical skills as well as exceptional oral and written communication abilities. The NLP expert processes research information for easier consumption and transforms it into actionable plans. He or she will also provide value to the business through findings and thoughtful insights.
As our NLP expert at Uni3T, you will work collaboratively in a cross-functional team composed of AI/data scientists, developers, and architecture team members. A successful candidate will combine expertise in artificial intelligence (focused on information extraction and retrieval, knowledge representation, and machine learning techniques), subject matter knowledge, and an interest in doing practical research in life science, biotechnologies, and pharmaceutics.
Design and build new NLP pipelines for modeling, data mining, and production purposes.
Design, develop, and deploy algorithms based on ML and NLP best practices to tackle the hard problems of Uni3T’s software platform that involve structured and unstructured data.
Perform and interpret ML and NLP studies and product experiments concerning new data sources or new uses for existing data sources.
Develop prototypes, proofs of concept, algorithms, predictive models, and custom analyses.
Search for patterns in structured and unstructured data that can provide solutions to business problems or create new business opportunities.
Use advanced computational techniques to analyze vast pools of data under the limited guidance of the lead/principal data scientist.
Communicate clearly and effectively with both clients and multi-disciplinary teams.
Strengthen the artificial intelligence team’s capabilities through machine learning mastery.
Master’s degree (MSc) in Computational Linguistics, Cognitive Science, Computer Science, or a related field.
Experience using Python (pandas, scikit-learn, mlpy, NLTK, gensim, spaCy, TextBlob, Orange, etc.), R (caret, tm, quanteda, NLP, openNLP, lsa, etc.), or other languages (such as Java: Mallet, Apache OpenNLP, Stanford Topic Modeling Toolbox, etc.) for data preparation, analysis, and machine learning (classification, regression, clustering, dimensionality reduction).
Experience working on one or more projects involving components such as big data, text mining, or statistical machine learning.
Experience with text classification, representation learning, and language modeling techniques, such as the Word2Vec, GloVe, and FastText algorithms.
Knowledge of the algorithms and techniques of a computational domain with emphasis on text processing, performance, and scalability.
Working knowledge of database design and interaction (such as SQL/relational databases).
Ability to cope under high demand, handle multiple priorities, projects, and tasks, and meet tight deadlines.
Strong oral and written communication skills. Must be able to interact cross-functionally and drive both business and technical discussions.
Doctoral degree (PhD) in Computer Science, Mathematics, Engineering, or a related field.
Knowledge and experience of deep learning techniques in artificial intelligence, with libraries such as TensorFlow, PyTorch, Caffe2, Keras, or Theano.
Experience with cloud computing infrastructure (AWS, MS Azure, etc.).
Experience applying NLP knowledge and techniques to data related to bioinformatics and medical informatics. Experience in biomedical text mining (BioNLP) is a highly valuable asset.
At least 2 to 3 years of experience in the private sector. Experience in the life science, biotechnology, and/or pharmaceutical industries is a big plus.
Level of experience (years):
Mid Career (2+ years of experience)
How to apply:
Please mention NLP People as a source when applying.