Axtria is a global big data analytics company. We empower leaders across the Life Sciences and Financial Services industries to make better data-driven decisions.

Our data analytics and software platforms support data science, commercial operations and cloud information management. We help to optimize business strategy by delivering cutting edge analytics from the broadest set of data sources, combined with deep technical and domain expertise. We enable commercial excellence by eliminating spreadsheets and delivering analytical guidance to the field through Axtria SalesIQ™, our cloud based sales planning and operations platform. We are leaders in managing data using the latest cloud information management and big data technologies.

We have more than 850 employees worldwide, we are growing rapidly, and we are proud to count 8 of the top 10 global Life Sciences companies and 2 of the Top 5 global banks as our customers. We serve clients with a high-touch on-site and onshore presence, leveraged by a global delivery platform.

We are seeking an NLP Data Scientist having experience with Machine Learning and PySpark to join our fast paced, growing organization.




Ability to work in a fast paced environment and under tight deadlines having more than 5 years of experience
Experience with Cloudera HDFS (CDH)
Experience with No SQL Database – HBASE
Good Programming Experience with HIVE, Spark (DDL and Dataframes) , Spark MLib
Programming Language – Python (pyspark), Scala (good to have), HQL, Impala, Spark SQL, Shell scripting
Good to have experience with OOzie scheduling and understands Kerberos, Sentry
Good to have – understanding of Tessaract, OpenCV, Imagemagic , OCR libraries for image processing
Statistical understanding – good to have, Statistical Analysis (GLM, Naïve Bayes), ensembles (GBM, Distribited Random Forest)

Level of experience (years):

Senior (5+ years of experience)

How to apply:

Please mention NLP People as a source when applying


Tagged as: , ,

About Axtria

Axtria is a Big Data Analytics software and services company that enables customers to transform their business with data and analytics.