Data Scientist - SME3
The candidate will build algorithms to meet business requirements and use statistical, mathematical, and predictive modelling skills to build a comprehensive platform for data analysis; use Business Intelligence tools to create analytical models; integrate data sets and work with multiple systems to extract and use data for deeper analysis; have the ability to use the Hadoop environment with Hive and MapReduce skills; be able to build programs in programming languages and scripts; have an understanding of and use skills for Natural Language Processing, Machine Learning, Statistical Analysis, Predictive Modelling, and Hypothesis Testing.
Skills Requirements:
Required Skills:
.Experience with statistical analysis.
.Working knowledge of scripting languages.
.Experience working in a Big Data environment.
Desired Skills:
.Proficiency at transforming data, data classification and translations, as well as resolving data quality and data cleansing
.Intermediate design and use of relational databases including experience working with working knowledge of dimensional modeling, star schemas and working with time-series data
.Experience working with Hadoop (Cloudera, Hortonworks, etc.) and working with MapReduce development, using Pig and Hive
.Experience with NoSQL databases such as Cassandra, Hbase, CouchDB
.Advanced skills/expertise in data mining, text mining or distributed computing
.Experience and proficiency in utilizing statistical/analytic packages such as SAS, R, SPSS, S-Plus, Matlab to develop statistical models
.Proficiency and advanced ability to leverage scripting languages: Python, Perl, Ruby
.Software development skills in Java including working with Mahout, including integrating with search engines/libraries such as Lucene and/or Solr
.Excellent research, analytical, written and oral communications skills.
Education:
B.S. plus 11 years of related experience