Irving, Texas
5 days ago
Lead Data Scientist

Career Area:

Business Technologies, Digital and Data

Job Description:

Your Work Shapes the World at Caterpillar Inc.

When you join Caterpillar, you're joining a global team who cares not just about the work we do – but also about each other.  We are the makers, problem solvers, and future world builders who are creating stronger, more sustainable communities. We don't just talk about progress and innovation here – we make it happen, with our customers, where we work and live. Together, we are building a better world, so we can all enjoy living in it.

Lead Data Scientist

The Lead Data Scientist will be a technical expert, working in a team environment, to support the development, integration & enhancement for Caterpillar in Advanced Analytics and GenAI initiatives for the Technology & Analytics team. Significant responsibilities of this position are collaborating with people; enhancing the team’s creativity; maintaining knowledge of approaches used in similar projects; and pushing the technical bounds of experimentation while meeting customer commitments.

The Lead Data Scientist acts as a technical leader for establishing and maintaining a sound analytic approach for solving the problem at hand. This individual will develop good networks within the technical community to enable them to collaborate on technical solutions, obtain resources and cooperation needed, and remove roadblocks so that they can ensure the success of their team.

What You Will Do:

Conduct Data discovery, data preparation and data processing for business intelligence and ML / AI models.

Exploring, promoting, and implementing semantic data capabilities through data analytics and machine learning techniques.

Leading to define requirements and scope of data analyses, presenting and reporting business insights to management using data visualization technologies.

Conducting research on data model optimization and algorithms to improve effectiveness and accuracy on data analyses.

Plan technical deliverables and definition of done for each sprint for the Jr. Data Scientists/Engineers.

Communicate and Present the Technical Deliverables to stakeholders and business in an easy-to-understand way.

Demonstrate a breadth of knowledge in the application statistical methods and/or digital methods to solve business problems.

Develop, validate, train and implement statistical models, and implement digital solutions.

Participate on 3-4 projects/products concurrently.

Demonstrate strong initiative to research and apply new methods to exceed customer expectations.

Have a strong focus on continual learning in the Analytics field.

Possess thorough statistical and/or digital technology knowledge and the ability to solve low to medium complexity problems.

Have very good communication skills, being able to explain conclusions to customers with limited knowledge and experience with quantitative analytical methods.

What You Have:

Business Statistics: Experience with statistical tools, processes, and practices to describe business results in measurable scales; ability to use statistical tools and processes to assist in making business decisions.

Machine Learning: Extensive knowledge of principles, technologies, and algorithms of machine learning; ability to develop, implement and deliver related systems, products, and services.

Programming Languages: Extensive knowledge of basic concepts and capabilities of applying Python programming to solve business challenges; ability to use tools, techniques, and platforms to write and modify programming languages.

Database Management and Consumption:  Extensive knowledge of data management systems; ability to use, support and access facilities for searching, extracting and formatting data for further use.

Data Analysis:  Posses the ability to conduct thorough data analysis and produce data mapping documents per the business requirements; extensive capabilities in SQL, data modeling, and understanding of data processing including ETL / ELT. 

 Requirements Analysis:  Working knowledge of tools, methods, and techniques of requirement analysis; ability to elicit, analyze and record required business functionality and non-functionality requirements to ensure the success of a system or software development project. 

Top Candidates Will Have:

Extensive experience applying Python (NumPy, SciPy, pandas, etc.) programming to solve business challenges.

Extensive experience with advanced data analysis and statistical methods such as regression, hypothesis testing, ANOVA, statistical process control, etc.

Extensive experience in practical applications of Machine Learning techniques such as Clustering, Logistic Regression, Random Forests, SVM or Neural Networks.

Advanced experience in quantifying the costs, benefits, risks, and chances for success before recommending a course of action.

In-depth technical and critical thinking skills and evidence of continuous learning in the analytics field.

Extensive knowledge of techniques and tools that promote effective analysis; ability to determine the root cause of organizational problems and create alternative solutions that resolve these problems.

Working knowledge with cloud technologies (AWS, Azure, Google Cloud, etc.).

Experience with Snowflake and SQL

Experience with GenAI is plus.

Additional Info:

The primary location for this position is Dallas, TX, Nashville, TN or Peoria, IL.

Must be able to work 3 days ONSITE.

SPONSORSHIP IS NOT AVAILABLE.

RELOCATION IS AVAILABLE for the right candidate.

Skills Descriptors:

Business Statistics:

Level Extensive Experience:

Knowledge of the statistical tools, processes, and practices to describe business results in measurable scales; ability to use statistical tools and processes to assist in making business decisions.

Generates and interprets a wide range of statistical data and reports.

Utilizes in-depth knowledge of statistical tools or applications.

Consults and coaches’ others on sampling approaches and associated statistical significance.

Teaches use of standard deviation, correlation, and covariance on several types of data.

Establishes metrics and surveys with reasonable and appropriate randomness and sample size.

Instructs others in statistical concepts and techniques and their application to business.

Analytical Thinking:

Level Extensive Experience:

Knowledge of techniques and tools that promote effective analysis; ability to determine the root cause of organizational problems and create alternative solutions that resolve these problems.

Seeks discrepancies and inconsistencies in available information; explains variances.

Organizes and prioritizes the sequence of steps to be taken to remedy the situation.

Identifies many probable causes for a problem based on prior experience and current research.

Quantifies the costs, benefits, risks, and chances for success before recommending a course of action.

Approaches a complex problem by breaking it down into its component parts.

Chooses among a diverse set of analytical tools according to the nature of the situation.

Machine Learning:

Level Extensive Experience: 

Knowledge of principles, technologies, and algorithms of machine learning; ability to develop, implement and deliver related systems, products, and services.

Monitors the operation and performance of machine learning projects.

Defines and implements the processes and work flows for machine learning projects.

Selects the appropriate modeling for machine learning projects and calibrates, as necessary.

Explains the logic, algorithms, and emerging techniques of machine learning for beginners.

Performs complex tasks and initiatives of machine learning, such as image recognition.

Develops and applies a training framework about machine learning.

Programming Languages:

Level Extensive Experience:

Knowledge of basic concepts and capabilities of programming; ability to use tools, techniques, and platforms to write and modify programming languages.

Assesses the impact of new productivity improvement tools or techniques for creating and writing programming languages.

Conducts walkthroughs and monitors quality of the development activities.

Consults on key issues and challenges of programming languages associated with the IT environment.

Oversees major developmental efforts adhering to the program's application system design.

Supervises the evaluation of multiple programming languages for diverse environments.

Discusses characteristics and advantages of different programming techniques.

Query and Database Access Tools:

Level Extensive Experience:

Knowledge of data management systems; ability to use, support and access facilities for searching, extracting, and formatting data for further use.

Writes, debugs, and implements complex queries involving multiple tables or databases.

Works with aggregate functions, complex joins, groupings, dynamic and embedded SQL's (Structured Query Languages).

Teaches others about query optimization techniques and facilities.

Consults on query optimization, interactive queries, testing and verification.

Evaluates all major database access tools and functions for distributed databases.

Compares and contrasts the benefits and drawbacks of various SQL products.

Requirements Analysis:

Level Working Knowledge:

Knowledge of tools, methods, and techniques of requirement analysis; ability to elicit, analyze and record required business functionality and non-functionality requirements to ensure the success of a system or software development project.

Follows policies, practices, and standards for determining functional and informational requirements.

Confirms deliverables associated with requirements analysis.

Communicates with customers and users to elicit and gather client requirements.

Participates in the preparation of detailed documentation and requirements.

Utilizes specific organizational methods, tools, and techniques for requirements analysis.

What You Will Get:

Our goal at Caterpillar is for you to have a rewarding career. Our teams are critical to the success of our customers who build a better world.

Here you earn more than just a salary because we value your performance. We offer a total rewards package that provides benefits on day one (medical, dental, vision, RX, and 401K) along with the potential of an annual bonus. Additional benefits include paid vacation days and paid holidays.

All qualified individuals - Including minorities, females, veterans, and individuals with disabilities - are encouraged to apply.

About Caterpillar -

Caterpillar Inc. is the world’s leading manufacturer of construction and mining equipment, off-highway diesel and natural gas engines, industrial gas turbines and diesel-electric locomotives. For nearly one hundred years, we’ve been helping customers build a better, more sustainable world and are committed and contributing to a reduced-carbon future. Our innovative products and services, backed by our global dealer network, provide exceptional value that helps customers succeed.

Visa Sponsorship is not available for this position. This employer is not currently hiring foreign national applicants that require or will require sponsorship tied to a specific employer, such as, H, L, TN, F, J, E, O. As a global company, Caterpillar offers many job opportunities outside of the U.S which can be found through our employment website at www.caterpillar.com/careers.

Posting Dates:

November 7, 2024 - November 20, 2024

Any offer of employment is conditioned upon the successful completion of a drug screen.   

EEO/AA Employer.  All qualified individuals - Including minorities, females, veterans and individuals with disabilities - are encouraged to apply.

Not ready to apply? Join our Talent Community.

Confirm your E-mail: Send Email