Health Sciences Research IT is seeking a Data Engineer to join our team serving the schools of the health sciences. The candidate will work with other team members and research groups to design pipelines for data ingestion/transformation and implement the designs and execute them. Sometimes this will be for ingesting thousands of files and others single large files in various formats. They will work with a large number of different research groups on many projects. The candidate will work with the teams to understand the data query requirements to optimize query performance with indices/views/functions etc. and be able to create analytic files based on specifications provided. Some research teams will require data warehouse development in creating models, data domains, and cubes and optimize for OLAP. The candidate must understand data warehousing concepts. The candidate will be responsible for DBA functions as well such as permissions, optimization, query optimization, performance monitoring, etc. Our team has a great mix of junior and experienced people and this position will mentor some of our more junior members.
The ideal candidate will have five or more years of experience working in a MS SQL Server environment in both writing data ingestion pipelines/jobs as well as database administration and be a strong mentor. The candidate must have strong communication and documentation skills. Data experience in Azure, AWS, Snowflake, and NoSQL as well as experience with EHR and claims data is preferred but not required.
Evaluates business requirements, creates advanced data ingestion processes and modeling, and provides extensive support for databases and relevant services. Designs new data architectures. Ensures data quality and delivery. Trains and assists lower-level data engineers; often serves as team lead. Supports data analysts and scientists with expert-level research and consulting.
Evaluates business requirements, creates advanced data ingestion processes and modeling, and provides extensive support for databases and relevant services. Designs new data architectures. Ensures data quality and delivery. Trains and assists lower-level data engineers; often serves as team lead. Supports data analysts and scientists with expert-level research and consulting.
Design pipelines
Data ingestion/transformation
Work independently on multiple projects
Work with groups to understand and document project requirements
Data warehouse development: models, data domains, and cubes and optimize for OLAP
All DBA functions
Query and database optimization
At least 5 years of experience with SQL server
Mentor Junior members