Genpact (NYSE: G) is a global professional services and solutions firm delivering outcomes that shape the future. Our 125,000 people across 30 countries are driven by our innate curiosity, entrepreneurial agility, and desire to create lasting value for clients. Powered by our purpose - the relentless pursuit of a world that works better for people - we serve and transform leading enterprises, including the Fortune Global 500, with our deep business and industry knowledge, digital operations services, and expertise in data, technology, and AI.
Inviting applications for the role of Lead Consultant –COE Data Engineer
Works with Client’s Data Science and Data Engineering team to ensure delivery of data operations solutions, consistent with a long-term data strategy. We are an AWS / Python based department looking for strong self-starters who can have input and vision in where they would want the department to go.
Responsibilities
· Oversees an assigned Data Science data operation, Architecture and Software Design.
· setting up or migrated into and worked in AWS environments
· A successful history of manipulating, processing, and extracting value from large, disconnected datasets.
· Should have very well experienced in writing complex SQL using Joins and window functions .
· Strong skill sets required with Python library like Pandas ,numpy ,wrangler and Pyspark
· Ability to assemble large, complex data sets that meet functional / non-functional business requirements using AWS data services like ( datasync ,glue,athena,lambda,redshift ,aurora )
· Close interaction with both the data scientists and data warehouse team to deploy emerging technologies to all areas of the business.
· Build processes supporting data transformation, data structures, metadata, dependency, and workload management.
· Ensure a high-performing team through the development of strong development processes including the development of best practice
Qualifications we seek in you
Minimum Qualifications
· Bachelor's degree in Computer Science, Computer Engineering, Informatics, MIS, or related field
· technical experience with the above qualifications.
· Experience using cloud-based technology, especially data operation services AWS datasync ,glue,athena,lambda,redshift ,aurora.
· Mastery of ELT principles (such as SCD, parallel processing, partition, etc.)
· Experience handling bigdata (parquet, ORC formats, hive based metadata) and real-time data processing
Preferred Qualiifications
· Experience with medical malpractice, workers compensation, or other low-frequency, high-severity industries is preferred, but not required. Experience with dirty medical data is also preferred
· Exposure of the complete lifecycle (from idea to execution) for machine learning projects in Python or R would be very helpful.
· Docker or similar tools for scripting deployment.
Genpact is an Equal Opportunity Employer and considers applicants for all positions without regard to race, color, religion or belief, sex, age, national origin, citizenship status, marital status, military/veteran status, genetic information, sexual orientation, gender identity, physical or mental disability or any other characteristic protected by applicable laws. Genpact is committed to creating a dynamic work environment that values diversity and inclusion, respect and integrity, customer focus, and innovation. Get to know us at www.genpact.com and on X, Facebook, LinkedIn, and YouTube.
Furthermore, please do note that Genpact does not charge fees to process job applications and applicants are not required to pay to participate in our hiring process in any other way. Examples of such scams include purchasing a 'starter kit,' paying to apply, or purchasing equipment or training.