The Data Platforms team is responsible for collecting, organizing, and analyzing all the data that is collected as the result of users interacting with our system with the goal of building machine learning and data science products to improve user experience and merchant operations as well as monitoring the well being of our systems overall.
Bolt is looking for data engineers and backend software engineers for the data platforms team. You will be working closely with all teams and cross-functional partners (including product, engineering, and data analytics) to build and improve our foundational data stack powering business analytics, machine learning, recommendations, etc. You will be responsible for creating a strategy for data ownership at Bolt and help define the data architecture, data model, and pipelines to drive understanding of Bolt’s business.
We are looking for someone who is excited when facing big challenges, thrives when given autonomy to figure out solutions and loves diving deep into complex systems.
ResponsibilitiesIn addition to general problem-solving you will:
Participate in creating and executing a roadmap for all things data infrastructure. Build scalable systems that effectively store and process tons of data. Work with cross-functional partners (product, infra, security, analysts) to power data-driven products. Create fault-tolerant, timely, and optimized pipelines for data ingestion powering the company’s business analytics. Evangelize best practices with the engineering, infra, analyst teams for building data models, pipelines, and materialized views. Standardize access to data across teams, and build tooling to reuse queries. Partner with our AI/ML team to make it easier to do feature engineering and build a reliable data stack to power Bolt’s AI/ML products Own data reliability and help assess and determine which warehousing technologies to use, when to move data between data stores, and help productionize models.Requirements 7+ years of experience with data infrastructure / distributed systems Strong knowledge of Python or Java programming languages Strong SQL background Experience with data technologies such as BigQuery, Snowflake, Spark, DBT, Dataflow, Apache Beam, Ray, Pubsub, Cloud Functions, EMR, S3, Glue, Kinesis Firehose, Lambda, etc. Experience with Docker and Kubernetes Comfortable thinking about infrastructure as code Experience with Terraform or similar tools Nice to haves Familiarity with BI tools Eg: Metabase, Tableau, Quicksight, etc. Familiarity with eCommerce platforms Eg: BigCommerce, Salesforce Commerce Cloud, Shopify, etc. Experience working with ML & generative AI data pipelines
We are at the intersection of e-commerce and payments. We collect a lot of data that is key to decision-making at Bolt.
Tech Stack GCP data pipelines and warehousing (PubSub, DataFlow, BigQuery) along with some AWS data pipelines DBT, DataFlow, PySpark and BigQuery for data processing Terraform for maintaining infra Jenkins & CircleCI for build pipelines Postgres RDS is our application database Our code base is primarily in Golang, Python & Typescript