Are you interested in building hyper-scale distributed systems?
At the Web Data Platform Team, we build the global web scale Index and the platform that supports it.
Today we crawl and store tens of Billions URLs/day and efficient usage of our resources and crawled content is one of the top priorities for us. We are looking for a Software Engineer II to help us scale our web index beyond what we have done so far and build the Next Gen Unified Schedulers. Our efforts should help achieve efficient resource usage along with better balance with discovering the latest pages on the web, maintaining freshness of documents in the Index, while avoiding inundating the web servers with crawl requests. It is a distributed platform scalable with Machine Learning (ML) models aiding effective resource usage. With the advent of Large Language Models (LLMs), web scale data has become critical for training needs, in addition to serving use cases.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.