Menu

Senior Big Data Engineer - JR0024290_IL

at Yahoo! Inc. in Champaign, Illinois, United States

Job Description

Yahoo Mail is the ultimate consumer inbox with hundreds of millions of users. It's the best way to access your email and stay organized from a computer, phone or tablet. With its beautiful design and lightning fast speed, Yahoo Mail makes reading, organizing, and sending emails easier than ever.

A Little About Us

The Yahoo Mail engineering team develops solutions powering our mail brands, including a next-generation infrastructure that we are 100% moving to a native public cloud architecture. The Mail Intelligence AI/ML platform is responsible for building intelligent, smart capabilities at scale to discover interests, reveal habits, and deeply personalize user journeys for Yahoo Mail and across the entire Yahoo's ecosystem.

We are looking for innovative, entrepreneurial, and passionate engineers. We are engineers who strive to deliver to our users only the absolute best and are willing to meticulously refine the details to achieve this goal. While Engineering is a core puzzle piece, we believe that your passion and owner mindset is as crucial as the high engineering standards, code quality and world-class architectural skills that we expect from our engineering teams.

We process billions of mail messages using cutting edge algorithms in areas including but are not limited to: Natural language processing, GenAI, Large Language Models, Machine Learning techniques, big data processing in order of petabytes to: Extract information, build mail content and user knowledge, and interconnect different sources to identify, highlight and amplify what matters.

Our work spans many technical challenges highly rewarding and fulfilling to high-caliber engineers hungry for impactful problem statements.

You will build tools and workflows to make it easier to manage and act on this vast information. You will also be working on AI-based data infrastructure, supporting new functionalities on existing platforms, and mining data for analytics insights and product features.

Our Hadoop clusters are among the largest few in the world, at double-digit petabyte scale. Developing this infrastructure presents many technical challenges in the areas of efficient query processing, large-scale data processing, machine learning and modeling, as well as satisfying complex business rules.

If you are someone who is passionate about harnessing data at insane scale, enjoys working with new technologies, setting up petabyte data infrastructures and implementing new machine learning solutions and metrics systems, we want to hear from you!

Your Day:

  • You will research and develop innovative algorithms for information retrieval, processing and ranking.
  • Take end to end ownership of Machine Learning-based distributed data systems - especially focused on data pipelines for data collection, validation and active learning and batch inference.
  • Work with other engineers to implement algorithms and systems in an efficient way
  • Interact with data analysts, data scientists, product managers, and software engineers to understand business problems, technical requirements to deliver data solutions
  • Lead data investigations to troubleshoot data issues that arise along the data pipelines
  • Maintenance and improvement of released systems
  • Engineering consulting on large and complex warehouse data



Qualifications:

  • BS with 7+ years of relevant Industry experience/M.S. in Computer Science with 5+ years of relevant Industry experience. Computer Science graduate ideally with specialization in Data Engineering or Machine Learning
  • Experience in Hadoop technologies (Map/Reduce,...

    Equal Opportunity Employer - minorities/females/veterans/individuals with disabilities/sexual orientation/gender identity

Copy Link

Job Posting: 11984708

Posted On: Jun 17, 2024

Updated On: Jul 17, 2024

Please Wait ...