Senior Big Data Engineer
US - United States of America
Yahoo
Yahoo is a global media and tech company connecting people to their passions. We reach almost a billion people worldwide, bringing them closer to what they love.A Little About Us
The Yahoo Mail engineering team develops solutions powering our mail brands, including a next-generation infrastructure that we are 100% moving to a native public cloud architecture. The Mail Intelligence AI/ML platform is responsible for building intelligent, smart capabilities at scale to discover interests, reveal habits, and deeply personalize user journeys for Yahoo Mail and across the entire Yahoo’s ecosystem.
We are looking for innovative, entrepreneurial, and passionate engineers. We are engineers who strive to deliver to our users only the absolute best and are willing to meticulously refine the details to achieve this goal. While Engineering is a core puzzle piece, we believe that your passion and owner mindset is as crucial as the high engineering standards, code quality and world-class architectural skills that we expect from our engineering teams.
We process billions of mail messages using cutting edge algorithms in areas including but are not limited to: Natural language processing, GenAI, Large Language Models, Machine Learning techniques, big data processing in order of petabytes to: Extract information, build mail content and user knowledge, and interconnect different sources to identify, highlight and amplify what matters.
Our work spans many technical challenges highly rewarding and fulfilling to high-caliber engineers hungry for impactful problem statements.
You will build tools and workflows to make it easier to manage and act on this vast information. You will also be working on AI-based data infrastructure, supporting new functionalities on existing platforms, and mining data for analytics insights and product features.
Our Hadoop clusters are among the largest few in the world, at double-digit petabyte scale. Developing this infrastructure presents many technical challenges in the areas of efficient query processing, large-scale data processing, machine learning and modeling, as well as satisfying complex business rules.
If you are someone who is passionate about harnessing data at insane scale, enjoys working with new technologies, setting up petabyte data infrastructures and implementing new machine learning solutions and metrics systems, we want to hear from you!
Your Day:
You will research and develop innovative algorithms for information retrieval, processing and ranking.
Take end to end ownership of Machine Learning-based distributed data systems - especially focused on data pipelines for data collection, validation and active learning and batch inference.
Work with other engineers to implement algorithms and systems in an efficient way
Interact with data analysts, data scientists, product managers, and software engineers to understand business problems, technical requirements to deliver data solutions
Lead data investigations to troubleshoot data issues that arise along the data pipelines
Maintenance and improvement of released systems
Engineering consulting on large and complex warehouse data
Qualifications:
BS with 7+ years of relevant Industry experience/M.S. in Computer Science with 5+ years of relevant Industry experience. Computer Science graduate ideally with specialization in Data Engineering or Machine Learning
Experience in Hadoop technologies (Map/Reduce, Oozie, Pig, Hive, Spark, Kafka, HBase, Storm,).
Strong fundamentals: algorithms, distributed computing, data structure, database
Fluency with at least one of:Java/Python/C++
Self-driven, challenge-loving, detail oriented, teamwork spirit, excellent communication skills, ability to multitask and manage expectations
Nice to have:
Experience in any of: machine learning, analytics, data mining, or data mart and warehouse
Experience with Deep Learning platforms (Tensorflow/Keras/Spark MLlib) and SQL/Unix/Shell
Experience with machine learning algorithms, NLP, and/or statistical methods a big plus
Yahoo is proud to be an equal opportunity workplace. All qualified applicants will receive consideration for employment without regard to, and will not be discriminated against based on age, race, gender, color, religion, national origin, sexual orientation, gender identity, veteran status, disability or any other protected category. Yahoo is dedicated to providing an accessible environment for all candidates during the application process and for employees during their employment. If you need accessibility assistance and/or a reasonable accommodation due to a disability, please submit a request via the Accommodation Request Form (www.yahooinc.com/careers/contact-us.html) or call 408-336-1409. Requests and calls received for non-disability related issues, such as following up on an application, will not receive a response.
At Yahoo, we know that diversity makes us stronger. We are committed to a collaborative, inclusive environment that encourages authenticity and fosters a sense of belonging. We strive for everyone to feel valued, connected, and empowered to reach their potential and contribute their best. Check out our diversity and inclusion (www.yahooinc.com/diversity/) page to learn more.
The compensation for this position ranges from $128,250.00 - $266,875.00/yr and will vary depending on factors such as your location, skills and experience. The compensation package may also include incentive compensation opportunities in the form of discretionary annual bonus or commissions, in addition to equity incentives. Yahoo provides industry-leading benefits including healthcare, 401K savings plan, company holidays, vacation, sick time, parental leave and an employee assistance program. Eligibility requirements apply.Yahoo has a high degree of flexibility around employee location and hybrid working. In fact, our flexible-hybrid approach to work is one of the things our employees rave about. Most roles don’t require specific regular patterns of in-person office attendance. If you join Yahoo, you may be asked to attend (or travel to attend) on-site work sessions, team-building, or other in-person events. When these occur, you’ll be given notice to make arrangements.
If you’re curious about how this factors into this role, please discuss with the recruiter.
Currently work for Yahoo? Please apply on our internal career site.
Tags: Architecture Big Data Computer Science Consulting Data Mining Data pipelines Deep Learning Engineering Generative AI Hadoop HBase Java Kafka Keras LLMs Machine Learning NLP Oozie Pipelines Python Research Spark SQL Statistics TensorFlow
Perks/benefits: 401(k) matching Career development Equity / stock options Flex hours Flex vacation Parental leave Salary bonus Team events
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Business Intelligence Engineer jobs
- Open Data Engineer II jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Science Intern jobs
- Open Junior Data Scientist jobs
- Open Lead Data Analyst jobs
- Open Business Intelligence Developer jobs
- Open Data Scientist II jobs
- Open Data Science Manager jobs
- Open Business Data Analyst jobs
- Open Sr. Data Scientist jobs
- Open Marketing Data Analyst jobs
- Open Principal Data Scientist jobs
- Open Research Scientist jobs
- Open Data Analytics Engineer jobs
- Open Sr Data Engineer jobs
- Open MLOps Engineer jobs
- Open Data Analyst Intern jobs
- Open Azure Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Product Data Analyst jobs
- Open Big Data Engineer jobs
- Open Data Engineering Manager jobs
- Open Junior Data Engineer jobs
- Open ETL Developer jobs
- Open GCP-related jobs
- Open Data quality-related jobs
- Open Java-related jobs
- Open ML models-related jobs
- Open Business Intelligence-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open PhD-related jobs
- Open Data visualization-related jobs
- Open Deep Learning-related jobs
- Open NLP-related jobs
- Open PyTorch-related jobs
- Open Finance-related jobs
- Open TensorFlow-related jobs
- Open APIs-related jobs
- Open LLMs-related jobs
- Open Generative AI-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open Hadoop-related jobs
- Open CI/CD-related jobs
- Open Data governance-related jobs
- Open Kubernetes-related jobs
- Open Airflow-related jobs
- Open Data warehouse-related jobs