Senior Data Engineer
Auckland, Auckland, New Zealand
Groov
Groov helps your people feel and function better, every day, which leads to improved performance. We combine the best of behavioral science, data science, and user experience to deliver the right experience to the right employee at the...Groov is a company based in Auckland, New Zealand, on a mission to help millions of people feel and perform better at work. We do this by combining the best of psychology and behavioural science, data science, and user experience to deliver the right experience at the right time in the right way to assist employees, managers and leaders.
Groov’s technology lifts engagement and performance by prompting people to take appropriate action to care for themselves and others based on their roles, activities and personalities and is integrated into their flow of work. It also provides real-time insights to help managers understand individual and team engagement, focus and burnout risk, and provide them with recommendations on why, when and how to support individuals and teams.
Groov is looking for an experienced (ideally a Senior) Data Engineer to join our Data Science team. As a data engineer at Groov, you will be responsible for developing, maintaining, and optimizing our data warehouse, data pipeline, and data products. The data engineer will support multiple stakeholders, including data scientists, psychologists and scientists, and software developers, to ensure an optimal data architecture and delivery. The ideal candidate should possess strong technical abilities to solve complex problems with data, a willingness to learn new technologies and tools if necessary, and be comfortable supporting the data needs of multiple teams, stakeholders, and products. The position could be partially or fully remote.
Responsibilities
- Design, build and maintain batch and real-time data pipelines in production.
- Maintain and optimize the data infrastructure required for accurate extraction, transformation, and loading (ETL) of data from a wide variety of data sources.
- Develop ETL processes to extract and manipulate data from multiple sources.
- Automate data workflows such as data ingestion, aggregation, and ETL processing.
- Prepare raw data in Data Warehouses into consumable datasets for both technical and non-technical stakeholders.
- Partner with data scientists and functional leaders in sales, customer service, and product to deploy machine learning models in production.
- Build, maintain, and deploy data products for analytics and data science teams on cloud platforms (especially AWS).
- Ensure data accuracy, integrity, privacy, security, and compliance through quality control procedures to meet ISO and GDPR standards.
- Monitor data systems performance and implement optimization strategies.
- Leverage data controls to maintain data privacy, security, compliance, and quality for allocated areas of ownership.
Experience
- Bachelor's degree in Computer Science, Information Systems, or a related field.
- Five+ years of relevant working experience.
Qualifications
Minimum Qualifications
- Advanced SQL skills and experience with relational databases and database design.
- Experience working with cloud Data Warehouse solutions (e.g., Snowflake, Redshift, BigQuery, Azure, etc.).
- Experience working with data ingestion tools such as Airbyte, Fivetran, Stitch, or Matillion.
- Working knowledge of Cloud-based solutions (e.g. AWS, Azure, GCP).
- Experience building and deploying machine learning models in production.
- Strong proficiency in one or more programming languages such as Python, PySpark, .Net, Node.js, Java, C++, Scala, and also scripting languages like Bash.
- Demonstrated ability in deploying machine learning models in production and ML Ops practices.
- Strong proficiency in data pipeline and workflow management tools (e.g., Airflow, Azkaban).
- Strong project management and organizational skills.
- Excellent problem-solving, communication, and organizational skills.
- Proven ability to work independently and collaboratively in a fast-paced environment.
Additional Desirable Qualifications
- Experience with working on large data sets, distributed computing (e.g. Hive, Hadoop, Spark, Presto, MapReduce) and Machine Learning operations
- Experience with using software such as R or Python with Pandas, NumPy, etc. for machine learning and statistical analysis
- Experience with using Large Language Models such as evaluating, optimizing and fine-tuning them for Retrieval Augmented Generation use.
- Location in Auckland preferred; remote is possible
Why Join Groov?
Join a team transforming the workplace into a space of health, creativity, and peak performance. At Groov, you'll be part of a small, dynamic team that uses cutting-edge technologies and innovation to solve real-world problems. We value innovation, collaboration, and well-being, offering competitive salaries, comprehensive benefits, and a dynamic work environment. Our work culture is supportive, entrepreneurial, and pragmatic, fostering a space where you can grow, innovate, and make a significant impact
You must be eligible to work in New Zealand (with a valid working Visa, Residency or Citizenship) to be successful in this application.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow Architecture AWS Azkaban Azure BigQuery Computer Science Data pipelines Data warehouse ETL FiveTran GCP Hadoop Java LLMs Machine Learning Matillion ML models Node.js NumPy Pandas Pipelines Privacy Prompt engineering PySpark Python R RDBMS Redshift Scala Security Snowflake Spark SQL Statistics
Perks/benefits: Career development Health care Team events
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Business Intelligence Engineer jobs
- Open Lead Data Analyst jobs
- Open Power BI Developer jobs
- Open Data Engineer II jobs
- Open Senior Business Intelligence Analyst jobs
- Open Marketing Data Analyst jobs
- Open Data Science Manager jobs
- Open MLOps Engineer jobs
- Open Junior Data Scientist jobs
- Open Business Intelligence Developer jobs
- Open Business Data Analyst jobs
- Open Data Scientist II jobs
- Open Product Data Analyst jobs
- Open Data Analytics Engineer jobs
- Open Data Analyst Intern jobs
- Open Sr Data Engineer jobs
- Open Principal Data Scientist jobs
- Open Sr. Data Scientist jobs
- Open Senior Data Architect jobs
- Open Data Engineering Manager jobs
- Open Junior Data Engineer jobs
- Open Big Data Engineer jobs
- Open Research Scientist jobs
- Open Data Quality Analyst jobs
- Open Azure Data Engineer jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open Data quality-related jobs
- Open ML models-related jobs
- Open Business Intelligence-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open PhD-related jobs
- Open Data visualization-related jobs
- Open Deep Learning-related jobs
- Open Finance-related jobs
- Open NLP-related jobs
- Open PyTorch-related jobs
- Open TensorFlow-related jobs
- Open LLMs-related jobs
- Open APIs-related jobs
- Open Generative AI-related jobs
- Open CI/CD-related jobs
- Open Snowflake-related jobs
- Open Consulting-related jobs
- Open Kubernetes-related jobs
- Open Hadoop-related jobs
- Open Data governance-related jobs
- Open Databricks-related jobs
- Open Airflow-related jobs