Spark

Data Engineer | AWS, Python & Snowflake | Ridgefield, CT (Hybrid) | $140K–$185K

🧠 Data Engineer

📍 Location: Ridgefield, Connecticut (Hybrid – 2–3 days onsite per week)
💼 Openings: 2
🏢 Industry: Information Technology / Life Sciences
🎓 Education: Bachelor’s degree in Computer Science, MIS, or related field (Master’s preferred)
🚫 Visa Sponsorship: Not available
🚚 Relocation: Available for the ideal candidate
💰 Compensation: $140,000 – $185,000 base salary + full benefits
🕓 Employment Type: Full-Time | Permanent

🌟 The Opportunity

Step into the future with a global leader in healthcare innovation — where Data and AI drive transformation and impact millions of lives.

As part of the Enterprise Data, AI & Platforms (EDP) team, you’ll join a high-performing group that’s building scalable, cloud-based data ecosystems and shaping the company’s data-driven future.

This role is ideal for a hands-on Data Engineer who thrives on designing, optimizing, and maintaining robust data pipelines in the cloud, while collaborating closely with architects, scientists, and business stakeholders across the enterprise.

🧭 Key Responsibilities

  • Design, develop, and maintain scalable ETL/ELT data pipelines and integration frameworks to enable advanced analytics and AI use cases.

  • Collaborate with data architects, modelers, and data scientists to evolve the company’s cloud-based data architecture strategy (data lakes, warehouses, streaming analytics).

  • Optimize and manage data storage solutions (e.g., S3, Snowflake, Redshift), ensuring data quality, integrity, and security.

  • Implement data validation, monitoring, and troubleshooting processes to ensure high system reliability.

  • Work cross-functionally with IT and business teams to understand data requirements and translate them into scalable solutions.

  • Document architecture, workflows, and best practices to support transparency and continuous improvement.

  • Stay current with emerging data engineering technologies, tools, and methodologies, contributing to innovation across the organization.

🧠 Core Requirements

Technical Skills

Hands-on experience with AWS data services such as Glue, Lambda, Athena, Step Functions, and Lake Formation.
✅ Strong proficiency in Python and SQL for data manipulation and pipeline development.
✅ Experience in data warehousing and modeling (dimensional modeling, Kimball methodology).
✅ Familiarity with DevOps and CI/CD practices for data solutions.
✅ Experience integrating data between applications, data warehouses, and data lakes.
✅ Understanding of data governance, metadata management, and data quality principles.

Cloud & Platform Experience

  • Expertise in AWS, Azure, or Google Cloud Platform (GCP) – AWS preferred.

  • Knowledge of ETL/ELT tools such as Apache Airflow, dbt, Azure Data Factory, or AWS Glue.

  • Experience with Snowflake, PostgreSQL, MongoDB, or other modern database systems.

Education & Experience

🎓 Bachelor’s degree in Computer Science, MIS, or related field
💼 5–7 years of professional experience in data engineering or data platform development
⭐ AWS Solutions Architect certification is a plus

🚀 Preferred Skills & Attributes

  • Deep knowledge of big data technologies (Spark, Hadoop, Flink) is a strong plus.

  • Proven experience troubleshooting and optimizing complex data pipelines.

  • Strong problem-solving skills and analytical mindset.

  • Excellent communication skills for collaboration across technical and non-technical teams.

  • Passion for continuous learning and data innovation.

💰 Compensation & Benefits

💵 Base Salary: $140,000 – $185,000 (commensurate with experience)
🎯 Bonus: Role-based variable incentive
💎 Benefits Include:

  • Comprehensive health, dental, and vision coverage

  • Paid vacation and holidays

  • 401(k) retirement plan

  • Wellness and family support programs

  • Flexible hybrid work environment

🧩 Candidate Snapshot

  • Experience: 5–7 years in data engineering or related field

  • Key Skills: AWS Glue | Python | SQL | ETL | CI/CD | Snowflake | Data Modeling | Cloud Architecture

  • Seniority Level: Mid–Senior

  • Work Arrangement: 2–3 days onsite in Ridgefield, CT

  • Travel: Occasional

🚀 Ready to power the future of data-driven healthcare?
Join a global data and AI team committed to harnessing the power of cloud and analytics to drive discovery, innovation, and meaningful impact worldwide.

Data Engineer | Azure, Databricks, Python, SQL, Spark | Hybrid – Netherlands (€3,500–€5,000/month)

Data Engineer

📍 Location: Eindhoven area or Randstad, Netherlands (Hybrid – 3 office days / 2 home days)
💼 Employment Type: Full-time
💵 Salary: €3,500 – €5,000 per month (€45,360 – €64,800 annually)
🎯 Experience Level: Mid-level | 2–3 years’ experience

About the Role

Do you love working with data — from digging into sources and writing clean ingestion scripts to ensuring a seamless flow into a data lake? As a Data Engineer, you’ll design and optimize data pipelines that transform raw information into reliable, high-quality datasets for enterprise clients.

You’ll work with state-of-the-art technologies in the cloud (Azure, Databricks, Fabric) to build solutions that deliver business-critical value. In this role, data quality, stability, and monitoring are key — because the pipelines you create will be used in production environments.

Key Responsibilities

  • Develop data connectors and processing solutions using Python, SQL, and Spark.

  • Define validation tests within pipelines to guarantee data integrity.

  • Implement monitoring and alerting systems for early issue detection.

  • Take the lead in troubleshooting incidents to minimize user impact.

  • Collaborate with end users to validate and continuously improve solutions.

  • Work within an agile DevOps team to build, deploy, and optimize pipelines.

Requirements

  • 🎓 Bachelor’s or Master’s degree in Computer Science, Data Engineering, or related field.

  • 2–3 years of relevant experience in data ingestion and processing.

  • Strong knowledge of SQL, Python, and Spark.

  • Familiarity with container environments (e.g., Kubernetes).

  • Experience with Azure Data Factory, Databricks, or Fabric is a strong plus.

  • Experience with data model management and dashboarding (e.g., PowerBI) preferred.

  • Team player with strong communication skills in Dutch and English.

  • Familiarity with enterprise data platforms and data lakes is ideal.

What We Offer

  • 💶 Salary: €3,500 – €5,000 per month

  • 🌴 26 vacation days

  • 🚗 Lease car or mobility budget (€600)

  • 💻 Laptop & mobile phone

  • 💸 €115 monthly cost allowance

  • 🏦 50% employer contribution for health insurance

  • 📈 60% employer contribution for pension scheme

  • 🎯 Performance-based bonus

  • 📚 Training via in-house Academy (hard & soft skills)

  • 🏋️ Free use of on-site gym

  • 🌍 Hybrid work model (3 days in office, 2 days at home)

  • 🤝 Start with a 12-month contract, with option to move to indefinite after evaluation

Ideal Candidate

You are a hands-on data engineer who enjoys data wrangling and building robust pipelines. You take pride in seeing your code run smoothly in production and know how to troubleshoot quickly when issues arise. With strong technical skills in SQL, Python, and Spark, plus familiarity with cloud platforms like Azure, you’re ready to contribute to impactful enterprise projects.

👉 Ready to make data flow seamlessly and create business value? Apply now to join a passionate, innovation-driven team.

 

Senior Data Engineer - USA, Remote - $110,560 to $155,840

Senior Data Engineer

USA, Remote

$110,560 to $155,840

 

Job Description

You are a driven and motivated problem solver ready to pursue meaningful work. You strive to make an impact every day & not only at work, but in your personal life and community too. If that sounds like you, then you've landed in the right place.

The Data Science AI Factory team is committed to exploring new ways to use data and analytics to solve business problems.  The team utilizes a variety of data sources, with a strong focus on unstructured and semi-structured text using NLP to enhance outcomes related to claim, underwriting, operations and the customer experience. 

As a Sr. Data Engineer, you will be an established thought leader through close partnerships with expert resources to design, develop, and implement data assets for a wide range of new initiatives across multiple lines of business. The role involves heavy data exploration, proficiency with SQL and Python, knowledge of service-based deployments and APIs, and the ability to discover and learn quickly through collaboration.  There is a need to think analytically and outside of the box while questioning current processes and continuing to build on the individual’s business acumen.

There will be a combination of team collaboration and independent work efforts.  We seek candidates with strong quantitative background and excellent analytical and problem-solving skills. This position combines business and technical skills involving interaction with business customers, data science partners, internal and external data suppliers and information technology partners.

Responsibilities

  • Identify and validate internal and external data sources for availability and quality. Work with SME’s to describe and understand data lineage and suitability for a use case.

  • Create data assets and build data pipelines that align to modern software development principles for further analytical consumption. Perform data analysis to ensure quality of data assets.

  • Create summary statistics/reports from data warehouses, marts, and operational data stores.

  • Extract data from source systems, and data warehouses, and deliver in a pre-defined format using standard database query and parsing tools.

  • Understand ways to link or compare information already in our systems with new information.

  • Perform preliminary exploratory analysis to evaluate nulls, duplicates and other issues with data sources.

  • Work with data scientists and knowledge engineers to understand the requirements and propose and identify data sources and alternatives.

  • Produce code artifacts and documentation using Github for reproducible results and hand-off to other data science teams.

  • Propose ways to improve and standardize processes to enable new data and capability assessment and to enable pivoting to new projects.

  • Understand data classification and adhere to the information protection and privacy restrictions on data.

  • Collaborate closely with data scientists, business partners, data suppliers, and IT resources.

Experience & Skills

Candidates must have the technical skills to transform, manipulate and store data, the analytical skills to relate the data to the business processes that generates it, and the communication skills to document & disseminate information regarding the availability, quality, and other characteristics of the data to a diverse audience. These varied skills may be demonstrated through the following:

  • Bachelor’s degree or equivalent experience in a related quantitative field

  • 5 + years experience accessing and retrieving data from disparate large data sources, by creating and tuning SQL queries. Understanding of data modeling concepts, data warehousing tools and databases (e.g. Oracle, AWS, Snowflake, Spark/PySpark, ETL, Big Data, and Hive) 

  • Demonstrated ability to create and deliver high quality Python code using software engineering best practices. Experience with object-oriented programming and software development a plus. Proficiency with Github and Linux highly desired.

  • Ability to analyze data sources and provide technical solutions. Strong exploratory and problem-solving skills to check for data quality issues.

  • Determine business recommendations and translate into actionable steps 

  • Self-starter with curiosity and a willingness to become a data expert

  • Demonstrate a passion to both learn new skills and lead discovery of the data research 

  • Results oriented with the ability to multi-task and adjust priorities when necessary 

  • Ability to work both independently and in a team environment with internal customers 

  • Ability to articulate and train technical concepts regarding data to both data scientists and partners

Learn more

Azure Data Engineer - Irving, TX Full-Time, Permanent - $110,000 - $120,000

Azure Data Engineer
Irving, TX
Full-Time, Permanent
$110,000 - $120,000

Required Skills:

                                  

  • Experience in GCP/Azure, Strong Data modelling, Python, Experience with RDBMS, Big Data processing frameworks and tools (Cloudera, Sqoop, Hive, Impala, Spark), DevOps tools and techniques (e.g. continuous integration, Jenkins, Puppet, etc)

                                                        

Preferred Skills:                                     

  • Experience building/migrating data pipeline from on-prem to Cloud (GCP or any cloud)

  • Understanding of cloud technologies

  • Unix Scripting

  • Tableau and Excel tool expertise

                                                     

Job description:                                     

  • Build data pipelines to ingest data from On-prem to cloud

  • Experience with Big Data processing frameworks and tools (Cloudera, Sqoop, Hive, Impala, Spark)

  • Experience with DevOps tools and techniques (e.g continuous integration, Jenkins, Puppet, etc)

  • Experience software development on a team using Agile methodology

  • Build data standardization & transformation logic using framework following Object Oriented Programming concept

  • Write Unit Test scripts

  • Implement standardized error handling & diagnostic logging

  • Schedule and maintain production workflows on-prem as well as cloud

  • Troubleshoot and resolve QA and Production defects

  • Handle code review and code deployment

Learn more