Job opening

Data Ingest Engineer

PurpleLab™ is seeking a highly motivated and experienced Data Ingest Engineer to join our passionate, cross-functional team and play a pivotal role in building the future of healthcare data! As our Data Ingest Engineer, you’ll be responsible for designing, developing, and maintaining complex data pipelines that bring critical data from PurpleLab’s vendor feeds into our infrastructure. Working in an agile environment, you’ll collaborate with stakeholders to ensure data quality, security, and seamless integration with downstream analytics and applications. This is an exciting opportunity to make a real difference in the healthcare industry by leveraging your data engineering skills to improve patient outcomes.

Duties & Responsibilities:

  • Build and maintain ETL processes using cutting-edge tools like Airflow, Spark, and Python. Design Directed Acyclic Graphs (DAGs) to orchestrate data flows and write custom operators for unique needs.
  • Implement robust data validation processes to ensure the accuracy, consistency, and completeness of ingested data. Analyze trends and patterns to proactively identify and address potential issues.
  • Work closely with product stakeholders, analysts, and engineers to understand data requirements, ensure alignment with business goals, and communicate updates effectively.
  • Monitor and fine-tune data ingestion and transformation processes for efficiency and scalability.
  • Actively research and evaluate emerging data technologies, tools, and frameworks to stay ahead of the curve and contribute to innovation.
  • Partner with diverse teams across the organization to foster a collaborative and inclusive work environment.
  • Actively share your expertise and continuously learn new skills to drive professional growth and team success.


Qualifications:

  • Bachelor’s degree in Computer Science, Engineering, or a related field
  • 5+ years of experience as a data engineer
  • Proficient in Python, SQL, Airflow, Spark, and Databricks
  • Experience with cloud-based data technologies like BigQuery and AWS
  • Familiarity with source code version control using Git and hosting platforms like Bitbucket
  • Strong analytical and problem-solving skills with a keen eye for detail
  • Excellent verbal and written communication skills
  • Ability to thrive in a fast-paced, agile environment
  • Passion for healthcare and making a positive impact
  • Knowledge of medical, pharmacy, provider, and other healthcare data
  • Experience with the Atlassian tool suite (Jira, Bitbucket, Confluence)

What We Offer:

  • Make a real difference: Contribute to meaningful work that shapes the future of healthcare and improves patient lives.
  • Join a dynamic team: Collaborate with passionate and talented individuals who share your dedication to data-driven solutions.
  • Competitive compensation and benefits: Enjoy a comprehensive benefits package and competitive salary that reflects your expertise.
  • Continuous learning and growth: Access ongoing training opportunities and exposure to cutting-edge technologies to fuel your professional development.

A background check is required for this role. 

The above is intended to describe the general content of and requirements for the performance of this job. It is not to be construed as an exhaustive statement of duties or responsibilities. Nothing in this job description restricts management’s right to assign or reassign duties and responsibilities to this job at any time. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.


PurpleLab™ is a healthtech company with a mission to spur value-driven innovation in healthcare to improve outcomes for patients. HealthNexus™, the company’s no-code analytics platform, empowers life sciences, payers, providers, and other stakeholders with real-world evidence to solve conventional and emerging challenges faster and more cost-effectively.

Complete the form below to apply.
