Reporting to the Sr. Director of Data Engineering, the Data Engineer will play a critical role in developing and maintaining Lumiata’s core data science infrastructure. You will help develop automated pipelines for training and deploying machine learning models and build high-performance systems for understanding complex patient data.

In the process, you will learn in detail how medical information flows from the patient all the way through to meaningful insights, along with the many intricacies involved along the way, including but not limited to:

  • How AI and machine learning can transform healthcare.
  • How medical information is stored and communicated between different actors in the healthcare system.
  • What modern, open standards have been developed to better communicate and represent medical data.
  • Which standards govern the handling of sensitive healthcare data, including HIPAA, SOC 2, and HITRUST, and how to ensure compliance with them.
  • How to apply TensorFlow, Apache Spark, and other cutting-edge tools in a production environment.

We are a diverse, international, creative team. Join us in building a better medical system for everyone by tackling looming health problems with large-scale machine learning.

Key Responsibilities

  • Research and design of algorithms to solve problems using medical & healthcare data.
  • Building and developing pipelines to:
      • Handle large volumes of heterogeneous medical data.
      • Train, update, version, and maintain production machine learning systems.
      • Expose multilayer machine learning systems to customers while maintaining first-class privacy and security standards.
      • Extract and expose external public data sources to enhance medical machine learning models.

Qualifications

  • Strong developer skills building Python infrastructure and applications.
  • Solid working experience building applications in Scala or a similar language.
  • Working experience with Hadoop, Apache Spark, and distributed databases preferred.
  • Experience developing products with distributed computing and big data technologies is a huge plus.
  • Experience with high-concurrency platforms, graph databases, or applied mathematics is a plus.
  • B.S. or M.S. degree in Computer Science or a related field.

Benefits

  • Competitive salary and equity
  • Lunch provided daily
  • Open vacation policy
  • 95% company-paid health benefits
  • 401(k)
  • Company-sponsored happy hours, parties, and offsite events

About Us

Lumiata is a venture-backed company based in San Mateo, California inventing predictive analytics for healthcare leveraging the power of Artificial Intelligence (AI). Our flagship Data-as-a-Service (DaaS) product suite, the Lumiata Matrix Suite, is an enterprise-grade, individual-to-population predictive analytics solution powered by AI. It enables the hyper-segmentation of vast amounts of patient data to predict which patient cohorts may be at highest risk for disease progression. Action-ready insights are returned to enterprise healthcare data analysts to enable more effective early-intervention strategies that lead to improved revenues, better patient outcomes and lower overall costs of healthcare delivery.
