This role will be focused on building SaaS products, which help healthcare and life science companies make more informed scientific decisions through access to critical data. Here’s what we’re looking for:
Our team is building a suite of machine learning tools to help solve problems in the healthcare and life science space. This includes the classification of researchers and physicians to their scholarly research, simulating how effective drug compounds will be, and much more.
We're growing fast in a field that is also growing fast, so we're looking for people who want to grow fast too. We think an environment that is supportive, collaborative, and sophisticated is the key to making this happen.
Experience and Skills
Our data engineers do these kinds of things:
Pull, clean, augment and master data coming from a variety of public and private sources. This goes way beyond simple ETL - we’re preparing data sets that will be used to power machine learning algorithms, not just pulling data out of a database to display in a graph.
Above all, the data engineering we do at H1 is code-centric, not tool-centric. ETL tools like Talend, Informatica, and the like are all great, but our use case is much more about data analysis than it is about moving and joining data together. For this reason we’re looking for candidates who are comfortable in the Scala/Spark framework.