
Data Engineer

HealthStream, Inc
Nashville, TN 37201

The Knowledge Management (KM) Capability sits at the intersection of enterprise platform service design, semantic data engineering, and taxonomy governance for the organization. The team operates in a multi-faceted role, which includes performing complex data analysis to improve knowledge representation and consumption of unstructured data in systems, developing data transformation pipelines that join multiple data sets in the HealthStream knowledge graph, and collaborating across the organization to help define the customer experience for knowledge discovery and access on the HealthStream platform. Members of the KM Capability own the end-to-end development of knowledge graph features for HealthStream recommendation engines, data, and content domains, using both industry standards and proprietary technical tools. This is a non-management data engineering position with some data workflow design and programming responsibilities. The Data Engineer may serve as a member or development lead of a software development effort.


As a Data Engineer working with the KM Capability team, you will contribute to projects focused on the design, development, and support of data pipelines. You will implement, test, deploy, monitor, and maintain the delivery and analysis of data using a systematic method that supports a variety of platform-based services, including recommendation engines. Primary responsibilities include:

  • Oversee HealthStream's data extraction, translation, and integration requirements for knowledge management and semantic data services
  • Support current data orchestration processes, including data ingestion, translation, and storage, that support semantic structures, such as ontologies and knowledge graphs
  • Enhance current and establish new ETL processing rules and data workflows to support and grow current infrastructure
  • Design new and maintain current data services that use statistical and data analytical processes to calculate recommendations
  • Conduct data modeling, schema design, and development for varying data structures including SQL and RDF triple stores
  • Ingest and aggregate data from both internal and external data sources to build out world-class datasets and knowledge graphs
  • Develop and lead the testing and fixing of new or enhanced solutions for deploying taxonomy models and enabling integration with consuming applications
  • Work with the KM Capability team and stakeholders to define data collection and engineering frameworks

Job Specifications


  • Bachelor's degree in Computer Science, Information Science, Data Science, or a related field and 3 years of relevant experience, or a master's degree and 2 years of relevant experience.
  • At least one year of experience developing data processes using AWS a plus.

  • Strong coding skills in scripting and query languages such as Python or SPARQL, leveraged for large-scale data manipulation and data analytics tasks, highly desired
  • Experience working with NoSQL technologies including graph databases, SPARQL/XQuery, and other related technologies
  • Knowledge of and experience working with ontological data structures and formats like RDF, OWL, and Turtle highly desired
  • Background in data analysis or data science highly desired
  • Experience with multiple data structures and tools (relational databases, graph databases, document stores, search indexes, etc.) and multiple data formats such as CSV, XML, and JSON
  • Expertise in text analytics and data analysis specifically related to unstructured data
  • REST web service design
  • Authentication with OAuth 2.0 and/or OIDC
  • Ability to work with AWS services, including Lambda, API Gateway, SNS, SQS, S3, DynamoDB, Cognito, and CloudWatch, a plus
  • Knowledge of and experience with ETL tools like UnifiedViews, Apache Airflow, and AWS Glue a plus
  • Desired development practices include SOLID development principles, code refactoring, object-oriented design patterns, unit testing, and software security (e.g., mitigating OWASP Top Ten risks); experience working with Terraform or Docker a plus

  • Meticulous attention to detail
  • Process compliance
  • Ingenuity
  • Tolerance for change
  • Passion for learning
  • A self-starter with the ability to work independently or collaboratively as a member of a diverse team
  • Strong communication skills with the ability to express complex concepts to stakeholders with varying backgrounds

HealthStream is an equal opportunity employer. HealthStream prohibits employment practices that discriminate against individual employees or groups of employees on the basis of age, color, disability, national origin, race, religion, sex, sexual orientation, pregnancy, veteran or military status, genetic information or any other category deemed protected by state and/or federal law.

Posted: 2021-09-12 Expires: 2021-10-11