Data Engineer II

University of Wisconsin

Madison, WI

ID: 7252640
Posted: 3 months ago
Application Deadline: Open Until Filled

Job Description

Job Summary:
The Office of Informatics and Information Technology is looking for an outstanding candidate for a Data Engineer II position to contribute to the innovative informatics efforts of the School of Medicine and Public Health's Informatics group. Reporting to the Director of Research Data Services, the Data Engineer II will be responsible for supporting the clinical informatics needs of the Department of Anesthesiology. As part of the informatics team, the role of this data engineer will also be responsible for data quality assurance and quality control. Additionally, the incumbent will work to develop queries and data pipelines that translate research objectives into datasets. The successful candidate will have a solid work ethic and bring an enthusiastic and professional attitude to the workplace.

Responsibilities:
Contributes to a research agenda set by a lead researcher by creating automated processes for preparing and analyzing data at scale. Plays a leadership role and may lead a team and/or personnel.
30% Prepares data sets for current and future analysis including cleaning/quality assurance, transformations, restructuring, and integration of multiple data sources and may use technologies that support data at scale
15% Implements data analysis steps in collaboration with data scientists, statisticians, and/or other researchers and may use technologies that support data at scale
15% Organizes both data preparation and analysis steps into reproducible pipelines that can process similar data sets automatically
5% Selects appropriate technologies and optimizes pipelines for performance
20% Develops, constructs, tests, and maintains architectures for large-scale data management and analysis
15% Serves as an institutional subject matter expert and liaison to key internal and external stakeholders regarding automated data management and analysis at scale for research and represents the interests of large-scale data management and analysis for research