Kelsey Florek, PhD, MPH
Senior Genomics and Data Scientist
Wisconsin State Laboratory of Hygiene
March 24, 2025
"A data management function to ensure the quality, integrity, security, and usability of the data collected by an organization"
"The purpose of data governance is to build trust in data"
Data Inventory and Cataloging: identify and list all data sources, capturing details including data type, location, owner, access list/tags, and usage
Metadata: data that provides information about other data
Data Owners and Stewards: who is responsible for the management of the data, who uses the data, who ensures compliance
Data Provenance: a comprehensive historical record that details the data's origins, modifications, and usage. Can you answer questions like:
Data Audits: conduct review of data to ensure accuracy, consistency, and security
There are a numerous data formats and many incompatibilities
Is genomic data PHI?
"We have shown that it is possible to infer a genotypic barcode specific to an individual on the basis of RNA profiling data from that individual."
"To determine whether a given subject was enrolled in the HCC studies, we would only need to genotype roughly 1,000 SNP markers and apply the methods described herein."
"it was observed that sequencing data may often need reanalysis... new resource instances can easily be linked to existing ones thanks to FHIR's JSON/XML-based architecture and RESTful application program interface (API)
Secondary Use of Genomic Data: uses of data that are directed to purposes other than patient care
Secondary Use of Genomic Data: uses of data that are directed to purposes other than patient care
Responsibilities: ensure data quality and security, develop guidelines for classification, ensure compliance, define access and sharing protocols, promote integration and interoperability, lifecycle management, risk management.
Functions: establish policies and frameworks, training and awareness, audits
Best Practices: define clear ownership, strong leadership support, interdepartmental collaboration, continuous improvement
Challenges: cultural resistance, changing compliance requirements, balancing access and privacy, data complexity
Small organizations/teams -> fewer people interacting with less data -> simplified access strategies
Larger organizations/teams -> more people interacting with larger data -> complicated access strategies
Size alone is not responsible for complexity -> research institutions are complex web of smaller departments and labs
AWS Cloudtrail / Google Cloud Audit Logs / Azure Audit Logs