The DNA of Healthcare

Part 2: What is Anonymized Data in Healthcare?

LN HC Data Series Blog v2
Senior Director of Healthcare Strategy for Real-World Data
Director of Data Science at Data Prime

In the first part of our series, we took some time to explain de-identified healthcare data and where both Safe Harbor and Expert Determination fit into the mix. In this post, we’ll jump into what anonymized data in healthcare means.

In general terms, anonymized means “to block or eliminate identifying information from (test results, data, authorship, etc.), especially for statistical analysis, product appraisal, or personal privacy; to make anonymous.”

Anonymized data never contains identifiable information, and it is not possible to be re-identified. It can be seen as a sub-category of de-identification. Unlike de-identified healthcare data, this data cannot be linked back to an individual even with a code, algorithm, or pseudonym.

All anonymized data is de-identified, but not all de-identified health data is anonymized (e.g. tokenized health data).

Anonymized data is similar to de-identified health data, but more context associated with categorical identifiers are stripped away. Anonymization could be leveraged to create an aggregated group of data.

In our example of, David Jr.* we can de-identify his data to find insights into other patients with depression – however all identifiable context regarding demographic information is completely removed or obfuscated. The data set can now only be filtered via simplistic parameters like confirmed positive diagnosis for depression and it cannot be linked back to the patient level due to the lack of identifiers.

Anonymized health data could be used with:

  • minors,
  • sexual assault victims,
  • studies using data from federal sources where a subset of data is provided,
  • analysis of aggregated data,
  • research related to hot spots of socioeconomic barriers that correlate with hot spot (or lack thereof) diseases,
  • examining a geographic region for strategic planning or research,
  • and hospital satisfaction surveys where secondary analysis is performed, etc.

Anonymization offers a whole different level of de-identification than compared to tokenization. You’ll find more on tokenized healthcare data in the final part of blog series.

Read More

*Please note, all names, diagnoses, and other information included in this infographic are fictitious. They are included for illustrative purposes and do not identify any actual persons (living or deceased).

The DNA of Healthcare

At LexisNexis Risk Solutions, our goal is to provide the healthcare industry with insights and innovations to improve outcomes, grow market share, reduce fraud and increase compliance.

Related Articles

These blogs are published for information purposes only and can be statements of opinion. Although we LexisNexis rigorously check the accuracy of all information at the time of publishing the blogs, no representations or warranties are expressed or implied as to the blog, its contents and any accompanying materials and it should not be relied upon for acting in specific circumstances. Although links to external websites on any blog posts are tested and deemed accurate at the time of the blog posting, we LexisNexis accept no liability for such links to external websites and do not endorse or warrant in any way any materials available through such links or any privacy or other practices of such sites. In addition to this blog disclaimer, access and use of the blogs is governed by the LexisNexis website.