The Times Australia


The best way to protect personal biomedical data from hackers could be to treat the problem like a game

Written by: Zhiyu Wan, Postdoctoral Research Fellow in Biomedical Informatics, Vanderbilt University

The Research Brief[1] is a short take about interesting academic work.

The big idea

Game theory, which tries to predict how each player's behavior influences the choices the other players make, can help researchers find the best ways to share biomedical data while protecting the people who contribute the data from hackers trying to re-identify them.

Modern biomedical research, such as the National COVID Cohort Collaborative[2] and the Personal Genome Project[3], requires large amounts of data that are specific to individuals. Making detailed datasets publicly available without violating anyone’s privacy is a critical challenge for projects like these.

To do so, many programs that collect and disseminate genomic data obscure personal information in the data that could be exploited to re-identify subjects. Even so, it’s possible that residual data could be used to track down personal information from other sources, which could be correlated with the biomedical data to unearth subjects’ identities. For example, comparing someone’s DNA data with public genealogy databases like Ancestry.com can sometimes yield the person’s last name[4], which can be used along with demographic data to track down the person’s identity via online public record search engines like PeopleFinders.

Our research group, the Center for Genetic Privacy and Identity in Community Settings[5], has developed methods to help assess and mitigate privacy risks in biomedical data sharing. Our methods can be used to protect various types of data, such as personal demographics or genome sequences, from attacks on anonymity.

Our most recent work[6] uses a two-player leader-follower game to model the interactions between a data subject and a potentially malicious data user. In this model, the data subject moves first, deciding what data to share. The adversary moves next, deciding whether to attack based on the shared data.

Poorly protected genomic data attacked by someone with access to multiple data sources (red path) is the most at risk, while better-protected genomic data attacked by someone without access to other sources (blue path) is the least at risk. Vanderbilt University Medical Center, CC BY-ND[7]

Using game theory to assess approaches for sharing data involves scoring each strategy on both privacy and the value of the shared data. Strategies involve trade-offs between leaving out or obscuring parts of the data to protect identities and keeping the data as useful as possible.
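To make the leader-follower idea concrete, here is a minimal sketch in Python. It is an illustration only, not the model from the study: the attribute names, utility values, population size, attack cost and privacy loss are all made-up assumptions. The data subject (leader) scores every sharing strategy by the value of the shared data minus the expected privacy loss, knowing that the adversary (follower) will attack only when the expected gain exceeds the cost.

```python
from itertools import combinations

# Hypothetical attributes: (utility to researchers, factor by which sharing
# the attribute shrinks the group of people the subject could be confused with).
ATTRIBUTES = {
    "age_band": (2.0, 0.30),
    "zip3": (1.5, 0.25),
    "snp_panel": (4.0, 0.05),
}
POPULATION = 1000     # assumed size of the population the subject hides in
ATTACK_COST = 1.0     # assumed cost of one re-identification attempt
ATTACK_GAIN = 10.0    # assumed value to the adversary of a correct match
PRIVACY_LOSS = 20.0   # assumed harm to the subject if re-identified


def reid_probability(shared):
    """Chance of a correct match: 1 / expected size of the matching group."""
    group = float(POPULATION)
    for name in shared:
        group *= ATTRIBUTES[name][1]
    return min(1.0, 1.0 / max(group, 1.0))


def adversary_attacks(shared):
    """Follower's best response: attack only if expected gain exceeds cost."""
    return ATTACK_GAIN * reid_probability(shared) > ATTACK_COST


def subject_payoff(shared):
    """Leader's score: data utility minus expected privacy loss if attacked."""
    payoff = sum(ATTRIBUTES[name][0] for name in shared)
    if adversary_attacks(shared):
        payoff -= PRIVACY_LOSS * reid_probability(shared)
    return payoff


def best_strategy():
    """Exhaustively score every subset of attributes (fine for 3 attributes)."""
    names = sorted(ATTRIBUTES)
    strategies = [
        frozenset(combo)
        for size in range(len(names) + 1)
        for combo in combinations(names, size)
    ]
    return max(strategies, key=subject_payoff)


if __name__ == "__main__":
    choice = best_strategy()
    print(sorted(choice), subject_payoff(choice), adversary_attacks(choice))
```

With these made-up numbers, sharing everything invites an attack, while sharing the age band and the SNP panel but withholding the partial ZIP code keeps the attack unprofitable and scores highest for the data subject.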

The optimal strategy allows the data subject to share the most data with the least risk. Finding the optimal strategy is challenging, however, because genome sequencing data has many dimensions, which makes it impractical to exhaustively search all possible data sharing strategies.

To overcome this problem, we developed search algorithms[8] that focus attention on a small subset of strategies that are the most likely to contain the optimal strategy. We demonstrated that our method is the most effective considering both the utility of the data to the public and the data subject’s privacy.
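The study's search algorithms are not described here, so the following Python sketch swaps in a generic technique, greedy hill climbing, purely to show why a focused search helps. Everything in it is an assumption for illustration: the payoff function, the risk budget and the number of genomic positions are invented. With 40 positions that can each be shared or hidden, there are 2^40 possible strategies, far too many to score one by one, whereas the greedy search scores only a few thousand.

```python
import random


def make_payoff(n, seed=0):
    """Build a toy payoff over share/hide masks of n positions (hypothetical)."""
    rng = random.Random(seed)
    utility = [rng.uniform(0.5, 1.5) for _ in range(n)]  # value of each position
    risk = [rng.uniform(0.0, 1.0) for _ in range(n)]     # risk added by each position
    budget = 0.25 * sum(risk)                            # assumed tolerable total risk

    def payoff(mask):
        gain = sum(u for u, shared in zip(utility, mask) if shared)
        shared_risk = sum(r for r, shared in zip(risk, mask) if shared)
        # Risk above the budget is penalized heavily.
        return gain - 10.0 * max(0.0, shared_risk - budget)

    return payoff


def greedy_search(payoff, n):
    """Start by sharing nothing; repeatedly apply the single flip that helps most."""
    mask = [False] * n
    best = payoff(mask)
    evaluations = 1
    while True:
        best_flip = None
        for i in range(n):
            mask[i] = not mask[i]      # try flipping position i
            score = payoff(mask)
            evaluations += 1
            mask[i] = not mask[i]      # undo the trial flip
            if score > best:
                best, best_flip = score, i
        if best_flip is None:          # no single flip improves the payoff
            return mask, best, evaluations
        mask[best_flip] = not mask[best_flip]


if __name__ == "__main__":
    n = 40
    payoff = make_payoff(n)
    mask, score, evaluations = greedy_search(payoff, n)
    print(f"shared {sum(mask)} of {n} positions, payoff {score:.2f}, "
          f"{evaluations} strategies scored out of {2 ** n}")
```

A greedy search like this can stop at a strategy that is only locally best, so it is a stand-in for the idea of narrowing the search, not a guarantee of finding the optimal strategy.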

Why it matters

The worst-case scenario, where an attacker has unlimited capabilities and no aversion to financial losses, is often extremely unlikely. However, data managers sometimes focus on these scenarios, which can lead them to overestimate the risk of re-identification and share substantially less data than they safely could.

The goal of our work is to create a systematic approach to reason about the risks that also accounts for the value of the shared data. Our game-based approach not only provides a more realistic estimate of re-identification risk, but also finds data sharing strategies that can strike the right balance between utility and privacy.

What other research is being done

Data managers use cryptographic techniques[9] to protect[10] biomedical data. Other approaches include adding noise to data[11] and hiding partial data[12].

This work builds on our previous studies, which pioneered using game theory to assess the risk of re-identification within health data[13] and protect against identity attacks on genomic data[14]. Our current study is the first to consider an attack in which the attacker can access multiple resources and combine them in a stepwise manner.

What’s next

We are now working to expand our game-based approach to model the uncertainty and rationality of a player. We are also working to account for environments that consist of multiple data providers and multiple types of data recipients.


Read more https://theconversation.com/the-best-way-to-protect-personal-biomedical-data-from-hackers-could-be-to-treat-the-problem-like-a-game-173401
