Cost-benefit awareness checklist

Q: 1. The costs and benefits of data storage needed to facilitate a solution

Baseline Federated learning Synthetic data, differential privacy Data stored centrally in one place Retain local control over data. Minimise risk from data breach by minimising aggregation. Avoid aggregation of security requirements from multiple organisations, and inflexibility due to complex multi-org governance regimes. Greater privacy at rest using differential privacy or fully synthetic data. Partial or hybrid synthetic data may require additional PETs to be stacked to be secure in storage. Read more about data storage costs and benefits at: data storage in federated learning data storage in synthetic data and differential privacy

Q: 4. How the solution will create new or augment testing and troubleshooting pathways

Baseline PETs The baseline uses traditional approaches to testing and patching. In HE and TEE solutions data is not visible to the data processor testing and troubleshooting may be more difficult and require mitigation at additional cost. Some data is visible to the data processor in SMPC . TEEs are hardware enabled which may necessitate additional mitigations and resources for testing processes and iterative bug fixes. Read more about testing and troubleshooting considerations at: testing and troubleshooting considerations in input privacy approaches

Q: 5. The technical skills and resources required to implement a solution

Baseline PETs Standard technical skills and resources required. Privacy enhancing technologies require a more niche skillset from specialist developers familiar with PETs libraries. This may require upskilling existing team members or hiring new talent. Read more about technical skills and resources at: technical skills and resources in federated learning and privacy enhancing technologies

Q: 6. The impacts of a potential solution on compliance with relevant regulations

Baseline PETs May require additional mitigations to ensure compliance with GDPR. This may prevent sensitive data from being processed if sufficient protections cannot be implemented. Privacy enhancing technologies may enhance compliance data protection law by providing anonymisation, protection against data breaches, or additional security. Organisations deploying PETs should consult legal teams and relevant ICO guidance. Read more about compliance considerations at: legal considerations for federated learning solutions legal considerations for input privacy approaches legal considerations for output privacy approaches

Q: 7. The long-term costs and benefits that may occur as the PETs ecosystem develops

Baseline PETs Adding new data sources may result in bespoke or semi-bespoke approach in each instance. Some data assets are unmonetizable due to privacy/commercial/IP concerns. Limited network effects due to centralised data and processes. Standardises approach to integrating new data sources, simplifying this process. Value can be derived from previously inaccessible data sources through privacy preserving approaches. More opportunities to benefit from federated learning as it becomes a more widely adopted approach. Read more about long-term costs and benefits at: long-term costs and benefits of federated learning and privacy enhancing technologies

Question 1

1. The costs and benefits of data storage needed to facilitate a solution

Accepted Answer

Baseline	Federated learning	Synthetic data, differential privacy
Data stored centrally in one place	Retain local control over data. Minimise risk from data breach by minimising aggregation. Avoid aggregation of security requirements from multiple organisations, and inflexibility due to complex multi-org governance regimes.	Greater privacy at rest using differential privacy or fully synthetic data.  Partial or hybrid synthetic data may require additional PETs to be stacked to be secure in storage.

Read more about data storage costs and benefits at:

Question 2

2. How adopting PETs may impact on compute requirements

Accepted Answer

Baseline	Federated learning	Synthetic data, differential privacy
Data stored centrally in one place	Retain local control over data. Minimise risk from data breach by minimising aggregation. Avoid aggregation of security requirements from multiple organisations, and inflexibility due to complex multi-org governance regimes.	Greater privacy at rest using differential privacy or fully synthetic data.  Partial or hybrid synthetic data may require additional PETs to be stacked to be secure in storage.

Read more about compute costs by at:

Question 3

3. The trade-offs between privacy-preserving infrastructures and data utility

Accepted Answer

Baseline	Federated learning	Synthetic data, differential privacy
Data stored centrally in one place	Retain local control over data. Minimise risk from data breach by minimising aggregation. Avoid aggregation of security requirements from multiple organisations, and inflexibility due to complex multi-org governance regimes.	Greater privacy at rest using differential privacy or fully synthetic data.  Partial or hybrid synthetic data may require additional PETs to be stacked to be secure in storage.

Read more about privacy-preserving infrastructures at:

privacy preserving infrastructure in federated learning
privacy preserving infrastructure in input privacy approaches
privacy preserving infrastructure in output privacy approaches

Question 4

4. How the solution will create new or augment testing and troubleshooting pathways

Accepted Answer

Baseline	Federated learning	Synthetic data, differential privacy
Data stored centrally in one place	Retain local control over data. Minimise risk from data breach by minimising aggregation. Avoid aggregation of security requirements from multiple organisations, and inflexibility due to complex multi-org governance regimes.	Greater privacy at rest using differential privacy or fully synthetic data.  Partial or hybrid synthetic data may require additional PETs to be stacked to be secure in storage.

Read more about testing and troubleshooting considerations at:

testing and troubleshooting considerations in input privacy approaches

Question 5

5. The technical skills and resources required to implement a solution

Accepted Answer

Baseline	Federated learning	Synthetic data, differential privacy
Data stored centrally in one place	Retain local control over data. Minimise risk from data breach by minimising aggregation. Avoid aggregation of security requirements from multiple organisations, and inflexibility due to complex multi-org governance regimes.	Greater privacy at rest using differential privacy or fully synthetic data.  Partial or hybrid synthetic data may require additional PETs to be stacked to be secure in storage.

Read more about technical skills and resources at:

technical skills and resources in federated learning and privacy enhancing technologies

Question 6

6. The impacts of a potential solution on compliance with relevant regulations

Accepted Answer

Baseline	Federated learning	Synthetic data, differential privacy
Data stored centrally in one place	Retain local control over data. Minimise risk from data breach by minimising aggregation. Avoid aggregation of security requirements from multiple organisations, and inflexibility due to complex multi-org governance regimes.	Greater privacy at rest using differential privacy or fully synthetic data.  Partial or hybrid synthetic data may require additional PETs to be stacked to be secure in storage.

Read more about compliance considerations at:

Question 7

7. The long-term costs and benefits that may occur as the PETs ecosystem develops

Accepted Answer

Baseline	Federated learning	Synthetic data, differential privacy
Data stored centrally in one place	Retain local control over data. Minimise risk from data breach by minimising aggregation. Avoid aggregation of security requirements from multiple organisations, and inflexibility due to complex multi-org governance regimes.	Greater privacy at rest using differential privacy or fully synthetic data.  Partial or hybrid synthetic data may require additional PETs to be stacked to be secure in storage.

Read more about long-term costs and benefits at:

long-term costs and benefits of federated learning and privacy enhancing technologies

Question 8

8. The barriers and risks that might be encountered when developing a solution

Accepted Answer

Develop a risk register using your own governance processes or consult the ICO guidance on identifying and addressing risks.

Baseline	Federated learning	HE, TEE, SMPC	Synthetic data, differential privacy
Invest in advanced hardware in one location and use it efficiently. The baseline example has variable latency and compute costs dependent on the scale of the data being processed.	Spread compute costs among participants.	HE solutions have a higher level of latency compared to other technologies because of processing occurs directly on encrypted data. This may also result in higher compute overheads. TEE has lower latency compared to HE since data is processed without encryption.  SMPC resulting in higher computational overheads than the baseline, however these are trivial in comparison to federated learning.	Synthetic data and DP datasets have higher compute costs in generation. These costs scale with the complexity of the original/real datasets.  Dynamic synthetic data and global DP have continuous compute overheads.

Baseline	PETs	Synthetic data, differential privacy
High level of utility due to data being decrypted and fully visible to all users.  Data is visible to all parties and unprotected after decryption.	HE solutions enable a greater degree of privacy than the baseline example because no external parties can view decrypted data at rest or in transit. TEE and SMPC solutions enable a greater degree of privacy than the baseline example but rely on a greater level of trust between parties than HE.	Synthetic data and DP lose utility as privacy is injected due to the changing distribution of trends in datasets. Organisations seeking to deploy these technologies should consider trade-offs between the appropriate level of privacy when sharing data and the required level of functionality to effectively use the solutions.  Synthetic data and DP enable sensitive data to be shared that would not previously be able to because of privacy concerns.

Cost-benefit awareness checklist

PETs cost-benefit checklist

1. The costs and benefits of data storage needed to facilitate a solution

2. How adopting PETs may impact on compute requirements

3. The trade-offs between privacy-preserving infrastructures and data utility

4. How the solution will create new or augment testing and troubleshooting pathways

5. The technical skills and resources required to implement a solution

6. The impacts of a potential solution on compliance with relevant regulations

7. The long-term costs and benefits that may occur as the PETs ecosystem develops

8. The barriers and risks that might be encountered when developing a solution

Is this page useful?

Help us improve GOV.UK

Help us improve GOV.UK

Baseline	PETs
The baseline uses traditional approaches to testing and patching.	In HE and TEE solutions data is not visible to the data processor testing and troubleshooting may be more difficult and require mitigation at additional cost. Some data is visible to the data processor in SMPC. TEEs are hardware enabled which may necessitate additional mitigations and resources for testing processes and iterative bug fixes.

Baseline	PETs
Standard technical skills and resources required.	Privacy enhancing technologies require a more niche skillset from specialist developers familiar with PETs libraries. This may require upskilling existing team members or hiring new talent.

Baseline	PETs
May require additional mitigations to ensure compliance with GDPR. This may prevent sensitive data from being processed if sufficient protections cannot be implemented.	Privacy enhancing technologies may enhance compliance data protection law by providing anonymisation, protection against data breaches, or additional security. Organisations deploying PETs should consult legal teams and relevant ICO guidance.

Baseline	PETs
Adding new data sources may result in bespoke or semi-bespoke approach in each instance. Some data assets are unmonetizable due to privacy/commercial/IP concerns. Limited network effects due to centralised data and processes.	Standardises approach to integrating new data sources, simplifying this process. Value can be derived from previously inaccessible data sources through privacy preserving approaches. More opportunities to benefit from federated learning as it becomes a more widely adopted approach.

Cookies on GOV.UK

PETs cost-benefit checklist

1. The costs and benefits of data storage needed to facilitate a solution

2. How adopting PETs may impact on compute requirements

3. The trade-offs between privacy-preserving infrastructures and data utility

4. How the solution will create new or augment testing and troubleshooting pathways

5. The technical skills and resources required to implement a solution

6. The impacts of a potential solution on compliance with relevant regulations

7. The long-term costs and benefits that may occur as the PETs ecosystem develops

8. The barriers and risks that might be encountered when developing a solution

Is this page useful?

Help us improve GOV.UK

Help us improve GOV.UK