To handle PII data ingestion and ensure both redaction and re-identification for analytics purposes, you can use Cloud Data Loss Prevention (DLP) with appropriate techniques for masking and encryption.
Cloud Data Loss Prevention (DLP) with Cryptographic Hashing (C):
Use Cloud DLP to apply cryptographic hashing to PII data. Hashing transforms the data into a fixed-length string that is not directly readable, providing a layer of obfuscation. This helps in masking the PII while retaining the ability to verify data integrity.
Cloud Data Loss Prevention (DLP) with Deterministic Encryption using AES-SIV (E):
Apply deterministic encryption using AES-SIV through Cloud DLP. Deterministic encryption ensures that the same input will always produce the same encrypted output, allowing you to re-identify the PII when necessary. This method enables secure encryption while allowing data re-identification for analytics.
By combining these two approaches, you can effectively mask PII for privacy protection and later re-identify it when required for analysis.
References
Cloud Data Loss Prevention Documentation
Data Redaction and Masking Techniques
Submit