From 6 Months of Internal Struggle to a Working PoC in 3 Weeks
A Dutch notarial SaaS platform needed AI for document translation and plain-language explanation — but sensitive personal data blocked any LLM processing. KlusAI delivered a pseudonymization pipeline in 3 weeks.
Pseudonymization pipeline that unlocked AI-powered document processing for a privacy-constrained legal platform
The Challenge
A Dutch notarial SaaS platform wanted to integrate AI-powered document translation and plain-language explanations into their product. The goal: make dense legal documents accessible to clients who aren’t legal professionals.
But there was a fundamental blocker. Notarial documents contain highly sensitive personal data — names, addresses, financial details, BSN numbers. Under GDPR and the emerging EU AI Act, sending this data to external LLM providers was a non-starter. The platform’s own team had spent 6 months trying to build an internal pseudonymization solution and hadn’t reached production quality.
We spent 6 months trying to solve the pseudonymization problem internally. You had a working proof of concept in 3 weeks.
The Solution
KlusAI designed and built a pseudonymization pipeline that processes documents through three stages:
-
Pre-processing — Documents are analyzed and all personally identifiable information (PII) is detected and replaced with consistent pseudonyms. Names become “Person A” and “Person B”, addresses become generic placeholders, financial details are masked — all while preserving the document’s logical structure.
-
AI Processing — The pseudonymized document is sent to the LLM for translation or simplification. Because no real PII exists in the text, this step is fully GDPR-compliant.
-
Post-processing — The AI output is de-pseudonymized: original PII entities are re-inserted into their correct positions, producing a final document that reads naturally with all real names and details intact.
A complete audit trail tracks every pseudonymization and de-pseudonymization operation, satisfying both GDPR accountability requirements and EU AI Act transparency obligations.
Key Results
The proof of concept was delivered in 3 weeks and validated against the platform’s real document corpus.
3 weeks
Time to working PoC
100%
GDPR compliant
EU AI Act
Ready from day one
Full
Audit trail coverage
The Transformation
The pseudonymization pipeline fundamentally changed what was possible for the platform.
PII Handling
Before
Documents couldn't be processed by AI due to privacy constraints
After
Automated pseudonymization enables safe LLM processing with full audit trail
Time to Solution
Before
6 months of internal development without production-ready results
After
Working proof of concept delivered in 3 weeks
Compliance
Before
Regulatory uncertainty blocking AI adoption
After
GDPR and EU AI Act compliant by design
Ready to build something like this?
Let's discuss how AI can solve your specific challenges and deliver measurable results.