Legal Tech 6 min read

From 6 Months of Internal Struggle to a Working PoC in 3 Weeks

A Dutch notarial SaaS platform needed AI for document translation and plain-language explanation — but sensitive personal data blocked any LLM processing. KlusAI delivered a pseudonymization pipeline in 3 weeks.

6 months → 3 weeks to working PoC
Pseudonymization NLP Legal Tech
From 6 Months of Internal Struggle to a Working PoC in 3 Weeks

Pseudonymization pipeline that unlocked AI-powered document processing for a privacy-constrained legal platform

The Challenge

A Dutch notarial SaaS platform wanted to integrate AI-powered document translation and plain-language explanations into their product. The goal: make dense legal documents accessible to clients who aren’t legal professionals.

But there was a fundamental blocker. Notarial documents contain highly sensitive personal data — names, addresses, financial details, BSN numbers. Under GDPR and the emerging EU AI Act, sending this data to external LLM providers was a non-starter. The platform’s own team had spent 6 months trying to build an internal pseudonymization solution and hadn’t reached production quality.

We spent 6 months trying to solve the pseudonymization problem internally. You had a working proof of concept in 3 weeks.

— CEO, Dutch Notarial SaaS Platform

The Solution

KlusAI designed and built a pseudonymization pipeline that processes documents through three stages:

  1. Pre-processing — Documents are analyzed and all personally identifiable information (PII) is detected and replaced with consistent pseudonyms. Names become “Person A” and “Person B”, addresses become generic placeholders, financial details are masked — all while preserving the document’s logical structure.

  2. AI Processing — The pseudonymized document is sent to the LLM for translation or simplification. Because no real PII exists in the text, this step is fully GDPR-compliant.

  3. Post-processing — The AI output is de-pseudonymized: original PII entities are re-inserted into their correct positions, producing a final document that reads naturally with all real names and details intact.

A complete audit trail tracks every pseudonymization and de-pseudonymization operation, satisfying both GDPR accountability requirements and EU AI Act transparency obligations.

Key Results

The proof of concept was delivered in 3 weeks and validated against the platform’s real document corpus.

3 weeks

Time to working PoC

100%

GDPR compliant

EU AI Act

Ready from day one

Full

Audit trail coverage

The Transformation

The pseudonymization pipeline fundamentally changed what was possible for the platform.

PII Handling

Before

Documents couldn't be processed by AI due to privacy constraints

After

Automated pseudonymization enables safe LLM processing with full audit trail

Time to Solution

Before

6 months of internal development without production-ready results

After

Working proof of concept delivered in 3 weeks

Compliance

Before

Regulatory uncertainty blocking AI adoption

After

GDPR and EU AI Act compliant by design

Ready to build something like this?

Let's discuss how AI can solve your specific challenges and deliver measurable results.