World Bank — Romania.
A national database of energy performance certificates — document-intelligence pipelines processing hundreds of thousands of documents in inconsistent formats into one structured, searchable database.
- Industry
- Public sector / document intelligence
- Timeline
- August 2025
- Outcome
- Hundreds of thousands of documents in one database
The problem
Several hundred thousand energy-performance-certificate documents in inconsistent formats — PDFs, scans, mixed formats — with no structured, searchable database.
What I built
Design and delivery of AI-supported document-intelligence pipelines: extraction, normalisation and structuring of data from hundreds of thousands of documents into a single database.
The outcome
A structured, searchable database for government administration — the foundation for the national programme's analytics and reporting.