Jordy Van Landeghem | AI & Machine Learning Consultant


World-class AI researcher with a PhD in Computer Science, specializing in Document AI, Agentic Systems, and Generative AI. I help organizations build ambitious AI roadmaps and turn cutting-edge research into production-ready solutions.

10+ Years in AI/ML

3 Advanced Degrees

15+ Publications

2024 – present

Full-time

Senior Software Engineer, Machine Learning

Instabase — Remote

Designed and shipped DocumentReActAgent: agentic self-correction loop over 1M+ document repositories; drove straight-through processing to 100% by resolving the hard edge cases where document automation typically hits its ceiling.
Designed evaluation & benchmarking framework (multi-provider, LLM-as-a-Judge, structured logprobs); drove Gemini model selection and contributed to enterprise deal closure.
Authored PRD for Unified Extractor v2 architecture redesign (schema / state / prompts / engines / orchestration separation).
Technical lead, Agent Mode team; mentored engineers across Project Accuracy, Agent Mode, and AXIS teams.
Published two ICML 2025 technical blog posts on agentic document AI; ranked #1 in Cursor AI productivity company-wide.

2017 – 2024

Full-time · 7 years

Lead AI Research Engineer

Contract.fit — Brussels, Belgium

Led end-to-end Document AI engineering (NLP + CV) for insurance, finance, and legal domains across a production SaaS platform.
Designed and shipped production-grade ML pipelines for document classification and information extraction, and maintained them through 7 years of growth.
Secured 4 Flemish innovation grants (VLAIO) as lead researcher; co-wrote all applications.
Supervised 11 Master's AI/CS thesis internships across KU Leuven and VUB.
Translated academic advances (DUDE benchmark, uncertainty estimation) into scalable product features.

2017

Research Intern

Language Modelling Research

Nuance Communications — Aachen, Germany

Researched regularisation techniques for RNN language models; implemented biLSTM character-based word embeddings.

2016 – 2017

Research Intern

NLP for Dialogue Systems

Oracle — Barcelona, Spain

Investigated Seq2Seq neural networks for chatbot and virtual assistant technology.

Let's Build Something Ambitious

Whether you're exploring AI strategy, need help building a GenAI prototype, or want to discuss how agentic automation can transform your workflows, I'm here to help.

jordy.vlan@gmail.com

Belgium (EU) — Available Globally

6 Languages: Dutch, English, Spanish, French, German, Portuguese



Areas of Expertise

Document AI

Agentic AI Systems

LLM/VLM Engineering

Evaluation & Benchmarking

ML Operations

Research & Innovation

Consulting Services

AI Strategy & Roadmap

GenAI Prototype Development

Agentic Automation

Technical Due Diligence

Projects

DUDE

DocumentReActAgent

DRAG

Selected Publications

Intelligent Automation for AI-Driven Document Understanding

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

A Novel Characterization of the Population Area Under the Risk Coverage Curve (AURC) and Rates of Finite Sample Estimators

Where Layout Meets Language

DistilDoc: Knowledge Distillation for Visually-Rich Document Applications

Beyond Document Page Classification: Design, Datasets, and Challenges

Document Understanding Dataset and Evaluation (DUDE)

ICDAR 2023 Competition on Document UnderstanDing of Everything (DUDE)

Benchmarking Scalable Predictive Uncertainty in Text Classification

Predictive Uncertainty for Probabilistic Novelty Detection in Text Classification

Transfer Learning for Named Entity Recognition in Financial & Biomedical Documents

Talks & Presentations

Parse, Reflect, Retrieve, Compile: An Agent Stack for Enterprise Document AI

Beyond Document Page Classification: Design, Datasets, and Challenges

ICDAR 2023 Competition on Document UnderstanDing of Everything (DUDE)

DUDE — What's Next?

Calibration Primer for Document AI

Grants & Funding

Leveraging Document Structure for Improved Document Understanding

Intelligent Automation for AI-driven Document Understanding

Development of a Performant and User-friendly API Self-service Portal and World-class Classification Modules

Self-Learning Platform for Simplifying Data-intensive Client Interactions
