Research that ships
We build practical AI innovations — large-scale synthetic datasets, efficient models that run on consumer hardware, and production-ready systems for real-world deployment.
TinyFabulist
Large-scale synthetic narrative generation
A multi-phase research initiative producing open datasets, translation frameworks, and compact language models — all optimized for cost-effective deployment on consumer hardware.
Compact Romanian Language Models
End-to-end pipeline for training Romanian LMs from scratch: custom tokenizers, pretraining, compression via distillation, and large-scale dataset generation.
Synthetic Data Generation
Our comprehensive survey on generating training data using LLMs — published in IEEE Access.
Synthetic Data Generation Using Large Language Models: Advances in Text and Code
Mihai Nadăș, Laura Dioșan, Andreea Tomescu — IEEE Access, 2025
How enterprises can generate training data at scale — reducing annotation costs, addressing data scarcity, and enabling fine-tuning without exposing sensitive data.
Where we push boundaries
Our research translates directly into practical capabilities for clients and portfolio companies.
Synthetic Data Generation
Generate training data at scale without exposing sensitive data. Our comprehensive IEEE Access survey covers techniques from prompt engineering to reinforcement learning — achieving 3-26% performance gains in low-data scenarios.
Efficient AI Systems
Models that run on consumer hardware at a fraction of the cost. Techniques include quantization, pruning, and knowledge distillation.
Multilingual NLP
Specialized expertise in NLP, including low-resource languages — addressing tokenization penalties and underrepresented language challenges.
Educational AI
Value-aligned content generation for educational applications. Child-safe AI systems with explicit moral reasoning and age-appropriate outputs.
Datasets & models
We publish our datasets and models openly to advance the field and enable others to build on our work.
Partner on R&D
From synthetic data generation to domain-specific model development — let's explore what's possible together.
Message received!
We'll send you a confirmation email shortly.
Want to track your inquiry and access exclusive content?
Create your KlusAI Hub accountStay in the loop
Weekly insights on production AI — no hype, just what works.