---
title: 'Stories | OneOffTech'
description: 'Insights, case studies, and technical reflections from the OneOff-Tech team on knowledge management, AI, and open-source tools.'
---
# OneOffTech's Stories
Insights, case studies, and technical reflections from the OneOff-Tech team on knowledge management, AI, and open-source tools.
## [Introducing Agent Mode](/blog/agent-mode)
Agent Mode is our way to welcome AI agents browsing OneOffTech websites: markdown-first, AI-ready content that is more accessible and visible.
## [Rethinking Document Intelligence: Structured Extraction and the Primacy of Data Preparation](/blog/structured-extraction)
Structured extraction with LLMs works best when input data is carefully prepared. We share how a parser-oriented approach — decoupling document parsing from AI tasks — enabled reliable extraction of lessons learned and recommendations from 200+ project reports.
## [Teaching AI to Think Like an Expert: Knowledge Graphs and RAG in Climate Networks](/blog/skos-for-km)
Bridging knowledge management with controlled vocabularies and AI-powered retrieval. Working with M&E specialists, we built a SKOS-based knowledge graph, integrated three domain thesauri, and used it for RAG-based retrieval.
## [Parxing Week 2025](/blog/parxing-week-2025)
Decoding the unstructured world. A week dedicated to document parsing, evaluation, and the open tools powering the next generation of data intelligence.
## [You Can't Debug a Judgment: Behavior(alism) vs Function(alism) in AI Evaluation](/blog/behavior-vs-function)
In traditional software, we debug behavior; in AI, we evaluate function. This post explores the tension between behavioral transparency and functional performance in AI systems, drawing on both philosophy and software engineering. When the internal workings are opaque, as in neural networks, we shift from analyzing how a system works to judging what it achieves.
## [AI Evals: How to evaluate your artificial intelligence component](/blog/ai-eval-loop)
Your AI system is only as good as your evaluation loop. A great model means nothing if you can’t measure and improve its real-world behavior. We introduce a practical approach for creating an AI evaluation loop that's reliable, measurable, and user-centered, so you can add AI confidently to your product.
## [Contexts & Machines: How Document Parsing Shapes RAG results](/blog/pdf-parsing)
Does PDF parsing affect retrieval? Let's dive into our experiments on PDF parsing and its effect on document retrieval, presented at Berlin Buzzwords 2025.
## [Awesome PDF: A Curated List of Libraries, Services, and Resources for Working with PDF Files](/blog/awesome-pdf)
Over time, we've gathered a collection of links to useful services, libraries, and datasets. Now it's time to share this curated list of PDF tools with the world.
## [Tip: Use Playwright to quickly test across multiple browsers](/blog/browser-testing)
Need to test your application in a specific browser version? Here's a quick tip to help you get started using Playwright.
## [AI Agents and agentic systems: definitions and patterns](/blog/agents)
"Agent" is now a common term when referring to interactions with large language models. Let's explore some definitions following Anthropic's guidance.
## [Exploring the AI Act](/blog/ai-act)
We explore the AI Act and its pillars from the perspective of companies/organizations using artificial intelligence.
## [An introduction to Retrieval-Augmented Generation (RAG)](/blog/rag)
The Retrieval-Augmented Generation (RAG) approach improves LLMs by incorporating domain-specific information for more accurate answers. Let's see how it works.
## [Knowledge matters for everybody](/blog/km-workshop-for-all)
A workshop on knowledge management for all!
## [Personal data management in the age of Machine Learning](/blog/personal-data-ml)
Explore how to handle personal and sensitive data when developing or interacting with machine learning applications. A case study focusing on Retrieval Augmented Generation (RAG).
## [Proof of Concepts, what are they and how do they apply to knowledge management?](/blog/poc)
Proof of Concept (POC) is a common tool in technology-focused as well as knowledge management activities. We explore how the two uses compare.
## [Creating and Testing the DIGITOO Toolkit: Enhancing Media and Digital Literacy in Education](/blog/digitoo-toolkit)
OneOff-Tech is a partner in the European-funded DIGITOO project to improve students' digital literacy education. Autumn 2023 marks the start of the collaborative review and testing of the toolkit on media and digital literacy skills and the handbook on digital ecology.
## [Digital citizenship becomes a crucial objective for schools](/blog/digitoo)
OneOff-Tech is a partner in the European-funded DIGITOO project to improve students' digital literacy education. November 2022 marks the start of the journey to develop a toolkit for education professionals that includes information and pedagogical methods on digital citizenship, digital footprint, and digital ecology.
## [Open Source digital platform for ITHEN - a success story](/blog/open-source-digital-platform-for-ithen-a-success-story)
How OneOffTech is piloting Open Source solutions to power communication and collaboration in the International Technical Higher Education Network (ITHEN).
## [MDLP Return on Knowledge book](/blog/mdlp-return-on-knowledge-book)
OneOffTech is one of the contributors to the Multi-Donor Learning Partnership (MDLP) publication on how international development agencies are collaborating to deliver impact through knowledge, learning, research, and evidence.
## [An example of a data driven network analysis](/blog/an-example-of-a-data-driven-network-analysis)
Data-driven exploration builds value for knowledge discovery in expert networks. To illustrate, here is an example from an assessment study for the Low Emission Development Strategies Global Partnership.
This content was copied from https://oneofftech.de/blog