5 February 2026 – Stories from a Software Tester

AI and Testing: Evaluation and DeepEval

written by Jeff Nyman

In previous posts in this series, I’ve largely been talking about how to use local LLMs by writing scripts and, along the way, I’ve been able to shoehorn in some testing ideas. We even wrote a bespoke test script together. In this post, I’m going to focus more specifically on testing by considering the idea of evaluation.

Continue reading AI and Testing: Evaluation and DeepEval →

Stories from a Software Tester

Twice upon a time, in another space, no distance in any direction from here …

Day: February 5, 2026

AI and Testing: Evaluation and DeepEval