AI and Testing – Page 2 – Stories from a Software Tester

AI and Testing: Answer Relevancy

written by Jeff Nyman

In the previous post we got set up with DeepEval. Here we’re going to put that tool to use by looking at our first test case and our first quality metric.

AI and Testing

AI and Testing: Evaluation and DeepEval

written by Jeff Nyman

In previous posts in this series, I’ve largely been talking about how to use local LLMs by writing scripts and, along the way, I’ve been able to shoehorn in some testing ideas. We even wrote a bespoke test script together. In this post, I’m going to focus more specifically on testing by considering the idea of evaluation.

AI and Testing: Personal Marketability

written by Jeff Nyman

In the posts in this series, I’ve been taking you through a lot of concepts and tooling. That’s going to continue but, for this post, it felt prudent to take a little break and talk about why doing all this can matter. That gets into interviewing and potentially being hired.

AI and Testing: Scaling Tests

written by Jeff Nyman

In the previous post, we refactored a test case that we have been working on. In this post, we’re going to use that refactored test case and scale it up a bit.

AI and Testing: Refactoring Tests

written by Jeff Nyman

In the previous post, we refined an AI test case that we had previously created as a testing example. In this brief post, I want to show a refactoring of that code. We will also align on the output of this test.

AI and Testing: Refining Tests

written by Jeff Nyman

In the previous post I provided an extended testing example where we wrote an “AI test case” together. This post will provide some more test thinking around that initial test case.

AI and Testing: A Testing Example

written by Jeff Nyman

In this post, my goal is to write a relatively substantive test case and, while doing so, bring together many of the topics talked about in previous posts of this series.

AI and Testing: LangChain and Orchestration

written by Jeff Nyman

Here I’m going to continue the thread from the previous post, where we started to look at the concept of Runnables, which is really what puts the “Chain” in “LangChain.”

AI and Testing: LangChain Messages

written by Jeff Nyman

In the previous post, we got familiar with LangChain templates and dipped our toes into messages. In this post, I’m going to focus a bit more on those messages since these are the key to communicating with AI.

AI and Testing: LangChain Templates

written by Jeff Nyman

In this post I’m going to follow the thread from the previous post, dig further into the LangChain ecosystem, and start looking at templates for prompts.

AI and Testing: Local LLMs and LangChain

written by Jeff Nyman

The previous post covered Ollama as a way to get a local LLM running on your machine. Here, I’ll focus on using that LLM and introduce two key properties of testability in this context. Doing so will introduce LangChain.

AI and Testing: Ollama and Models

written by Jeff Nyman

In this post I want to take the initial steps to get some basic tooling available and operating. This is step one if you’re going to work in a technologist context with AI applications.

AI and Testing: Evaluating the Future

written by Jeff Nyman

As our technocracy continues to grow and as (at least some) technologists continue to push us toward a potentially dehumanized and dehumanizing future, I want to focus on how we can work from within this technocracy to make sure that human experimentation is front and center.

Stories from a Software Tester

Twice upon a time, in another space, no distance in any direction from here …

Category: AI and Testing