How do you know your improvements don't introduce regressions? How do you know you're driving better outcomes for your customers? This issue is a guide for AI Engineers building system evals for their LLM-driven apps.
Recent launches
Forest Friends Zine