vitals_view: Interactively view local evaluation logs

vitals bundles the Inspect log viewer, an interactive app for exploring
evaluation logs. Supply the path to a directory of task logs written to JSON.
To view an individual Task object, use its <code>$view()</code> method instead.

A port of 'Inspect', a widely adopted 'Python' framework for
large language model evaluation. Specifically aimed at 'ellmer' users
who want to measure the effectiveness of their large language model-based
products, the package supports prompt engineering, tool usage,
multi-turn dialog, and model-graded evaluations.

Simon Couch

vitals

Large Language Model Evaluation

Max Kuhn

Hadley Wickham

Mine Cetinkaya-Rundel

Posit Software, PBC 

vitals_view function

<dl><dt>dir</dt>
<dd>Path to a directory containing task eval logs.</dd>
<dt>host</dt>
<dd>Host to serve on. Defaults to "127.0.0.1".</dd>
<dt>port</dt>
<dd>Port to serve on. Defaults to 7576, one greater than the Python
implementation.</dd></dl>
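As a rough illustration of how the three arguments fit together, the sketch below launches the viewer against a log directory. The path in `log_dir` is hypothetical, and the `can_launch` guard is an assumption added here so the snippet is safe to run anywhere; only `dir`, `host`, and `port` come from the argument list above.

```r
# Hypothetical location; replace with the directory your tasks logged to.
log_dir <- file.path("logs", "vitals")

# Assumption: the viewer serves a local web app, so only launch it from an
# interactive session where vitals is installed and the directory exists.
can_launch <- interactive() &&
  requireNamespace("vitals", quietly = TRUE) &&
  dir.exists(log_dir)

if (can_launch) {
  # host and port are spelled out here with their documented defaults.
  vitals::vitals_view(dir = log_dir, host = "127.0.0.1", port = 7576)
} else {
  message("Skipping viewer: needs an interactive session, vitals, and ", log_dir)
}
```

Passing a different `port` may help if 7576 is already in use on the machine.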

