Skip to content

🦑 TruLens

generate_test_set

trulens.benchmark.generate.generate_test_set¶

trulens.benchmark.generate.generate_test_set ¶

Classes¶

GenerateTestSet ¶

This class is responsible for generating a test set using the provided application callable.

Functions¶

init ¶

__init__(app_callable: Callable)

Initialize the GenerateTestSet class.

PARAMETER	DESCRIPTION
`app_callable`	The application callable to be used for generating the test set. TYPE: `Callable`

generate_test_set ¶

generate_test_set(
    test_breadth: int,
    test_depth: int,
    examples: Optional[list] = None,
) -> dict

Generate a test set, optionally using few shot examples provided.

PARAMETER	DESCRIPTION
`test_breadth`	The breadth of the test set. TYPE: `int`
`test_depth`	The depth of the test set. TYPE: `int`
`examples`	An optional list of examples to guide the style of the questions. TYPE: `Optional[list]` DEFAULT: `None`

RETURNS	DESCRIPTION
`dict`	A dictionary containing the test set. TYPE: `dict`

Example

# Instantiate GenerateTestSet with your app callable, in this case: rag_chain.invoke
test = GenerateTestSet(app_callable = rag_chain.invoke)

# Generate the test set of a specified breadth and depth without examples
test_set = test.generate_test_set(test_breadth = 3, test_depth = 2)

# Generate the test set of a specified breadth and depth with examples
examples = ["Why is it hard for AI to plan very far into the future?", "How could letting AI reflect on what went wrong help it improve in the future?"]
test_set_with_examples = test.generate_test_set(test_breadth = 3, test_depth = 2, examples = examples)