curl --request GET \
  --url https://api.avidoai.com/v0/tests \
  --header 'x-api-key: <api-key>' \
  --header 'x-application-id: <application-id>'

{
  "data": [
    {
      "id": "123e4567-e89b-12d3-a456-426614174000",
      "applicationId": "<string>",
      "createdAt": "2024-01-05T12:34:56.789Z",
      "modifiedAt": "2024-01-05T12:34:56.789Z",
      "status": "COMPLETED",
      "evals": [
        {
          "id": "123e4567-e89b-12d3-a456-426614174000",
          "orgId": "org_123456",
          "createdAt": "2024-01-05T12:34:56.789Z",
          "modifiedAt": "2024-01-05T12:34:56.789Z",
          "status": "COMPLETED",
          "results": {},
          "definition": {
            "id": "3c90c3cc-0d44-4b50-8888-8dd25736052a",
            "createdAt": "2024-01-05T12:34:56.789Z",
            "modifiedAt": "2024-01-05T12:34:56.789Z",
            "type": "NATURALNESS",
            "name": "<string>",
            "globalConfig": {
              "criterion": "<string>"
            },
            "styleGuideId": "3c90c3cc-0d44-4b50-8888-8dd25736052a",
            "application": {
              "id": "123e4567-e89b-12d3-a456-426614174000",
              "orgId": "org_123456",
              "title": "Customer Support Bot",
              "slug": "customer-support-bot",
              "description": "AI assistant for customer support inquiries",
              "context": "You are a helpful customer support assistant...",
              "language": "en",
              "type": "CHATBOT",
              "environment": "DEV",
              "createdAt": "2024-01-05T12:34:56.789Z",
              "modifiedAt": "2024-01-05T12:34:56.789Z"
            }
          },
          "score": 0.9,
          "passed": true
        }
      ],
      "taskId": "123e4567-e89b-12d3-a456-426614174000",
      "runId": "123e4567-e89b-12d3-a456-426614174000",
      "task": {
        "id": "123e4567-e89b-12d3-a456-426614174000",
        "title": "Credit Card Block",
        "topicId": "789e4567-e89b-12d3-a456-426614174000"
      },
      "traceId": "789e4567-e89b-12d3-a456-426614174000",
      "result": {
        "passed": 1,
        "failed": 1,
        "total": 1,
        "passRate": 50,
        "averageScore": 0.5
      }
    }
  ],
  "pagination": {
    "skip": 0,
    "limit": 25,
    "total": 100,
    "totalPages": 4
  }
}

Retrieves a paginated list of tests with optional filtering.
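The curl request above can also be issued from Python. The following is a minimal sketch using only the standard library; the helper names `build_request` and `list_tests` are illustrative and not part of any Avido SDK, and the response is assumed to be JSON shaped like the example above.

```python
import json
import urllib.request

BASE_URL = "https://api.avidoai.com/v0/tests"


def build_request(api_key: str, application_id: str) -> urllib.request.Request:
    """Build the GET request with the two authentication headers."""
    headers = {
        "x-api-key": api_key,
        "x-application-id": application_id,
    }
    return urllib.request.Request(BASE_URL, headers=headers, method="GET")


def list_tests(api_key: str, application_id: str) -> dict:
    """Send the request and decode the JSON body (performs a network call)."""
    with urllib.request.urlopen(build_request(api_key, application_id)) as resp:
        return json.load(resp)
```

Calling `list_tests("<api-key>", "<application-id>")` with real credentials would return a dictionary with `data` and `pagination` keys, assuming the response matches the example.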
Your unique Avido API key
Your unique Avido Application ID
Number of items to skip before starting to collect the result set.
Required range: x >= 0. Example: 0
Number of items to include in the result set.
Required range: 1 <= x <= 100. Example: 25
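As a rough illustration of the two ranges above, the sketch below validates both values before building the request URL. The query parameter names `skip` and `limit` are an assumption taken from the `pagination` object in the example response, and `tests_url` is a hypothetical helper.

```python
from urllib.parse import urlencode

BASE_URL = "https://api.avidoai.com/v0/tests"


def tests_url(skip: int = 0, limit: int = 25) -> str:
    """Return the list URL after checking the documented ranges."""
    if skip < 0:
        raise ValueError("skip must be >= 0")
    if not 1 <= limit <= 100:
        raise ValueError("limit must be between 1 and 100")
    # Parameter names are assumed, not confirmed by this page.
    return BASE_URL + "?" + urlencode({"skip": skip, "limit": limit})
```

For example, `tests_url(skip=50, limit=25)` addresses the third page when each page holds 25 items.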
Field to order by in the result set.
"createdAt"
Order direction.
Available options: asc, desc. Example: "desc"
Filter by test status (e.g. COMPLETED, FAILED)
Status of the evaluation/test
Available options: PENDING, IN_PROGRESS, COMPLETED, FAILED. Example: ["COMPLETED", "FAILED"]
Filter eval runs after this date (inclusive).
"2024-01-01T00:00:00.000Z"
Filter eval runs before this date (inclusive).
"2024-12-31T23:59:59.999Z"
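The date examples above use UTC timestamps with millisecond precision and a trailing `Z`. One way to produce that shape from a Python `datetime` is sketched below; `to_api_timestamp` is a hypothetical helper, and whether the API also accepts other ISO 8601 variants is not stated here.

```python
from datetime import datetime, timezone


def to_api_timestamp(dt: datetime) -> str:
    """Format an aware datetime as e.g. 2024-01-01T00:00:00.000Z."""
    if dt.tzinfo is None:
        raise ValueError("expected a timezone-aware datetime")
    utc = dt.astimezone(timezone.utc)
    # isoformat() emits "+00:00" for UTC; swap it for the "Z" suffix.
    return utc.isoformat(timespec="milliseconds").replace("+00:00", "Z")
```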
Filter by pass rate status badges (success: >75%, warning: 51-75%, error: ≤50%)
Available options: success, warning, error. Example: ["success", "warning"]
Filter tests by eval definition ID
"123e4567-e89b-12d3-a456-426614174000"
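The pass rate badge thresholds listed for the badge filter (success: >75%, warning: 51-75%, error: ≤50%) can be sketched as a small classifier. This is an illustration rather than the service's own logic: `pass_rate_badge` is a hypothetical name, and values strictly between 50 and 51 are assumed here to count as `warning`, which the documented ranges leave unspecified.

```python
def pass_rate_badge(pass_rate: float) -> str:
    """Map a pass rate in percent (0-100) to a status badge."""
    if not 0 <= pass_rate <= 100:
        raise ValueError("pass_rate must be between 0 and 100")
    if pass_rate > 75:
        return "success"
    if pass_rate > 50:  # assumption: 50 < rate < 51 is treated as warning
        return "warning"
    return "error"
```

Under these thresholds, the `passRate` of 50 in the example response would map to `error`.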
Filter tests by experiment variant ID
"123e4567-e89b-12d3-a456-426614174000"
Filter tests by run type (MANUAL, SCHEDULED, EXPERIMENT)
Available options: MANUAL, SCHEDULED, EXPERIMENT. Example: ["MANUAL", "SCHEDULED"]
Successfully retrieved evaluations
Paginated response containing tests
Unique identifier of the run
"123e4567-e89b-12d3-a456-426614174000"
The ID of the application this test belongs to
When the test was created
"2024-01-05T12:34:56.789Z"
When the test was last modified
"2024-01-05T12:34:56.789Z"
Status of the evaluation/test
Available options: PENDING, IN_PROGRESS, COMPLETED, FAILED. Example: "COMPLETED"
Array of evaluations in this run
Unique identifier of the evaluation
"123e4567-e89b-12d3-a456-426614174000"
Organization ID that owns this evaluation
"org_123456"
When the evaluation was created
"2024-01-05T12:34:56.789Z"
When the evaluation was last modified
"2024-01-05T12:34:56.789Z"
Status of the evaluation/test
Available options: PENDING, IN_PROGRESS, COMPLETED, FAILED. Example: "COMPLETED"
When the eval definition was created
"2024-01-05T12:34:56.789Z"
When the eval definition was last modified
"2024-01-05T12:34:56.789Z"
The type of evaluation
Available options: NATURALNESS, STYLE, RECALL, CUSTOM, FACT, OUTPUT_MATCH. Example: "NATURALNESS"
The criterion describes what the evaluation LLM must look for in the response. It must be phrased so that the answer is a pass or a fail.
Minimum length: 1
Application configuration and metadata
Unique identifier of the application
"123e4567-e89b-12d3-a456-426614174000"
Organization ID that owns this application
"org_123456"
Title of the application
Minimum length: 1. Example: "Customer Support Bot"
URL-friendly slug for the application
Minimum length: 1. Example: "customer-support-bot"
Description of the application
"AI assistant for customer support inquiries"
Context/instructions for the application
"You are a helpful customer support assistant..."
Language of the application
"en"
Type of the application. Valid values are CHATBOT or AGENT.
Available options: CHATBOT, AGENT. Example: "CHATBOT"
Environment of the application. Defaults to DEV.
Available options: DEV, PROD. Example: "DEV"
When the application was created
"2024-01-05T12:34:56.789Z"
When the application was last modified
"2024-01-05T12:34:56.789Z"
Overall score of the evaluation
0.9
Whether the evaluation passed
true
The unique identifier of the task
"123e4567-e89b-12d3-a456-426614174000"
The unique identifier of the run
"123e4567-e89b-12d3-a456-426614174000"
A task that represents a specific job-to-be-done by the LLM in the user application.
The unique identifier of the task
"123e4567-e89b-12d3-a456-426614174000"
The title of the task
"Credit Card Block"
The ID of the topic this task belongs to
"789e4567-e89b-12d3-a456-426614174000"
Optional ID of the trace this run is associated with
"789e4567-e89b-12d3-a456-426614174000"
Aggregated test result with pass/fail statistics
Pagination metadata returned in a paginated response.
Number of items skipped.
Required range: x >= 0. Example: 0
Number of items per page.
Required range: x >= 1. Example: 25
Total number of items available.
Required range: x >= 0. Example: 100
Total number of pages available.
Required range: x >= 0. Example: 4
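The pagination fields above are enough to work out which offsets to request when walking the full result set. The sketch below assumes that the total number of pages equals the total item count divided by the page size, rounded up. That matches the example values (100 items at 25 per page gives 4 pages) but is not stated explicitly on this page. `page_offsets` is a hypothetical helper.

```python
import math


def page_offsets(total: int, limit: int) -> list[int]:
    """Return the skip value for every page of a result set."""
    if total < 0:
        raise ValueError("total must be >= 0")
    if limit < 1:
        raise ValueError("limit must be >= 1")
    total_pages = math.ceil(total / limit)  # assumed relationship
    return [page * limit for page in range(total_pages)]
```

With the example metadata, `page_offsets(100, 25)` yields the offsets 0, 25, 50 and 75, one per page.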