Get a scrape job by ID

curl --request GET \
  --url https://api.avidoai.com/v0/scrape-jobs/{id} \
  --header 'x-api-key: <api-key>' \
  --header 'x-application-id: <application-id>'

{
  "id": "123e4567-e89b-12d3-a456-426614174000",
  "createdAt": "2024-01-05T12:34:56.789Z",
  "modifiedAt": "2024-01-05T12:34:56.789Z",
  "orgId": "org_123",
  "initiatedBy": "user_123",
  "name": "Documentation Scrape",
  "url": "https://example.com",
  "status": "PENDING",
  "pages": [
    {
      "url": "https://example.com/page1",
      "title": "Page 1",
      "description": "This is the first page of the documentation.",
      "category": "Documentation"
    },
    {
      "url": "https://example.com/page2"
    }
  ]
}

GET /v0/scrape-jobs/{id}

Authorizations

x-api-key

string

header

required

Your unique Avido API key

x-application-id

string

header

required

Your unique Avido Application ID

Path Parameters

id

string<uuid>

required

The unique identifier of the scrape job

Example:

"123e4567-e89b-12d3-a456-426614174000"
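The request described above can be sketched in Python using only the standard library. This is a minimal sketch, not an official client: the function name `build_get_scrape_job_request` and the `BASE_URL` constant are hypothetical, and the only details taken from this page are the URL, the two header names, and the UUID format of `id`.

```python
import urllib.request
import uuid

# Base URL taken from the curl example on this page.
BASE_URL = "https://api.avidoai.com/v0"


def build_get_scrape_job_request(
    job_id: str, api_key: str, application_id: str
) -> urllib.request.Request:
    """Build (but do not send) a GET request for one scrape job.

    Hypothetical helper for illustration only.
    """
    # The path parameter is documented as string<uuid>; uuid.UUID raises
    # ValueError on malformed input, so bad IDs fail before any network call.
    canonical_id = str(uuid.UUID(job_id))
    return urllib.request.Request(
        url=f"{BASE_URL}/scrape-jobs/{canonical_id}",
        method="GET",
        headers={
            "x-api-key": api_key,
            "x-application-id": application_id,
        },
    )
```

Passing the returned object to `urllib.request.urlopen` would send the request; the sketch stops short of that so the ID validation and header setup can be checked offline.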

Response

Scrape job retrieved successfully

Response containing the scrape job details

id

string<uuid>

required

The unique identifier of the scrape job

Example:

"123e4567-e89b-12d3-a456-426614174000"

createdAt

string<date-time>

required

When the scrape job was created

Example:

"2024-01-05T12:34:56.789Z"

modifiedAt

string<date-time>

required

When the scrape job was last modified

Example:

"2024-01-05T12:34:56.789Z"

orgId

string

required

Organization ID that owns the scrape job

Example:

"org_123"

initiatedBy

string

required

User ID who initiated the scrape job

Example:

"user_123"

name

string

required

The name/title of the scrape job

Example:

"Documentation Scrape"

url

string<uri>

required

The URL that was scraped

Example:

"https://example.com"

status

enum<string>

required

Current status of the scrape job

Available options: MAPPING, PENDING, IN_PROGRESS, COMPLETED, FAILED

Example:

"PENDING"
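Because `status` moves through several values, a caller will often poll this endpoint until the job settles. The sketch below shows one way to do that. It assumes that `COMPLETED` and `FAILED` are the terminal states, which this page does not state explicitly; `wait_for_scrape_job` and its parameters are hypothetical names, and `fetch_job` stands in for whatever function performs the GET request and returns the decoded JSON body.

```python
import time
from typing import Callable

# The five values listed for the status enum on this page.
KNOWN_STATUSES = {"MAPPING", "PENDING", "IN_PROGRESS", "COMPLETED", "FAILED"}
# Assumption: these two are terminal. The page does not say so explicitly.
TERMINAL_STATUSES = {"COMPLETED", "FAILED"}


def wait_for_scrape_job(
    fetch_job: Callable[[], dict],
    interval_seconds: float = 5.0,
    max_attempts: int = 60,
) -> dict:
    """Call fetch_job repeatedly until the job reaches a terminal status."""
    for attempt in range(max_attempts):
        job = fetch_job()
        status = job["status"]
        if status not in KNOWN_STATUSES:
            raise ValueError(f"unexpected status: {status!r}")
        if status in TERMINAL_STATUSES:
            return job
        # Sleep between attempts, but not after the final one.
        if attempt < max_attempts - 1:
            time.sleep(interval_seconds)
    raise TimeoutError(f"job not finished after {max_attempts} attempts")
```

Taking `fetch_job` as a parameter keeps the loop independent of any HTTP library and makes it easy to exercise with canned responses.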

pages

object[]

The pages scraped from the URL

Example:

[
  {
    "url": "https://example.com/page1",
    "title": "Page 1",
    "description": "This is the first page of the documentation.",
    "category": "Documentation"
  },
  { "url": "https://example.com/page2" }
]
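The example shows that entries in `pages` do not all carry the same keys: the second entry has only `url`. A minimal Python sketch of reading the array defensively follows. `ScrapedPage` and `parse_pages` are hypothetical names, and treating `url` as the one mandatory key is an assumption drawn from the example, since the child attributes are not listed on this page.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ScrapedPage:
    # Assumption: url is always present; the other keys may be absent,
    # as in the second entry of the example above.
    url: str
    title: Optional[str] = None
    description: Optional[str] = None
    category: Optional[str] = None


def parse_pages(job: dict) -> List[ScrapedPage]:
    """Convert the pages array of a scrape job response into objects."""
    # pages is not marked required, so tolerate a missing or null field.
    raw_pages = job.get("pages") or []
    return [
        ScrapedPage(
            url=page["url"],
            title=page.get("title"),
            description=page.get("description"),
            category=page.get("category"),
        )
        for page in raw_pages
    ]
```

Given the example response, `parse_pages` yields two objects, the second with `title`, `description`, and `category` all set to `None`.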
