Skip to main content
GET
/
v0
/
scrape-jobs
/
{id}
Get a scrape job by ID
curl --request GET \
  --url https://api.avidoai.com/v0/scrape-jobs/{id} \
  --header 'x-api-key: <api-key>' \
  --header 'x-application-id: <api-key>'
{
  "id": "123e4567-e89b-12d3-a456-426614174000",
  "createdAt": "2024-01-05T12:34:56.789Z",
  "modifiedAt": "2024-01-05T12:34:56.789Z",
  "orgId": "org_123",
  "initiatedBy": "user_123",
  "name": "Documentation Scrape",
  "url": "https://example.com",
  "pages": [
    {
      "url": "https://example.com/page1",
      "title": "Page 1",
      "description": "This is the first page of the documentation.",
      "category": "Documentation"
    },
    {
      "url": "https://example.com/page2"
    }
  ],
  "status": "PENDING"
}

Authorizations

x-api-key
string
header
required

Your unique Avido API key

x-application-id
string
header
required

Your unique Avido Application ID

Path Parameters

id
string<uuid>
required

The unique identifier of the scrape job

Example:

"123e4567-e89b-12d3-a456-426614174000"

Response

Scrape job retrieved successfully

Response containing the scrape job details A scrape job for extracting content from a website

id
string<uuid>
required

The unique identifier of the scrape job

Example:

"123e4567-e89b-12d3-a456-426614174000"

createdAt
string<date-time>
required

When the scrape job was created

Example:

"2024-01-05T12:34:56.789Z"

modifiedAt
string<date-time>
required

When the scrape job was last modified

Example:

"2024-01-05T12:34:56.789Z"

orgId
string
required

Organization ID that owns the scrape job

Example:

"org_123"

initiatedBy
string
required

User ID who initiated the scrape job

Example:

"user_123"

name
string
required

The name/title of the scrape job

Example:

"Documentation Scrape"

url
string<uri>
required

The URL that was scraped

Example:

"https://example.com"

status
enum<string>
required

Current status of the scrape job The status of the scrape job

Available options:
PENDING,
IN_PROGRESS,
RUNNING,
COMPLETED,
FAILED
Example:

"PENDING"

pages
object[]

The pages scraped from the URL

Example:
[
{
"url": "https://example.com/page1",
"title": "Page 1",
"description": "This is the first page of the documentation.",
"category": "Documentation"
},
{ "url": "https://example.com/page2" }
]