Parse Website

curl --request POST \
  --url https://rag-prod.studio.lyzr.ai/v3/parse/website/ \
  --header 'Content-Type: application/json' \
  --header 'x-api-key: <api-key>' \
  --data '
{
  "urls": [
    "string"
  ],
  "source": "website",
  "max_crawl_pages": 1,
  "max_crawl_depth": 0,
  "dynamic_content_wait_secs": 5,
  "actor": "apify/website-content-crawler",
  "crawler_type": "cheerio",
  "chunk_size": 1000,
  "chunk_overlap": 100
}
'

{
  "documents": []
}

POST /v3/parse/website/

Authorizations

x-api-key (string, header, required)
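
As a minimal sketch of the authorization requirement above, the helper below builds the two headers used in the curl sample. The function names and the `LYZR_API_KEY` environment variable are assumptions made for illustration; they are not part of the documented API.

```python
import os


def auth_headers(api_key):
    """Return the headers shown in the curl sample for a given API key."""
    if not api_key or not api_key.strip():
        raise ValueError("x-api-key is required")
    return {
        "Content-Type": "application/json",
        "x-api-key": api_key.strip(),
    }


def headers_from_env(var_name="LYZR_API_KEY"):
    """Read the key from an environment variable.

    LYZR_API_KEY is a hypothetical variable name chosen for this sketch.
    """
    return auth_headers(os.environ.get(var_name, ""))
```

Reading the key from the environment rather than hard-coding it keeps it out of source control; any other secret store would serve the same purpose.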

Body

application/json

urls (string[], required)
List of website URLs to crawl.
Example: ["string"]

source (string, required)
Data source identifier.
Example: "website"

max_crawl_pages (integer, required)
Maximum number of pages to crawl.
Example: 1

max_crawl_depth (integer, required)
Maximum crawl depth.
Example: 0

dynamic_content_wait_secs (integer, required)
Time to wait for dynamic content to load, in seconds.
Example: 5

actor (string, required)
Apify actor used for crawling.
Example: "apify/website-content-crawler"

crawler_type (string, required)
Type of crawler used.
Example: "cheerio"

chunk_size (integer, required)
Size of the chunks for text splitting.
Example: 1000

chunk_overlap (integer, required)
Overlap between consecutive text chunks.
Example: 100
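
The body fields above can be assembled in Python roughly as follows. This is a sketch using only the standard library: the defaults mirror the example values on this page, the helper names are hypothetical, and the checks (non-empty `urls`, `chunk_overlap` smaller than `chunk_size`) are client-side assumptions rather than documented server rules.

```python
import json
import urllib.request

PARSE_WEBSITE_URL = "https://rag-prod.studio.lyzr.ai/v3/parse/website/"


def build_payload(urls, max_crawl_pages=1, max_crawl_depth=0,
                  dynamic_content_wait_secs=5,
                  actor="apify/website-content-crawler",
                  crawler_type="cheerio",
                  chunk_size=1000, chunk_overlap=100):
    """Build the JSON body; defaults are the example values from this page."""
    if not urls:
        raise ValueError("urls must contain at least one URL")
    # Assumed sanity check, not a documented constraint of the API.
    if chunk_overlap >= chunk_size:
        raise ValueError("chunk_overlap should be smaller than chunk_size")
    return {
        "urls": list(urls),
        "source": "website",
        "max_crawl_pages": max_crawl_pages,
        "max_crawl_depth": max_crawl_depth,
        "dynamic_content_wait_secs": dynamic_content_wait_secs,
        "actor": actor,
        "crawler_type": crawler_type,
        "chunk_size": chunk_size,
        "chunk_overlap": chunk_overlap,
    }


def build_request(payload, api_key):
    """Prepare (but do not send) the POST request from the curl sample."""
    return urllib.request.Request(
        PARSE_WEBSITE_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json", "x-api-key": api_key},
        method="POST",
    )
```

To send the request, pass the result of `build_request(build_payload(["https://example.com"]), api_key)` to `urllib.request.urlopen`. Separating construction from sending makes the payload easy to inspect before any network call.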

Response

Website successfully parsed and documents returned

documents (array)
List of parsed documents.
Example: []
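
A response body of this shape could be read as sketched below. The page does not specify what each entry in `documents` contains, so the hypothetical helper only checks that the array is present and returns it unchanged.

```python
import json


def extract_documents(body):
    """Parse a response body (str or bytes) and return its 'documents' list."""
    data = json.loads(body)
    docs = data.get("documents") if isinstance(data, dict) else None
    if not isinstance(docs, list):
        raise ValueError("response does not contain a 'documents' array")
    return docs
```

With the example response shown on this page, `extract_documents('{"documents": []}')` returns an empty list.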
