Crawl websites, process the content, and use it to train the specified RAG system.
The ID of the RAG system to train (must be a 24-character hex string).
"654c602a46c3b6d4e28741b0"
List of website URLs to crawl
["string"]
Data source identifier
"website"
Maximum number of pages to crawl
1
Maximum crawl depth
0
Time to wait for dynamic content to load (in seconds)
5
Apify actor used for crawling
"apify/website-content-crawler"
Type of crawler used
"cheerio"
Size of the chunks for text splitting
1000
Overlap between consecutive text chunks
100
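The chunk size and overlap settings above can be illustrated with a minimal sketch. It assumes sizes are measured in characters and that chunks are cut at fixed offsets; the service may instead count tokens or split on sentence boundaries, so treat `split_with_overlap` as a hypothetical helper rather than the actual splitter.

```python
def split_with_overlap(text: str, chunk_size: int = 1000, overlap: int = 100) -> list[str]:
    """Split text into fixed-size chunks where each chunk repeats the
    last `overlap` characters of the previous one.

    Assumption: sizes are in characters, not tokens.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be >= 0 and smaller than chunk_size")

    step = chunk_size - overlap  # distance between consecutive chunk starts
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once the current chunk reaches the end of the text,
        # so no trailing chunk made only of overlap is emitted.
        if start + chunk_size >= len(text):
            break
    return chunks
```

With the example values (1000 and 100), consecutive chunks start 900 characters apart, so a sentence that straddles a chunk boundary is more likely to appear whole in at least one chunk.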
Website successfully crawled, processed, and RAG system trained.
Placeholder for a success message or job ID.
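As a rough sketch of how a client might assemble a request from the parameters described above: the reference does not give the field names, endpoint path, or HTTP method, so every key in the dictionary below and the function name `build_crawl_request` are hypothetical. Using the listed example values as defaults is likewise an assumption.

```python
import json
import re

# The RAG system ID must be a 24-character hex string.
HEX_ID = re.compile(r"[0-9a-fA-F]{24}")


def build_crawl_request(
    rag_id: str,
    urls: list[str],
    *,
    max_pages: int = 1,
    max_depth: int = 0,
    wait_seconds: int = 5,
    chunk_size: int = 1000,
    chunk_overlap: int = 100,
) -> dict:
    """Validate inputs and return a request body as a plain dict.

    All key names are placeholders chosen for illustration.
    """
    if not HEX_ID.fullmatch(rag_id):
        raise ValueError("rag_id must be a 24-character hex string")
    if not urls:
        raise ValueError("at least one URL is required")
    if not 0 <= chunk_overlap < chunk_size:
        raise ValueError("chunk_overlap must be >= 0 and smaller than chunk_size")

    return {
        "rag_id": rag_id,                      # hypothetical key
        "urls": list(urls),                    # hypothetical key
        "source": "website",                   # data source identifier
        "max_crawl_pages": max_pages,          # hypothetical key
        "max_crawl_depth": max_depth,          # hypothetical key
        "dynamic_content_wait_secs": wait_seconds,  # hypothetical key
        "actor": "apify/website-content-crawler",
        "crawler_type": "cheerio",
        "chunk_size": chunk_size,
        "chunk_overlap": chunk_overlap,
    }


if __name__ == "__main__":
    body = build_crawl_request("654c602a46c3b6d4e28741b0", ["https://example.com"])
    print(json.dumps(body, indent=2))
```

Validating the ID and the chunk settings on the client side catches malformed requests before any crawl is started.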