Optional add_
    Add chunks to the dataset that the crawl is created for. Defaults to true.
Optional allow_
    Allow the crawl to follow links to external websites.
Optional body_
    Text strings to remove from the body when creating chunks for each page.
Optional boost_
    Boost titles so that keyword matches in titles are prioritized in search results. Strongly recommended to leave this on. Defaults to true.
Optional exclude_
    URL patterns to exclude from the crawl.
Optional exclude_
    The HTML tags, classes, and ids to exclude from the response.
Optional heading_
    Text strings to remove from headings when creating chunks for each page.
Optional ignore_
    Ignore the website sitemap when crawling. Defaults to true.
Optional include_
    URL patterns to include in the crawl.
Optional include_
    The HTML tags, classes, and ids to include in the response.
Optional interval?: CrawlInterval | null
Optional limit?: number | null
    How many pages to crawl, defaults to 1000
Optional scrape_
Optional site_
    The URL to crawl
Optional tags?: string[] | null
    Tags to add to the crawl
Optional webhook_
    Metadata to send back with the webhook call for each successful page scrape
Optional webhook_
    Host to call back on the webhook for each successful page scrape
Options for setting up the crawl which will populate the dataset.
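As a rough illustration of how the optional, nullable fields in this listing might be consumed, the sketch below declares a local stand-in type containing only the two typed properties whose full names appear here (`limit` and `tags`) and applies the documented default of 1000 pages. `CrawlOptionsSketch`, `DEFAULT_LIMIT`, and `withDefaults` are hypothetical names introduced for this example and are not part of the documented interface. Treating missing tags as an empty array is likewise an assumption.

```typescript
// Hypothetical stand-in type: only the fields whose full names and types
// appear in the listing. This is not the library's own interface.
interface CrawlOptionsSketch {
  limit?: number | null;
  tags?: string[] | null;
}

// The listing states that `limit` defaults to 1000.
const DEFAULT_LIMIT = 1000;

// Hypothetical helper: replace omitted or null values with concrete ones.
// Falling back to an empty tag list is an assumption made for illustration.
function withDefaults(opts: CrawlOptionsSketch): { limit: number; tags: string[] } {
  return {
    limit: opts.limit ?? DEFAULT_LIMIT,
    tags: opts.tags ?? [],
  };
}

const resolved = withDefaults({ tags: ["docs"] });
console.log(resolved.limit, resolved.tags);
```

The nullish coalescing operator (`??`) is used instead of `||` so that only `undefined` and `null` trigger the fallback, which matches the `number | null` and `string[] | null` types shown in the listing.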