Skip to main content

Saving Crawl Data Settings

Save webpage data like HTML, webpage text, or screenshots.

Updated over 2 months ago

Sitebulb's 'Saving Crawl Data' features allow you to save selected data as Sitebulb is crawling, which can be useful for bulk analysis, building data models, or diagnosing crawl issues.

Saving crawl data on larger sites can take up a lot of disk space - please ensure that your machine has the necessary resources if crawling on Sitebulb Desktop.

Webpage HTML

The webpage HTML Feature allows you to save the crawled HTML found on each internal webpage. This will be the response HTML when using the HTML crawler and the full rendered HTML when using the Chrome Crawler.

The webpage HTML will be available within the Sitebulb interface under 'URL Details' for each individual page.

On Sitebulb Desktop, the webpage HTML will also be saved locally on your machine for bulk analysis.

Webpage Text

This feature allows you to save the text found on each internal HTML page, including all the text content, the page title, and the meta description.

Saving Webpage Text is the perfect feature if you're looking to extract all page content.

Once the audit is completed, the webpage text data will be available via the ‘Website Text’ export in Bulk Exports.

Screenshots

Enabling the Screenshots save data feature will save screenshots of each rendered webpage. You can choose to save screenshots for mobile screen resolution, desktop screen resolution, or both.

Saving screenshots is only available in the Sitebulb desktop application.

Did this answer your question?