Advanced Web Scraping & Data Extraction

Web-Harvest 2.1.0 is a powerful, enterprise-grade web scraping and data extraction tool with advanced HTML parsing, XQuery support, and a comprehensive processor ecosystem.

50+ Processors
100% Java
2.1.0 Latest Version
web-scraper.xml
<config xmlns="http://org.webharvest/schema/2.1/core">
  <def var="products">
    <html-to-xml advancedxmlescape="true">
      <http url="https://shop.example.com/products"/>
    </html-to-xml>
  </def>
  
  <loop item="product">
    <list>
      <xpath expression="//div[@class='product']">
        <get var="products"/>
      </xpath>
    </list>
    <body>
      <def var="name">
        <xpath expression=".//h3">
          <get var="product"/>
        </xpath>
      </def>
      <def var="price">
        <xpath expression=".//span[@class='price']">
          <get var="product"/>
        </xpath>
      </def>
    </body>
  </loop>
</config>

Powerful Features

Everything you need for professional web scraping and data extraction

Advanced HTML Parsing

Multi-strategy HTML parsing with JSoup, TagSoup, and Apache Tika for maximum compatibility and reliability.

XQuery & XPath Support

Powerful querying capabilities with full XQuery 3.1 and XPath 3.1 support for complex data extraction.

50+ Processors

Comprehensive set of processors for HTTP requests, file operations, database connections, and more.

Error Handling

Robust error handling with try-catch blocks, retry mechanisms, and comprehensive logging.
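
The `<try>` processor pairs a `<body>` with a `<catch>` branch that runs only when the body fails. A minimal sketch (the URL and variable name are illustrative):

```xml
<config xmlns="http://org.webharvest/schema/2.1/core">
  <def var="page">
    <try>
      <body>
        <!-- Attempt the request; any failure transfers control to <catch> -->
        <http url="https://shop.example.com/products"/>
      </body>
      <catch>
        <!-- Fall back to an empty result instead of aborting the run -->
        <empty/>
      </catch>
    </try>
  </def>
</config>
```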

High Performance

Optimized for speed and memory efficiency with streaming processing and parallel execution.

Extensible Architecture

Plugin-based architecture allows easy extension with custom processors and functionality.

Processor Categories

Comprehensive set of processors organized by functionality

Web & HTTP

  • <http> - HTTP requests
  • <http-header> - HTTP headers
  • <http-param> - HTTP parameters
  • <html-to-xml> - HTML parsing
  • <web-browser> - Headless browser
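
The HTTP processors compose: `<http-header>` and `<http-param>` are nested inside `<http>` to customize the request. A sketch with illustrative values:

```xml
<!-- POST request with a custom header and a form parameter -->
<http url="https://shop.example.com/search" method="post">
  <!-- Sent as a request header -->
  <http-header name="User-Agent">Web-Harvest/2.1</http-header>
  <!-- Sent in the POST body -->
  <http-param name="q">laptops</http-param>
</http>
```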

Data Processing

  • <xpath> - XPath extraction
  • <xquery> - XQuery processing
  • <regexp> - Regular expressions
  • <json-to-xml> - JSON conversion
  • <xml-to-json> - XML conversion
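
For plain-text sources, `<regexp>` applies a pattern to a source and, by default, returns the matched text. A small sketch (the source string is illustrative):

```xml
<!-- Extract a decimal price such as "19.99" from a text fragment -->
<regexp>
  <regexp-pattern>\d+\.\d{2}</regexp-pattern>
  <regexp-source>Price: 19.99 EUR</regexp-source>
</regexp>
```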

File Operations

  • <file> - File read/write
  • <include> - Include files
  • <template> - Template processing
  • <text> - Text processing
  • <xml> - XML processing
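
The `<file>` processor reads or writes depending on its `action` attribute; on write, its body supplies the content. A sketch assuming a previously defined `products` variable:

```xml
<!-- Persist the value of an earlier variable to disk -->
<file action="write" path="output/results.xml" charset="UTF-8">
  <get var="products"/>
</file>
```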

Control Flow

  • <loop> - Loops
  • <while> - While loops
  • <if> - Conditional execution
  • <try> - Error handling
  • <case> - Switch statements
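
`<case>` evaluates `<if condition="...">` branches in order and falls through to `<else>`. A sketch with illustrative variable names and values:

```xml
<!-- Label a product based on a previously scraped "stock" variable -->
<def var="label">
  <case>
    <if condition="${stock.toString() == '0'}">out of stock</if>
    <else>available</else>
  </case>
</def>
```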

Variables

  • <def var> - Variable definition
  • <get var> - Variable access
  • <set var> - Variable assignment
  • <list> - List creation
  • <script> - Script execution
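
`<def var>` introduces a variable, `<set var>` overwrites it, and `${...}` expressions (evaluated by `<template>`) read it back. A minimal sketch:

```xml
<!-- Define, then overwrite, then read a variable -->
<def var="greeting">Hello</def>
<set var="greeting">Hello, Web-Harvest</set>
<template>${greeting}</template>
```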

Extensions

  • <database> - Database operations
  • <mail> - Email sending
  • <ftp> - FTP operations
  • <zip> - Archive operations
  • <tokenize> - Text tokenization

Example Configurations

Real-world examples demonstrating Web-Harvest capabilities

E-commerce Monitoring

Business

Monitor product prices and availability across multiple e-commerce sites.

<config xmlns="http://org.webharvest/schema/2.1/core">
  <def var="productUrl">https://shop.example.com/product/123</def>
  <def var="productData">
    <html-to-xml advancedxmlescape="true">
      <http url="${productUrl}"/>
    </html-to-xml>
  </def>
  <def var="currentPrice">
    <xpath expression="//span[@class='price']">
      <get var="productData"/>
    </xpath>
  </def>
</config>

Social Media Analytics

Analytics

Extract and analyze social media posts for sentiment analysis.

<config xmlns="http://org.webharvest/schema/2.1/core">
  <def var="socialData">
    <!-- '#' must be percent-encoded as %23 inside a query string -->
    <http url="https://api.social.com/posts?hashtag=%23technology"/>
  </def>
  <def var="postsXml">
    <json-to-xml>
      <get var="socialData"/>
    </json-to-xml>
  </def>
  <def var="sentimentAnalysis">
    <xquery>
      <xq-param name="posts">
        <get var="postsXml"/>
      </xq-param>
      <xq-expression>
        <![CDATA[
        declare variable $posts as node() external;
        let $positive := count($posts//post[sentiment = "positive"])
        let $negative := count($posts//post[sentiment = "negative"])
        return <sentiment_analysis>
          <positive_posts>{$positive}</positive_posts>
          <negative_posts>{$negative}</negative_posts>
        </sentiment_analysis>
        ]]>
      </xq-expression>
    </xquery>
  </def>
</config>

Data Pipeline

Processing

Complete data processing pipeline with validation and transformation.

<config xmlns="http://org.webharvest/schema/2.1/core">
  <def var="rawData">
    <file path="input.csv" action="read"/>
  </def>
  <def var="processedData">
    <loop item="row">
      <list>
        <!-- Split the raw CSV into rows on newlines (&#10;) -->
        <tokenize delimiters="&#10;">
          <get var="rawData"/>
        </tokenize>
      </list>
      <body>
        <def var="validatedRow">
          <script><![CDATA[
            // Keep rows that have at least three comma-separated fields
            fields = row.toString().split(",");
            fields.length >= 3 ? row : "";
          ]]></script>
        </def>
      </body>
    </loop>
  </def>
</config>

API Documentation

Complete API reference for developers

Getting Started

Web-Harvest uses XML-based configuration files to define scraping workflows. Each configuration consists of processors that perform specific tasks.

Basic Configuration Structure

<?xml version="1.0" encoding="UTF-8"?>
<config xmlns="http://org.webharvest/schema/2.1/core"
        charset="UTF-8"
        scriptlang="beanshell">
  <!-- Your processors here -->
</config>

Configuration Attributes

Attribute   Description                                         Default
charset     Character encoding for the configuration            UTF-8
scriptlang  Scripting language (beanshell, javascript, groovy)  beanshell

Download & Installation

Get started with Web-Harvest in minutes

Command Line Interface

Run Web-Harvest from the command line for automation and scripting.

# Download and run
java -jar webharvest-cli-2.1.0.jar config=examples/simple_test.xml

# With custom output
java -jar webharvest-cli-2.1.0.jar config=my-config.xml output=results.xml

Graphical Interface

Use the intuitive GUI for visual configuration and testing.

# Run the IDE
java -jar webharvest-ide-2.1.0.jar

Installation Steps

  1. Install Java 8 or higher if not already installed
  2. Download the appropriate JAR file (CLI or IDE) for your needs
  3. Run the JAR file from the command line, or double-click the IDE JAR
  4. Define your scraping workflow in an XML configuration file
  5. Execute and monitor your scraping jobs