Interactive execution control for WebHarvest IDE
Temporarily halt scraper execution and continue later from the same point. Perfect for debugging, resource management, and interactive development workflows.
Halt execution at any time, state preserved
Continue from exact point where paused
Permanently cancel execution
Programmatic access via REST endpoints
Running State: ┌──────────────────────────────────┐ │ Progress: ████████░░░░░ 60% │ │ [⏸ Pause] [⏹ Stop] │ └──────────────────────────────────┘ Paused State: ┌──────────────────────────────────┐ │ Status: ⏸ Paused │ │ [▶ Resume] [⏹ Stop] │ └──────────────────────────────────┘
Pauses a running execution.
POST /api/execution/abc-123-def-456/pause
{
"success": true,
"executionId": "abc-123-def-456",
"action": "pause",
"status": "PAUSED"
}
Resumes a paused execution.
POST /api/execution/abc-123-def-456/resume
Stops execution permanently (cannot resume).
POST /api/execution/abc-123-def-456/stop
1. Run scraper
2. Pause after HTTP request
3. Inspect HTML in Variables panel
4. Verify XPath selectors work
5. Resume to continue
6. Pause again after next step
7. Iterate until working perfectly
Pause expensive scrapers during peak hours:
const hour = new Date().getHours();
if (hour >= 9 && hour <= 17) { // Business hours
await fetch(`/api/execution/${executionId}/pause`, {
method: 'POST'
});
console.log('Paused during peak hours');
}
Pause to manually verify scraped data before continuing:
<config>
<!-- Scrape first page -->
<http url="https://example.com/page1"/>
<def var="page1">${http}</def>
<!-- PAUSE HERE: Check if page1 looks correct -->
<!-- Continue with processing -->
<xpath expression="//data">
<html-to-xml>${page1}</html-to-xml>
</xpath>
</config>
Pause mechanism:
paused
flag in ExecutionJobpauseLock
monitorResume mechanism:
paused
flagpauseLock
monitorStop mechanism:
stopped
flagAdvanced pause/resume with checkpoints coming in v2.3.0