Bug Fixes
in v2.2.0

Critical bugs addressed using Test-Driven Development

WebHarvest 2.2.0 includes fixes for binary file downloads, exception handling, and JavaScript support - all with comprehensive test coverage.

Test Coverage

All bugs fixed with TDD approach

3,091 Tests

Comprehensive test suite with 100% pass rate

5 Critical Bugs

Fixed in v2.2.0 with complete test coverage

Zero Regression

All existing functionality maintained

Fixed Bugs

Detailed information in BUGS_FIXED.md

Bug #41: Binary File Downloads

Issue: RAR/ZIP/PDF files corrupted when saved with type="binary"

Fix: FilePlugin now uses NodeVariable.toBinary() for binary writes

Tests: 7 tests added

Bug #27: Memory Leak in Exception Handling

Issue: Log indent grows, memory leak, 100% CPU when exceptions caught

Fix: Added try-finally wrapper to ensure cleanup always happens

Tests: 6 tests added

Bug #33: JavaScript/AJAX Support

Issue: No JavaScript execution for dynamic pages (SPAs)

Fix: Reimplemented with HtmlUnit 2.70.0

Tests: 15 tests added, 51% coverage

Bug #39: Binary MIME Type Detection

Issue: Binary content corrupted when server sends wrong Content-Type

Fix: Auto-detection of binary MIME types (image/*, video/*, audio/*)

Tests: 4 validation tests

Bug #13: RSS/XML Feed Detection

Issue: application/rss+xml not recognized as text

Fix: Inverse approach - detect binary, default to text

Tests: 8 verification tests

See Also

Full Bug Database

BUGS_FIXED.md - Complete details for all bugs

Release Notes

News Page - Full v2.2.0 release notes