Critical bugs addressed using Test-Driven Development
WebHarvest 2.2.0 includes fixes for binary file downloads, exception handling, and JavaScript support - all with comprehensive test coverage.
All bugs fixed with TDD approach
Comprehensive test suite with 100% pass rate
Fixed in v2.2.0 with complete test coverage
All existing functionality maintained
Detailed information in BUGS_FIXED.md
Issue: RAR/ZIP/PDF files corrupted when saved with type="binary"
Fix: FilePlugin now uses NodeVariable.toBinary() for binary writes
Tests: 7 tests added
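A minimal sketch of why this class of bug corrupts archives and PDFs, assuming the corruption came from passing the bytes through a text representation before writing. The class and method names (BinaryWriteSketch, viaString, writeBinary) are hypothetical and are not taken from FilePlugin; only the JDK is used.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;

// Hypothetical illustration, not WebHarvest code.
class BinaryWriteSketch {

    // Lossy path: bytes are decoded to text and re-encoded before writing.
    // Byte sequences that are not valid UTF-8 are replaced during decoding.
    static byte[] viaString(byte[] content) {
        String asText = new String(content, StandardCharsets.UTF_8);
        return asText.getBytes(StandardCharsets.UTF_8);
    }

    // Safe path: bytes are written exactly as received, with no charset step.
    static void writeBinary(Path target, byte[] content) throws IOException {
        Files.write(target, content);
    }

    public static void main(String[] args) throws IOException {
        // ZIP magic bytes followed by three bytes that are not valid UTF-8.
        byte[] original = {0x50, 0x4B, 0x03, 0x04, (byte) 0xFF, (byte) 0xFE, (byte) 0x80};

        Path file = Files.createTempFile("sketch", ".bin");
        try {
            writeBinary(file, original);
            System.out.println("raw write intact: "
                    + Arrays.equals(original, Files.readAllBytes(file)));
            System.out.println("string round trip intact: "
                    + Arrays.equals(original, viaString(original)));
        } finally {
            Files.deleteIfExists(file);
        }
    }
}
```

Plain ASCII content survives the text round trip unchanged, which is one reason this kind of bug tends to show up only with genuinely binary formats.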
Issue: When exceptions are caught, log indentation keeps growing, memory leaks, and CPU usage reaches 100%
Fix: Added try-finally wrapper to ensure cleanup always happens
Tests: 6 tests added
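The try-finally pattern named in the fix can be sketched as follows. This is an illustration under assumptions, not the WebHarvest source: the class IndentCleanupSketch and its runStep method are hypothetical, and the indent counter stands in for whatever per-step state has to be released.

```java
// Hypothetical illustration, not WebHarvest code.
class IndentCleanupSketch {
    private int indent = 0;

    int indent() {
        return indent;
    }

    // Runs one step at a deeper indent level. The finally block restores
    // the level whether the body returns normally or throws.
    void runStep(Runnable body) {
        indent++;
        try {
            body.run();
        } finally {
            indent--;
        }
    }

    public static void main(String[] args) {
        IndentCleanupSketch logger = new IndentCleanupSketch();
        for (int i = 0; i < 3; i++) {
            try {
                logger.runStep(() -> {
                    throw new IllegalStateException("step failed");
                });
            } catch (IllegalStateException caught) {
                // The caller handles the error; cleanup has already run.
            }
        }
        System.out.println("indent after 3 failures: " + logger.indent());
    }
}
```

Without the finally block, each caught exception would skip the decrement and leave the indent one level deeper, which matches the growth described in the issue.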
Issue: No JavaScript execution for dynamic pages (SPAs)
Fix: Reimplemented with HtmlUnit 2.70.0
Tests: 15 tests added, 51% coverage
Issue: Binary content corrupted when server sends wrong Content-Type
Fix: Auto-detection of binary MIME types (image/*, video/*, audio/*)
Tests: 4 validation tests
Issue: application/rss+xml not recognized as text
Fix: Inverted the check - detect known binary types, treat everything else as text
Tests: 8 verification tests
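The "detect binary, default to text" idea from the two fixes above can be sketched like this. It is a rough illustration, not the project's implementation: ContentTypeSketch and isBinary are hypothetical names, and the exact application/* entries are assumptions added for the example. Only the image/*, video/* and audio/* families come from the fix description.

```java
import java.util.Locale;

// Hypothetical illustration, not WebHarvest code.
class ContentTypeSketch {

    // Families named in the fix description.
    private static final String[] BINARY_PREFIXES = {"image/", "video/", "audio/"};

    // Assumed extras for the example; the real list may differ.
    private static final String[] BINARY_TYPES = {
        "application/pdf", "application/zip", "application/octet-stream"
    };

    // Returns true only for types known to be binary. Anything else,
    // including unknown or missing types, is treated as text.
    static boolean isBinary(String contentType) {
        if (contentType == null) {
            return false;
        }
        // Drop parameters such as "; charset=UTF-8" and normalize case.
        String mime = contentType.split(";", 2)[0].trim().toLowerCase(Locale.ROOT);
        for (String prefix : BINARY_PREFIXES) {
            if (mime.startsWith(prefix)) {
                return true;
            }
        }
        for (String type : BINARY_TYPES) {
            if (mime.equals(type)) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        System.out.println("image/png binary: " + isBinary("image/png"));
        System.out.println("application/rss+xml binary: " + isBinary("application/rss+xml"));
    }
}
```

Defaulting to text means a type such as application/rss+xml needs no entry of its own, whereas a text allowlist would have to name every XML-based or text-like type in advance.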
BUGS_FIXED.md - Complete details for all bugs
News Page - Full v2.2.0 release notes