Ready-to-use scraping configurations
Professional examples organized by category: Web & HTTP, Data Extraction, Data Transformation, Control Flow, and File Operations.
java -jar webharvest-cli.jar config.xml
) or IDEHTTP requests, HTML parsing, web scraping
Basic HTTP request with HTML parsing
Extract product info from Amazon-like sites
Advanced techniques with error handling
Follow links and crawl multiple pages
Extract articles from news websites
Call REST APIs and process responses
XPath, XQuery, regex-based data extraction
Complex XPath queries and data extraction
Advanced XML querying and transformation
Extract structured product data
Download images from search results
Extract and analyze social media data
Price tracking and inventory monitoring
JSON, XML, CSV conversion and processing
Loops, conditions, functions, error handling
Database, Mail, FTP, Browser automation
JDBC connections, SQL queries, result processing
SMTP email with HTML templates
FTP file transfer operations
Headless browser with JavaScript support
Complete list of downloadable configurations
Basic HTTP test configuration
Advanced search and extraction
Amazon product scraping
REST API integration
Variable usage examples
XML canonicalization
Namespace handling
Web crawler with link following
Complete ETL pipeline
Database plugin example
Price and inventory monitoring
Flickr photo scraping
FTP plugin example
Custom function examples
Google Images scraping
New York Times articles
Mail plugin example
Modern scraping techniques
Product catalog extraction
Social analytics
Browser plugin example
XQuery transformations
Yahoo Mail integration