57 production-ready plugins for web scraping
47 core plugins built-in, 10 optional extensions for specialized needs. Modern architecture with automatic discovery and dependency injection.
Complete reference for all 57 plugins with examples, parameters, and usage details.
View Plugin Reference →New to WebHarvest? Start with our step-by-step tutorial and examples.
Start Tutorial →47 core plugins built-in, optional extensions when you need them
WebHarvest delivers 47 production-ready core plugins right out of the box - everything you need for web scraping with zero external dependencies.
The framework's modular architecture allows you to add 10 optional extension modules only when your project needs them:
Why this architecture? This keeps your core lightweight and lets you add only the dependencies you actually need - smaller JARs, faster startup, and reduced complexity.
Ready to use immediately
Everything you need for web scraping:
Optional add-ons with specific dependencies
Specialized capabilities for enterprise use:
Proven in production, trusted by developers worldwide
Optional modules with heavy dependencies - add only what your project needs
Ready to extend your scraping power? Each module is a single Maven dependency that adds specialized enterprise capabilities to your WebHarvest projects.
Tip: Add only the modules you need - keep your project lightweight and focused.
Module: webharvest-database
Connect to any database seamlessly. Execute SQL queries, store scraped data, and integrate with your existing data infrastructure. Perfect for data warehouses, analytics pipelines, and enterprise applications.
Module: webharvest-ftp
Secure file transfers made simple. Upload scraped data, download resources, and manage files on remote servers. Essential for data pipelines, backup systems, and distributed scraping architectures.
Module: webharvest-mail
Automate email workflows with style. Send notifications, reports, and alerts with rich HTML formatting. Perfect for monitoring systems, data delivery pipelines, and automated customer communications.
Module: webharvest-zip
Compress, organize, and distribute data efficiently. Create archives from scraped content, extract downloaded files, and manage data packages. Essential for backup systems, data distribution, and storage optimization.
Module: webharvest-webbrowser
Handle modern web applications with ease. Execute JavaScript, interact with dynamic content, and scrape Single Page Applications (SPAs). Perfect for React, Vue, Angular apps, and sites with complex AJAX interactions.
Auto-discovery, registration, and dependency injection
@Autoscanned
+ @Definition
@Inject
InjectorHelper
for manual lookupExtend WebHarvest with your own functionality using the modern @CorePlugin architecture
Learn @CorePlugin architecture • Automatic discovery • Real examples • Best practices
From zero to scraping in minutes
Begin with the 47 built-in plugins. No setup required - just start scraping immediately.
Need databases or email? Add the modules you need with a single Maven dependency.
Create your own plugins with the @CorePlugin annotation. Integrate any API or service.
Download extension modules or build your own custom plugins
All Plugins • Apache License 2.0 • Production Ready