A B C D E F G H I J K L M N O P Q R S T U V W X Y Z 
All Classes All Packages

A

AbstractConfigSource - Class in org.webharvest.definition
Base class for other more specialized implementations of ConfigSource interface.
AbstractConfigSource() - Constructor for class org.webharvest.definition.AbstractConfigSource
 
AbstractDatabasePlugin - Class in org.webharvest.runtime.processors.plugins.db
 
AbstractDatabasePlugin() - Constructor for class org.webharvest.runtime.processors.plugins.db.AbstractDatabasePlugin
 
AbstractElementDef - Class in org.webharvest.definition
 
AbstractElementDef(XmlNode) - Constructor for class org.webharvest.definition.AbstractElementDef
 
AbstractProcessor<TDef extends IElementDef> - Class in org.webharvest.runtime.processors
Base processor that contains common processor logic.
AbstractProcessor() - Constructor for class org.webharvest.runtime.processors.AbstractProcessor
 
AbstractProcessor(ProcessorExecutionStrategy) - Constructor for class org.webharvest.runtime.processors.AbstractProcessor
Constructor with custom execution strategy (for testing).
AbstractProcessorDecorator<TDef extends IElementDef> - Class in org.webharvest.runtime.processors
Abstract implementation of Processor interface which realizes Decorator design pattern and overrides almost all Processor's methods (without #run() method) in the default way.
AbstractProcessorDecorator(Processor<TDef>) - Constructor for class org.webharvest.runtime.processors.AbstractProcessorDecorator
 
AbstractRefreshableResolver - Class in org.webharvest.definition
Abstract ConfigurableResolver implementation serving as a base class for definition resolvers supposed to support multiple refresh operations.
AbstractRefreshableResolver(Provider<ElementsRegistry>) - Constructor for class org.webharvest.definition.AbstractRefreshableResolver
 
AbstractRegistry<K,​V> - Class in org.webharvest
Abstract class implementing Registry interface.
AbstractRegistry() - Constructor for class org.webharvest.AbstractRegistry
 
accept(ConfigLocationVisitor) - Method in interface org.webharvest.definition.ConfigLocationVisitor.VisitableLocation
adaptFilename(String) - Static method in class org.webharvest.utils.CommonUtil
 
add(IElementDef) - Method in class org.webharvest.definition.AbstractElementDef
 
addAttribute(String, String, String) - Method in class org.webharvest.definition.XmlNode
Adds new attribute with specified name and value.
addDriverResource(URI) - Method in enum org.webharvest.runtime.database.DefaultDriverManager
Adds resource containing database driver (in most cases it is a JAR file).
addDriverResource(URI) - Method in interface org.webharvest.runtime.database.DriverManager
Adds resource containing database driver (in most cases it is a JAR file).
addElement(String) - Method in class org.webharvest.definition.XmlNode
Adds new textBuff to element list
addElement(XmlNode) - Method in class org.webharvest.definition.XmlNode
Adds new subelement.
addEventHandler(EventHandler<?>) - Method in interface org.webharvest.Harvest
Register event handler.
addEventHandler(EventHandler<?>) - Method in class org.webharvest.runtime.DefaultHarvest
Register event handler.
addFunctionDef(FunctionDef) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
Deprecated.
addFunctionDef(FunctionDef) - Method in interface org.webharvest.runtime.DynamicScopeContext
Deprecated.
addFunctionDef(FunctionDef) - Method in class org.webharvest.runtime.ScraperContext
Deprecated.
addFunctionParam(String, Variable) - Method in class org.webharvest.runtime.processors.CallProcessor
Adds parameter of function which is going to call.
addHttpHeader(String, String) - Method in class org.webharvest.runtime.processors.HttpProcessor
 
addHttpParam(String, boolean, String, String, Variable) - Method in class org.webharvest.runtime.processors.HttpProcessor
 
addListener(ScrapingAware) - Method in class org.webharvest.ioc.ScrapingInterceptor.ScrapingAwareHelper
Register ScrapingAware listener.
addPostProcessor(ResolverPostProcessor) - Method in class org.webharvest.definition.AbstractRefreshableResolver
Adds new ResolverPostProcessor which is going to be applied on the current resolver instance on the resolver refresh.
addPostProcessor(ResolverPostProcessor) - Method in interface org.webharvest.definition.ConfigurableResolver
Adds new ResolverPostProcessor which is going to be applied on the current resolver instance on the resolver refresh.
addPostProcessor(SchemaResolverPostProcessor) - Method in class org.webharvest.definition.validation.SchemaFactoryImpl
Adds new SchemaResolverPostProcessor which is going to be applied on the current resolver instance on the resolver refresh.
addPostProcessor(SchemaResolverPostProcessor) - Method in interface org.webharvest.definition.validation.SchemaResolver
Adds new SchemaResolverPostProcessor which is going to be applied on the current resolver instance on the resolver refresh.
addVariable(Variable) - Method in class org.webharvest.runtime.variables.ListVariable
 
ALLOWED_PARAM_TYPES - Static variable in class org.webharvest.runtime.processors.XQueryProcessor
 
AlreadyBoundException - Exception in org.webharvest
Checked exception thrown if object with the given name is already bound in the registry.
AlreadyBoundException(String) - Constructor for exception org.webharvest.AlreadyBoundException
AlreadyBoundException constructor accepting explanation message.
AnnotatedPluginsPostProcessor - Class in org.webharvest.definition
ResolverPostProcessor implementation capable of scanning Java class path looking for Autoscanned web harvest plugins.
AnnotatedPluginsPostProcessor(String) - Constructor for class org.webharvest.definition.AnnotatedPluginsPostProcessor
Constructs AnnotatedPluginsPostProcessor accepting name of java package which is going to be scanned for the web harvest plugins.
AnnotatedPluginsPostProcessor(String, PluginRegistrationStrategy) - Constructor for class org.webharvest.definition.AnnotatedPluginsPostProcessor
Constructs AnnotatedPluginsPostProcessor with custom registration strategy.
AnnotatedPluginsPostProcessorBuilder - Class in org.webharvest.definition
Builder for creating AnnotatedPluginsPostProcessor instances.
annotatedWith(Class<? extends Annotation>) - Static method in class org.webharvest.ioc.TypeMatchers
 
apos - Static variable in class org.webharvest.utils.SystemUtilities
 
appendBinary(Variable) - Method in class org.webharvest.runtime.processors.FileProcessor
 
areAllTagsAllowed() - Method in class org.webharvest.definition.ElementInfo
 
Assert - Class in org.webharvest.utils
Created by IntelliJ IDEA.
Assert() - Constructor for class org.webharvest.utils.Assert
 
AttributeHolder - Interface in org.webharvest.ioc
Implementors of this interface can serve as the backing store for Objects that are scoped within an (subclass of) AttributeHolderScope.
AttributeHolderScope<AHT extends AttributeHolder> - Class in org.webharvest.ioc
A Scope that uses an AttributeHolder as the backing store for its scoped objects.
AttributeHolderScope() - Constructor for class org.webharvest.ioc.AttributeHolderScope
 
AuthSSLInitializationError - Error in org.apache.commons.httpclient.contrib.ssl
Signals fatal error in initialization of AuthSSLProtocolSocketFactory.
AuthSSLInitializationError() - Constructor for error org.apache.commons.httpclient.contrib.ssl.AuthSSLInitializationError
Creates a new AuthSSLInitializationError.
AuthSSLInitializationError(String) - Constructor for error org.apache.commons.httpclient.contrib.ssl.AuthSSLInitializationError
Creates a new AuthSSLInitializationError with the specified message.
AuthSSLProtocolSocketFactory - Class in org.apache.commons.httpclient.contrib.ssl
AuthSSLProtocolSocketFactory can be used to validate the identity of the HTTPS server against a list of trusted certificates and to authenticate to the HTTPS server using a private key.
AuthSSLProtocolSocketFactory(URL, String, URL, String) - Constructor for class org.apache.commons.httpclient.contrib.ssl.AuthSSLProtocolSocketFactory
Constructor for AuthSSLProtocolSocketFactory.
AuthSSLX509TrustManager - Class in org.apache.commons.httpclient.contrib.ssl
AuthSSLX509TrustManager can be used to extend the default X509TrustManager with additional trust decisions.
AuthSSLX509TrustManager(X509TrustManager) - Constructor for class org.apache.commons.httpclient.contrib.ssl.AuthSSLX509TrustManager
Constructor for AuthSSLX509TrustManager.
Autoscanned - Annotation Type in org.webharvest.runtime.processors.plugins
Indicates that the annotated plugin should be automatically registered in system.

B

backspace - Static variable in class org.webharvest.utils.SystemUtilities
 
BaseException - Exception in org.webharvest.exception
Basic Scraper exception.
BaseException() - Constructor for exception org.webharvest.exception.BaseException
 
BaseException(String) - Constructor for exception org.webharvest.exception.BaseException
 
BaseException(String, Throwable) - Constructor for exception org.webharvest.exception.BaseException
 
BaseException(Throwable) - Constructor for exception org.webharvest.exception.BaseException
 
BaseTemplater - Class in org.webharvest.runtime.templaters
Simple templater - replaces ${expression} sequences in string with evaluated expressions.
BaseTemplater() - Constructor for class org.webharvest.runtime.templaters.BaseTemplater
 
BEANSHELL - org.webharvest.runtime.scripting.ScriptingLanguage
 
bind(K, V) - Method in class org.webharvest.AbstractRegistry
Binds value under the given name.
bind(K, V) - Method in class org.webharvest.LockedRegistry
Binds value under the given name.
bind(K, V) - Method in interface org.webharvest.Registry
Binds value under the given name.
bindConfigParser() - Method in class org.webharvest.ioc.ScraperModule
 
bindDBConnectionFactory() - Method in class org.webharvest.ioc.ScraperModule
 
bindDebugFileLogger() - Method in class org.webharvest.ioc.ScraperModule
 
bindScraperContext() - Method in class org.webharvest.ioc.ScraperModule
 
bindStatusHolder() - Method in class org.webharvest.ioc.ScraperModule
 
body() - Method in annotation type org.webharvest.annotation.Definition
 
BodyProcessor - Class in org.webharvest.runtime.processors
Processor which executes only body and returns variables list.
BodyProcessor() - Constructor for class org.webharvest.runtime.processors.BodyProcessor
 
BodyProcessor.Builder - Class in org.webharvest.runtime.processors
A builder responsible for creating instance of BodyProcessor and completing it with appropriate IElementDef.
BufferConfigSource - Class in org.webharvest.definition
Implementation of ConfigSource that uses plain old string as source of configuration/
BufferConfigSource(String) - Constructor for class org.webharvest.definition.BufferConfigSource
Class constructor expecting plain old string configuration.
BufferConfigSource(String, ConfigSource.Location) - Constructor for class org.webharvest.definition.BufferConfigSource
Class constructor expecting both the XML configuration as well as configuration location (eg. in situation when the original ConfigSource object has been modified and such a change should be reflected in new immutable state.
build() - Method in class org.webharvest.definition.AnnotatedPluginsPostProcessorBuilder
Builds the AnnotatedPluginsPostProcessor instance.
build() - Method in class org.webharvest.definition.PluginDefinitionBuilder
Builds the WebHarvestPluginDef instance.
build() - Method in class org.webharvest.runtime.processors.BodyProcessor.Builder
Returns an instance of BodyProcessor which is completed with element definition.
build() - Method in class org.webharvest.runtime.web.HttpClientManager.ProxySettings.Builder
 
Builder(String) - Constructor for class org.webharvest.runtime.web.HttpClientManager.ProxySettings.Builder
 
Builder(IElementDef) - Constructor for class org.webharvest.runtime.processors.BodyProcessor.Builder
Default builder constructor which accepts IElementDef for BodyProcessor.

C

Cache<K,​V> - Interface in org.webharvest
Cache containing semi-persistent, key-value mappings.
CallDef - Class in org.webharvest.definition
Definition of function call.
CallDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.CallDef
 
CallParamDef - Class in org.webharvest.definition
Definition of function call parameter.
CallParamDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.CallParamDef
 
CallParamProcessor - Class in org.webharvest.runtime.processors
Variable definition http param processor.
CallParamProcessor() - Constructor for class org.webharvest.runtime.processors.CallParamProcessor
 
CallProcessor - Class in org.webharvest.runtime.processors
Function call processor.
CallProcessor() - Constructor for class org.webharvest.runtime.processors.CallProcessor
 
CallProcessor10 - Class in org.webharvest.deprecated.runtime.processors
Function call processor.
CallProcessor10() - Constructor for class org.webharvest.deprecated.runtime.processors.CallProcessor10
 
CaseDef - Class in org.webharvest.definition
Definition of case processor.
CaseDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.CaseDef
 
CaseProcessor - Class in org.webharvest.runtime.processors
Conditional processor.
CaseProcessor() - Constructor for class org.webharvest.runtime.processors.CaseProcessor
 
CatchProcessor - Class in org.webharvest.runtime.processors
 
CatchProcessor() - Constructor for class org.webharvest.runtime.processors.CatchProcessor
 
charset - Variable in class org.webharvest.runtime.web.HttpInfo
 
checkClientTrusted(X509Certificate[], String) - Method in class org.apache.commons.httpclient.contrib.ssl.AuthSSLX509TrustManager
 
checkClientTrusted(X509Certificate[], String) - Method in class org.apache.commons.httpclient.contrib.ssl.EasyX509TrustManager
 
checkServerTrusted(X509Certificate[], String) - Method in class org.apache.commons.httpclient.contrib.ssl.AuthSSLX509TrustManager
 
checkServerTrusted(X509Certificate[], String) - Method in class org.apache.commons.httpclient.contrib.ssl.EasyX509TrustManager
 
ClassLoaderUtil - Class in org.webharvest.utils
Class loading utility - used for loading JDBC driver classes and plugin classes.
ClassPathScanner - Interface in org.webharvest.utils
Component capable of scanning types available on Java class path that meet certain criteria.
ClassPathScannerImpl - Class in org.webharvest.utils
ClassPathScanner interface implementation using Reflections library under the hood.
ClassPathScannerImpl(String) - Constructor for class org.webharvest.utils.ClassPathScannerImpl
Constructor accepting name of the java package.
client - Variable in class org.webharvest.runtime.web.HttpInfo
 
close() - Method in class org.webharvest.runtime.web.HttpResponseWrapper
 
CommonUtil - Class in org.webharvest.utils
Basic evaluation utilities
CommonUtil() - Constructor for class org.webharvest.utils.CommonUtil
 
CommonUtil.IntPair - Class in org.webharvest.utils
Contains pair of integer values
compareTo(Object) - Method in class org.webharvest.definition.ElementName
 
Config - Interface in org.webharvest.definition
Represents lazily loaded configuration object that is loaded from associated ConfigSource on demand with call to Config.reload() method.
Config.Version - Enum in org.webharvest.definition
 
ConfigDef - Class in org.webharvest.definition
Web Harvest's definition of the <config> element.
ConfigDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.ConfigDef
Default class constructor which obtains from given XmlNode attributes such as charset and scripting language.
ConfigFactory - Interface in org.webharvest.definition
Represents simple Config factory.
ConfigLocationVisitor - Interface in org.webharvest.definition
Represents object implementing Visitor patter.
ConfigLocationVisitor.VisitableLocation - Interface in org.webharvest.definition
Extended version of ConfigSource.Location interface that closely cooperate with ConfigLocationVisitor in order to take special action on certain ConfigSource.Location instance of which we haev lost a type.
ConfigParser - Interface in org.webharvest.definition
Created by rba on 16.07.2017.
ConfigProcessor - Class in org.webharvest.runtime.processors
 
ConfigProcessor() - Constructor for class org.webharvest.runtime.processors.ConfigProcessor
 
ConfigSource - Interface in org.webharvest.definition
Represents source of XML configuration object.
ConfigSource.Location - Interface in org.webharvest.definition
Just a marker interface to indicate the actual type of location eg. file, url or any other
ConfigSourceFactory - Interface in org.webharvest.definition
Represents factory object capable to instantiate ConfigSource objects either from URL, File or just raw XML configuration.
ConfigurableResolver - Interface in org.webharvest.definition
Interface to be implemented by the web harvest configuration elements definition resolvers.
ConfigurationException - Exception in org.webharvest.exception
Configuration exception - occurred during parsing configuration file etc.
ConfigurationException() - Constructor for exception org.webharvest.exception.ConfigurationException
 
ConfigurationException(String) - Constructor for exception org.webharvest.exception.ConfigurationException
 
ConfigurationException(String, Throwable) - Constructor for exception org.webharvest.exception.ConfigurationException
 
ConfigurationException(Throwable) - Constructor for exception org.webharvest.exception.ConfigurationException
 
configure() - Method in class org.webharvest.ioc.HttpModule
 
configure() - Method in class org.webharvest.ioc.ScraperModule
ConnectionFactory - Interface in org.webharvest.runtime.database
Factory responsible for creating SQL Connections basis on the provided parameters.
ConnectionProxy - Interface in org.webharvest.runtime.database
Connection subinterface implemented by Connection proxies.
ConstantDef - Class in org.webharvest.definition
Definition of constant processor.
ConstantDef(String, Class<? extends Processor>) - Constructor for class org.webharvest.definition.ConstantDef
 
ConstantProcessor - Class in org.webharvest.runtime.processors
Constant processor.
ConstantProcessor() - Constructor for class org.webharvest.runtime.processors.ConstantProcessor
 
CONTAINER_PREFIX - Static variable in class org.webharvest.runtime.database.JNDIConnectionFactory
Environment-related component's binding subtree used as default by JavaEE container.
contains(Object) - Method in class org.webharvest.runtime.variables.ListVariable
Checks if list contains specified object's string representation
contains(K) - Method in interface org.webharvest.Cache
Returns {@link true} if Cache contains object identified by the given key.
contains(K) - Method in class org.webharvest.ThreadLocalCache
Returns {@link true} if Cache contains object identified by the given key.
containsVar(String) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
Deprecated.
 
containsVar(String) - Method in interface org.webharvest.runtime.DynamicScopeContext
 
containsVar(String) - Method in class org.webharvest.runtime.ScraperContext
 
contentLength - Variable in class org.webharvest.runtime.web.HttpInfo
 
ContextFactory - Interface in org.webharvest.ioc
Factory creating brand new instances of DynamicScopeContext.
ContextHolder - Interface in org.webharvest.runtime
Component responsible for providing current DynamicScopeContext which is used by the Scraper.
convertIntoJndiName(String) - Method in class org.webharvest.runtime.database.JNDIConnectionFactory
Converts provided name to conform java enterprise JNDI naming policy.
countChars(String, char, int, int) - Static method in class org.webharvest.utils.CommonUtil
Counts number of specified characters in give text.
cr - Static variable in class org.webharvest.utils.SystemUtilities
 
create() - Static method in class org.webharvest.definition.AnnotatedPluginsPostProcessorBuilder
Creates a new builder instance.
create() - Static method in class org.webharvest.definition.PluginDefinitionBuilder
Creates a new builder instance.
create(File) - Method in interface org.webharvest.definition.ConfigSourceFactory
Create new instance of ConfigSource from File.
create(String) - Method in interface org.webharvest.definition.ConfigSourceFactory
Create new instance of ConfigSource from raw XML content.
create(URL) - Method in interface org.webharvest.definition.ConfigSourceFactory
Create new instance of ConfigSource from URL.
create(Config) - Method in interface org.webharvest.ioc.ContextFactory
Returns brand new instance of DynamicScopeContext.
create(Config) - Method in interface org.webharvest.ioc.HarvesterFactory
Produces new instance of Harvester for specified Config.
create(ConfigSource) - Method in interface org.webharvest.definition.ConfigFactory
Produces new instance of Config for specified ConfigSource resource.
create(DynamicScopeContext) - Static method in class org.webharvest.runtime.NestedContextFactory
Creates new instance of DynamicScopeContext.
createContext() - Method in class org.webharvest.runtime.database.JNDIConnectionFactory
 
createDataSourceOfVariable(Variable, String, String) - Static method in class org.webharvest.runtime.processors.plugins.mail.MailPlugin
 
createElementDefinition(XmlNode) - Method in interface org.webharvest.definition.ConfigurableResolver
Creates proper element definition instance based on given xml node from input configuration.
createElementDefinition(XmlNode) - Method in class org.webharvest.definition.DefinitionResolver
Creates proper element definition instance based on given xml node from input configuration.
createListOfXmlNodes(XQueryExpression, DynamicQueryContext) - Static method in class org.webharvest.utils.XmlUtil
Creates list variable of resulting XML nodes.
createPlugin() - Method in interface org.webharvest.definition.PluginDef
 
createPlugin() - Method in class org.webharvest.definition.WebHarvestPluginDef
createPlugin(Class<? extends Processor>, IElementDef) - Static method in class org.webharvest.definition.PluginFactory
Creates a new plugin instance with proper dependency injection.
createProcessor(DynamicScopeContext, TDef) - Method in class org.webharvest.runtime.processors.DefaultProcessorFactory
createProcessor(DynamicScopeContext, TDef) - Method in interface org.webharvest.runtime.processors.ProcessorFactory
 
createSocket(String, int) - Method in class org.apache.commons.httpclient.contrib.ssl.AuthSSLProtocolSocketFactory
 
createSocket(String, int) - Method in class org.apache.commons.httpclient.contrib.ssl.EasySSLProtocolSocketFactory
 
createSocket(String, int) - Method in class org.apache.commons.httpclient.contrib.ssl.StrictSSLProtocolSocketFactory
 
createSocket(String, int, InetAddress, int) - Method in class org.apache.commons.httpclient.contrib.ssl.AuthSSLProtocolSocketFactory
 
createSocket(String, int, InetAddress, int) - Method in class org.apache.commons.httpclient.contrib.ssl.EasySSLProtocolSocketFactory
 
createSocket(String, int, InetAddress, int) - Method in class org.apache.commons.httpclient.contrib.ssl.StrictSSLProtocolSocketFactory
 
createSocket(String, int, InetAddress, int, HttpConnectionParams) - Method in class org.apache.commons.httpclient.contrib.ssl.AuthSSLProtocolSocketFactory
Attempts to get a new socket connection to the given host within the given time limit.
createSocket(String, int, InetAddress, int, HttpConnectionParams) - Method in class org.apache.commons.httpclient.contrib.ssl.EasySSLProtocolSocketFactory
Attempts to get a new socket connection to the given host within the given time limit.
createSocket(String, int, InetAddress, int, HttpConnectionParams) - Method in class org.apache.commons.httpclient.contrib.ssl.StrictSSLProtocolSocketFactory
Attempts to get a new socket connection to the given host within the given time limit.
createSocket(Socket, String, int, boolean) - Method in class org.apache.commons.httpclient.contrib.ssl.AuthSSLProtocolSocketFactory
 
createSocket(Socket, String, int, boolean) - Method in class org.apache.commons.httpclient.contrib.ssl.EasySSLProtocolSocketFactory
 
createSocket(Socket, String, int, boolean) - Method in class org.apache.commons.httpclient.contrib.ssl.StrictSSLProtocolSocketFactory
 
createVariable(Object) - Static method in class org.webharvest.utils.CommonUtil
Creates appropriate AbstractVariable instance for the specified object.

D

DatabaseException - Exception in org.webharvest.exception
Database execution exception - occured during executing database processor.
DatabaseException() - Constructor for exception org.webharvest.exception.DatabaseException
 
DatabaseException(String) - Constructor for exception org.webharvest.exception.DatabaseException
 
DatabaseException(String, Throwable) - Constructor for exception org.webharvest.exception.DatabaseException
 
DatabaseException(Throwable) - Constructor for exception org.webharvest.exception.DatabaseException
 
DatabaseJNDIPlugin - Class in org.webharvest.runtime.processors.plugins.db
Web Harvest plugin supporting database operations within JNDI - enabled environment.
DatabaseJNDIPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.db.DatabaseJNDIPlugin
 
DatabasePlugin - Class in org.webharvest.runtime.processors.plugins.db
Support for database operations.
DatabasePlugin() - Constructor for class org.webharvest.runtime.processors.plugins.db.DatabasePlugin
 
date() - Method in class org.webharvest.utils.SystemUtilities
Returns current date formatted as "yyyyMMdd".
datetime(Object) - Method in class org.webharvest.utils.SystemUtilities
Returns formatted date/time for specified format string.
DbColumnDescription - Class in org.webharvest.runtime.processors.plugins.db
Information about database record columns.
DbColumnDescription(String, int) - Constructor for class org.webharvest.runtime.processors.plugins.db.DbColumnDescription
 
DbParamPlugin - Class in org.webharvest.runtime.processors.plugins.db
DB param plugin - can be used only inside database plugin.
DbParamPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.db.DbParamPlugin
 
DbRowVariable - Class in org.webharvest.runtime.processors.plugins.db
Special variable used for database records
DbRowVariable(DbColumnDescription[], Object[]) - Constructor for class org.webharvest.runtime.processors.plugins.db.DbRowVariable
 
debug(IElementDef, DynamicScopeContext, Variable) - Method in class org.webharvest.runtime.processors.AbstractProcessor
 
DebugFileLogger - Annotation Type in org.webharvest.ioc
 
decoratedProcessor - Variable in class org.webharvest.runtime.processors.AbstractProcessorDecorator
 
DEFAULT_CONTENT_TYPE - Static variable in class org.webharvest.definition.HttpDef
 
DEFAULT_MAX_LOOPS - Static variable in class org.webharvest.WHConstants
 
DEFAULT_METHOD - Static variable in class org.webharvest.definition.HttpDef
 
DEFAULT_PARAM_TYPE - Static variable in class org.webharvest.runtime.processors.XQueryProcessor
 
DEFAULT_USER_AGENT - Static variable in class org.webharvest.runtime.web.HttpClientManager
 
DefaultDriverManager - Enum in org.webharvest.runtime.database
Default implementation of the DriverManger interface.
DefaultHandlerHolder - Class in org.webharvest.events
Default implementation of HandlerHolder storing references to registered EventHandler listeners.
DefaultHandlerHolder(Registry<Harvester, EventBus>, Provider<EventBus>) - Constructor for class org.webharvest.events.DefaultHandlerHolder
Default class constructor specifying the Registry storing associations between Harvester's scope and particular EventBus that is bound to it.
DefaultHarvest - Class in org.webharvest.runtime
Default implementation of Harvest interface.
DefaultHarvest(ConfigFactory, HarvesterFactory, HandlerHolder, EventSink) - Constructor for class org.webharvest.runtime.DefaultHarvest
Default class constructor specifying HarvesterFactory, HandlerHolder and EventSink that are expected to be Guice injected.
DefaultPluginRegistrationStrategy - Class in org.webharvest.definition
Default implementation of PluginRegistrationStrategy.
DefaultPluginRegistrationStrategy() - Constructor for class org.webharvest.definition.DefaultPluginRegistrationStrategy
 
DefaultProcessorExecutionStrategy - Class in org.webharvest.runtime.processors
Default implementation of processor execution strategy.
DefaultProcessorExecutionStrategy() - Constructor for class org.webharvest.runtime.processors.DefaultProcessorExecutionStrategy
 
DefaultProcessorFactory - Class in org.webharvest.runtime.processors
Created by rbala on 13.07.2017.
DefaultProcessorFactory() - Constructor for class org.webharvest.runtime.processors.DefaultProcessorFactory
 
defineFromString(String, char, int) - Method in class org.webharvest.utils.CommonUtil.IntPair
 
defineVariable(String, Object) - Method in class org.webharvest.utils.SystemUtilities
Adds or/replaces variable in scraper's context.
defineVariable(String, Object, boolean) - Method in class org.webharvest.utils.SystemUtilities
Adds or/replaces variable in scraper's context.
Definition - Annotation Type in org.webharvest.annotation
Annotation on Processor classes specifying the IElementDef this particular processor is described by.
definitionClass() - Method in annotation type org.webharvest.annotation.Definition
The definition type that the processor is mapped to.
DefinitionResolver - Class in org.webharvest.definition
Class contains information and logic to validate and crate definition classes for parsed xml nodes from Web-Harvest configurations.
DefinitionResolver() - Constructor for class org.webharvest.definition.DefinitionResolver
Constructor for non-Guice usage.
DefinitionResolver(Provider<ElementsRegistry>) - Constructor for class org.webharvest.definition.DefinitionResolver
 
DefinitionResolver(ElementsRegistry) - Constructor for class org.webharvest.definition.DefinitionResolver
Constructor for non-Guice usage with custom ElementsRegistry.
DefinitionResolverProvider - Class in org.webharvest.ioc
Created by rba on 22.07.2017.
DefinitionResolverProvider(List<? extends ResolverPostProcessor>, Provider<ElementsRegistry>) - Constructor for class org.webharvest.ioc.DefinitionResolverProvider
 
DefVarPlugin - Class in org.webharvest.runtime.processors.plugins.variable
Support for database operations.
DefVarPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.variable.DefVarPlugin
 
dependantProcessors() - Method in annotation type org.webharvest.annotation.Definition
Defines dependant subprocessors that are used inside this plugin and that will automatically be registered with this plugin.
doCall(DynamicScopeContext, Callable<Object>) - Method in class org.webharvest.deprecated.runtime.processors.CallProcessor10
 
doCall(DynamicScopeContext, Callable<Object>) - Method in class org.webharvest.runtime.processors.CallProcessor
 
doExecute(DynamicScopeContext, String, Variable) - Method in class org.webharvest.runtime.processors.plugins.variable.DefVarPlugin
 
doExecute(DynamicScopeContext, String, Variable) - Method in class org.webharvest.runtime.processors.plugins.variable.SetVarPlugin
 
DriverManager - Interface in org.webharvest.runtime.database
DriverManager allows to register database drivers placed in the arbitrary locations.
DynamicScopeContext - Interface in org.webharvest.runtime
Created by IntelliJ IDEA.

E

EasySSLProtocolSocketFactory - Class in org.apache.commons.httpclient.contrib.ssl
EasySSLProtocolSocketFactory can be used to creats SSL Sockets that accept self-signed certificates.
EasySSLProtocolSocketFactory() - Constructor for class org.apache.commons.httpclient.contrib.ssl.EasySSLProtocolSocketFactory
Constructor for EasySSLProtocolSocketFactory.
EasyX509TrustManager - Class in org.apache.commons.httpclient.contrib.ssl
EasyX509TrustManager unlike default X509TrustManager accepts self-signed certificates.
EasyX509TrustManager(KeyStore) - Constructor for class org.apache.commons.httpclient.contrib.ssl.EasyX509TrustManager
Constructor for EasyX509TrustManager.
elementDef - Variable in class org.webharvest.runtime.processors.AbstractProcessor
 
ElementInfo - Class in org.webharvest.definition
 
ElementInfo(String, Class<? extends IElementDef>, Class<? extends Processor>, String, String, boolean, Class[]) - Constructor for class org.webharvest.definition.ElementInfo
 
ElementInfoFactory - Class in org.webharvest.annotation
Deprecated.
ElementInfoPluginRegistrationStrategy - Class in org.webharvest.definition
ElementInfo-based implementation of PluginRegistrationStrategy.
ElementInfoPluginRegistrationStrategy() - Constructor for class org.webharvest.definition.ElementInfoPluginRegistrationStrategy
 
ElementName - Class in org.webharvest.definition
Name of the single element (tag in configuraton xml).
ElementName(String, String) - Constructor for class org.webharvest.definition.ElementName
 
ElementsRegistry - Interface in org.webharvest.definition
Registry containing all valid Web Harvest definition elements.
ElementsRegistryImpl - Class in org.webharvest.definition
Default implementation of ElementsRegistry interface.
ElementsRegistryImpl() - Constructor for class org.webharvest.definition.ElementsRegistryImpl
 
ElseProcessor - Class in org.webharvest.runtime.processors
 
ElseProcessor() - Constructor for class org.webharvest.runtime.processors.ElseProcessor
 
EMPTY - Static variable in class org.webharvest.utils.Stack
 
EmptyDef - Class in org.webharvest.definition
Definition of empty element.
EmptyDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.EmptyDef
 
EmptyProcessor - Class in org.webharvest.runtime.processors
Empty processor = executes body and returns empty variable.
EmptyProcessor() - Constructor for class org.webharvest.runtime.processors.EmptyProcessor
 
EmptyVariable - Class in org.webharvest.runtime.variables
Empty variable.
encodeUrl(String, String) - Static method in class org.webharvest.utils.CommonUtil
 
enter(AHT) - Method in class org.webharvest.ioc.AttributeHolderScope
Lets the current Thread enter this Scope.
EnvironmentException - Exception in org.webharvest.exception
Thrown when current JVM does not support a specific required feature.
EnvironmentException(Exception) - Constructor for exception org.webharvest.exception.EnvironmentException
 
equals(Object) - Method in class org.apache.commons.httpclient.contrib.ssl.EasySSLProtocolSocketFactory
 
equals(Object) - Method in class org.apache.commons.httpclient.contrib.ssl.StrictSSLProtocolSocketFactory
 
equals(Object) - Method in class org.webharvest.definition.ElementName
 
equals(Object) - Method in class org.webharvest.definition.validation.SchemaSource
Equals system identifiers of SchemaSource specified as parameter and this SchemaSource.
equals(Object) - Method in class org.webharvest.runtime.scripting.ScriptSource
 
ErrMsg - Class in org.webharvest.exception
Error messages utility
ErrMsg() - Constructor for class org.webharvest.exception.ErrMsg
 
error(String) - Method in class org.webharvest.utils.SystemUtilities
 
ERROR - org.webharvest.runtime.ScraperState
Scraper has finished its work with error.
escapeXml(Object) - Method in class org.webharvest.utils.SystemUtilities
Escapes XML string - special characters: &'"<> are replaced with XML escape sequences: & ' " < >
escapeXml(String) - Static method in class org.webharvest.utils.CommonUtil
Escapes XML string - special characters: &'"<> are replaced with XML escape sequences: & ' " < >
evaluate(DynamicScopeContext, ScriptSource) - Method in class org.webharvest.runtime.scripting.jsr.JSRScriptEngineAdapter
Executes script provided in ScriptSource.
evaluate(DynamicScopeContext, ScriptSource) - Method in interface org.webharvest.runtime.scripting.ScriptEngine
Executes script provided in ScriptSource.
evaluateAttribute(String, DynamicScopeContext) - Method in class org.webharvest.runtime.processors.WebHarvestPlugin
 
evaluateAttributeAsBoolean(String, boolean, DynamicScopeContext) - Method in class org.webharvest.runtime.processors.WebHarvestPlugin
 
evaluateAttributeAsDouble(String, double, DynamicScopeContext) - Method in class org.webharvest.runtime.processors.WebHarvestPlugin
 
evaluateAttributeAsInteger(String, int, DynamicScopeContext) - Method in class org.webharvest.runtime.processors.WebHarvestPlugin
 
evaluateToString(String, ScriptingLanguage, DynamicScopeContext) - Static method in class org.webharvest.runtime.templaters.BaseTemplater
 
evaluateToVariable(String, ScriptingLanguage, DynamicScopeContext) - Static method in class org.webharvest.runtime.templaters.BaseTemplater
 
evaluateXPath(String, String, RuntimeConfig) - Static method in class org.webharvest.utils.XmlUtil
Evaluates specified XPath expression against given XML text and using given runtime configuration.
evaluateXPath(String, Document) - Static method in class org.webharvest.utils.XmlUtil
 
EventBasedStatusHolder - Class in org.webharvest.runtime
StatusHolder implementation which bases on Scraper's events.
EventBasedStatusHolder(Monitor) - Constructor for class org.webharvest.runtime.EventBasedStatusHolder
Default class constructor which creates Monitor.Guard instances using given Monitor.
EventBusTypeListener - Class in org.webharvest.ioc
Guice TypeListener implementation that is responsible for registration of objects managed by Guice in singleton EventBus.
EventBusTypeListener() - Constructor for class org.webharvest.ioc.EventBusTypeListener
 
EventHandler<T> - Interface in org.webharvest.events
Represents an object that is handler for particular type of event.
EventSink - Interface in org.webharvest.events
Dispatches events to registered listeners.
execute(String, Boolean, String, String, String, String, String, Variable, Map<String, HttpParamInfo>, Map, int, long, double) - Method in class org.webharvest.runtime.web.HttpClientManager
 
execute(Harvester.ContextInitCallback) - Method in interface org.webharvest.Harvester
Execute scraping session in its own scope.
execute(Harvester.ContextInitCallback) - Method in class org.webharvest.runtime.ScrapingHarvester
Execute scraping session in its own scope.
execute(DynamicScopeContext) - Method in class org.webharvest.deprecated.runtime.processors.VarDefProcessor
Deprecated.
 
execute(DynamicScopeContext) - Method in class org.webharvest.deprecated.runtime.processors.VarProcessor
Deprecated.
 
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.AbstractProcessor
 
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.BodyProcessor
 
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.CallParamProcessor
 
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.CallProcessor
 
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.CaseProcessor
 
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.ConfigProcessor
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.ConstantProcessor
 
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.EmptyProcessor
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.ExitProcessor
 
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.FileProcessor
 
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.FunctionProcessor
 
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.HtmlToXmlProcessor
 
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.HttpHeaderProcessor
 
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.HttpParamProcessor
 
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.HttpProcessor
 
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.IncludeProcessor
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.LoopProcessor
 
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.RegexpProcessor
 
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.ReturnProcessor
 
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.ScriptProcessor
 
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.TemplateProcessor
 
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.TextProcessor
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.TryProcessor
 
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.WebHarvestPlugin
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.WhileProcessor
 
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.XPathProcessor
 
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.XQueryProcessor
 
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.XsltProcessor
 
execute(DynamicScopeContext) - Method in class org.webharvest.runtime.Scraper
 
execute(DynamicScopeContext) - Method in interface org.webharvest.runtime.WebScraper
 
execute(Processor<?>, DynamicScopeContext) - Method in class org.webharvest.runtime.processors.DefaultProcessorExecutionStrategy
 
execute(Processor<?>, DynamicScopeContext) - Method in interface org.webharvest.runtime.processors.ProcessorExecutionStrategy
Executes the processor with the given context.
executeBody(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.WebHarvestPlugin
Executes body of plugin processor
executeFunctionCall(Callable<R>) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
Deprecated.
 
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.CatchProcessor
Mathod that actually executes processor.
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.ElseProcessor
Mathod that actually executes processor.
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.IfProcessor
Mathod that actually executes processor.
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.ListProcessor
Mathod that actually executes processor.
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.db.AbstractDatabasePlugin
 
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.db.DbParamPlugin
 
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpDelPlugin
 
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpGetPlugin
 
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpListPlugin
 
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpMkdirPlugin
 
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpPlugin
 
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpPutPlugin
 
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpRmdirPlugin
 
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.JsonToXmlPlugin
 
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.mail.MailAttachPlugin
 
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.mail.MailPlugin
 
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.SleepPlugin
 
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.TokenizePlugin
 
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.ValueOfPlugin
 
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.variable.GetVarPlugin
 
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserJavascriptPlugin
 
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserLoadPlugin
 
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserPlugin
 
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserRenderPlugin
 
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.XmlToJsonPlugin
 
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.zip.ZipEntryPlugin
 
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.zip.ZipPlugin
 
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.RealBodyProcessor
Mathod that actually executes processor.
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.RegexpPatternProcessor
Mathod that actually executes processor.
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.RegexpResultProcessor
Mathod that actually executes processor.
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.RegexpSourceProcessor
Mathod that actually executes processor.
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.StylesheetProcessor
Mathod that actually executes processor.
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.WebHarvestPlugin
Deprecated.
Use execute(...) instead
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.XMLProcessor
Mathod that actually executes processor.
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.XQExpression
Mathod that actually executes processor.
executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.XQParamProcessor
Mathod that actually executes processor.
executeWithinNewContext(Callable<R>) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
Deprecated.
 
executeWithinNewContext(Callable<R>) - Method in interface org.webharvest.runtime.DynamicScopeContext
 
executeWithinNewContext(Callable<R>) - Method in class org.webharvest.runtime.ScraperContext
 
EXECUTION_TIME_PROPERTY_NAME - Static variable in class org.webharvest.WHConstants
 
existsInStringArray(String[], String, boolean) - Static method in class org.webharvest.utils.CommonUtil
Checks if specified string exists in given array
exit() - Method in class org.webharvest.ioc.AttributeHolderScope
Lets the current Thread leave this Scope.
exit(ScraperExecutionExitEvent) - Method in class org.webharvest.runtime.EventBasedStatusHolder
Changes status to ScraperState.EXIT.
EXIT - org.webharvest.runtime.ScraperState
Scraper's execution has been stopped by exit command.
ExitDef - Class in org.webharvest.definition
Definition of exit processor.
ExitDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.ExitDef
 
ExitProcessor - Class in org.webharvest.runtime.processors
Exit processor.
ExitProcessor() - Constructor for class org.webharvest.runtime.processors.ExitProcessor
 

F

FileConfigSource - Class in org.webharvest.definition
Implementation of ConfigSource that uses a file system as source of XML configurations.
FileConfigSource(File) - Constructor for class org.webharvest.definition.FileConfigSource
Class constructor expecting File as configuration source.
FileDef - Class in org.webharvest.definition
Definition of file proessor.
FileDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.FileDef
 
FileException - Exception in org.webharvest.exception
File management exception.
FileException() - Constructor for exception org.webharvest.exception.FileException
 
FileException(String) - Constructor for exception org.webharvest.exception.FileException
 
FileException(String, Throwable) - Constructor for exception org.webharvest.exception.FileException
 
FileException(Throwable) - Constructor for exception org.webharvest.exception.FileException
 
FileListIterator - Class in org.webharvest.utils
 
FileListIterator(File, boolean) - Constructor for class org.webharvest.utils.FileListIterator
 
FileProcessor - Class in org.webharvest.runtime.processors
File processor.
FileProcessor() - Constructor for class org.webharvest.runtime.processors.FileProcessor
 
findParentPlugin(AbstractProcessor) - Static method in class org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserPlugin
 
findValueInCollection(Collection, Object) - Static method in class org.webharvest.utils.CommonUtil
Searches specified value in given collection
FINISHED - org.webharvest.runtime.ScraperState
Scraper has successfully finished its work.
FtpDelPlugin - Class in org.webharvest.runtime.processors.plugins.ftp
Ftp Del plugin - can be used only inside ftp plugin for deleting file on remote directory.
FtpDelPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.ftp.FtpDelPlugin
 
FtpGetPlugin - Class in org.webharvest.runtime.processors.plugins.ftp
Ftp Get plugin - can be used only inside ftp plugin for retrieving file from remote directory.
FtpGetPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.ftp.FtpGetPlugin
 
FtpListPlugin - Class in org.webharvest.runtime.processors.plugins.ftp
Ftp List plugin - can be used only inside ftp plugin for listing file in working remote directory.
FtpListPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.ftp.FtpListPlugin
 
FtpMkdirPlugin - Class in org.webharvest.runtime.processors.plugins.ftp
Ftp Mkdir plugin - can be used only inside ftp plugin for creating directory on remote directory.
FtpMkdirPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.ftp.FtpMkdirPlugin
 
FtpPlugin - Class in org.webharvest.runtime.processors.plugins.ftp
FTP processor
FtpPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.ftp.FtpPlugin
 
FtpPluginException - Exception in org.webharvest.runtime.processors.plugins.ftp
Runtime exception for FtpPlugin
FtpPluginException() - Constructor for exception org.webharvest.runtime.processors.plugins.ftp.FtpPluginException
 
FtpPluginException(String) - Constructor for exception org.webharvest.runtime.processors.plugins.ftp.FtpPluginException
 
FtpPluginException(String, Throwable) - Constructor for exception org.webharvest.runtime.processors.plugins.ftp.FtpPluginException
 
FtpPluginException(Throwable) - Constructor for exception org.webharvest.runtime.processors.plugins.ftp.FtpPluginException
 
FtpPutPlugin - Class in org.webharvest.runtime.processors.plugins.ftp
Ftp Put plugin - can be used only inside ftp plugin for storing file to remote directory.
FtpPutPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.ftp.FtpPutPlugin
 
FtpRmdirPlugin - Class in org.webharvest.runtime.processors.plugins.ftp
Ftp Mkdir plugin - can be used only inside ftp plugin for removing subdirectory on remote directory.
FtpRmdirPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.ftp.FtpRmdirPlugin
 
fullUrl(Object, Object) - Method in class org.webharvest.utils.SystemUtilities
Calculates full URL for specified page URL and link which could be full, absolute or relative like there can be found in A or IMG tags.
fullUrl(String, String) - Static method in class org.webharvest.utils.CommonUtil
Calculates full URL for specified page URL and link which could be full, absolute or relative like there can be found in A or IMG tags.
FunctionDef - Class in org.webharvest.definition
Definition of user-defined function.
FunctionDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.FunctionDef
 
FunctionException - Exception in org.webharvest.exception
Function processor exception.
FunctionException() - Constructor for exception org.webharvest.exception.FunctionException
 
FunctionException(String) - Constructor for exception org.webharvest.exception.FunctionException
 
FunctionException(String, Throwable) - Constructor for exception org.webharvest.exception.FunctionException
 
FunctionException(Throwable) - Constructor for exception org.webharvest.exception.FunctionException
 
FunctionProcessor - Class in org.webharvest.runtime.processors
Function definition processor.
FunctionProcessor() - Constructor for class org.webharvest.runtime.processors.FunctionProcessor
 

G

generateProcessorReference(String) - Method in class org.webharvest.docs.ProcessorReferenceGenerator
 
get() - Method in class org.webharvest.ioc.AttributeHolderScope
get() - Method in class org.webharvest.ioc.DefinitionResolverProvider
get(int) - Method in class org.webharvest.runtime.processors.plugins.db.DbRowVariable
 
get(int) - Method in class org.webharvest.runtime.variables.ListVariable
 
get(Object) - Method in class org.webharvest.runtime.processors.plugins.db.DbRowVariable
 
get(String) - Method in class org.webharvest.runtime.processors.plugins.db.DbRowVariable
 
getAbsoluteFilename(String, String) - Static method in class org.webharvest.utils.CommonUtil
For the given working path and file path returns absolute file path.
getAcceptedIssuers() - Method in class org.apache.commons.httpclient.contrib.ssl.AuthSSLX509TrustManager
 
getAcceptedIssuers() - Method in class org.apache.commons.httpclient.contrib.ssl.EasyX509TrustManager
 
getAction() - Method in class org.webharvest.definition.FileDef
 
getAdvancedXmlEscape() - Method in class org.webharvest.definition.HtmlToXmlDef
 
getAllowHtmlInsideAttributes() - Method in class org.webharvest.definition.HtmlToXmlDef
 
getAllowMultiWordAttributes() - Method in class org.webharvest.definition.HtmlToXmlDef
 
getAttribute(Object) - Method in interface org.webharvest.ioc.AttributeHolder
Extracts the Object memorized for the specified key from this AttributeHolder.
getAttribute(Object) - Method in class org.webharvest.ioc.ScopeAttributeHolder
Extracts the Object memorized for the specified key from this AttributeHolder.
getAttribute(String) - Method in class org.webharvest.definition.XmlNode
 
getAttribute(String, String) - Method in class org.webharvest.definition.XmlNode
 
getAttributeLock() - Method in interface org.webharvest.ioc.AttributeHolder
Returns an object on which to lock when access to multiple methods of the AttributeHolder are to be made atomic.
getAttributeLock() - Method in class org.webharvest.ioc.ScopeAttributeHolder
Returns an object on which to lock when access to multiple methods of the AttributeHolder are to be made atomic.
getAttributes() - Method in class org.webharvest.definition.WebHarvestPluginDef
 
getAttributes() - Method in class org.webharvest.runtime.processors.WebHarvestPlugin
Deprecated.
getAttributes(String) - Method in class org.webharvest.definition.WebHarvestPluginDef
 
getAttributes(String) - Method in class org.webharvest.definition.XmlNode
 
getAttributeValueSuggestions(String) - Method in class org.webharvest.runtime.processors.plugins.db.DatabasePlugin
 
getAttributeValueSuggestions(String) - Method in class org.webharvest.runtime.processors.plugins.db.DbParamPlugin
 
getAttributeValueSuggestions(String) - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpListPlugin
 
getAttributeValueSuggestions(String) - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpPutPlugin
 
getAttributeValueSuggestions(String) - Method in class org.webharvest.runtime.processors.plugins.mail.MailAttachPlugin
 
getAttributeValueSuggestions(String) - Method in class org.webharvest.runtime.processors.plugins.mail.MailPlugin
 
getAttributeValueSuggestions(String) - Method in class org.webharvest.runtime.processors.plugins.TokenizePlugin
 
getAttributeValueSuggestions(String) - Method in class org.webharvest.runtime.processors.plugins.zip.ZipEntryPlugin
 
getAttsSet() - Method in class org.webharvest.definition.ElementInfo
 
getAttValues(String) - Method in class org.webharvest.definition.ElementInfo
 
getBody() - Method in class org.webharvest.runtime.web.HttpResponseWrapper
Deprecated.
getBodyAsInputStream() - Method in class org.webharvest.runtime.web.HttpResponseWrapper
 
getBodyTextContent(IElementDef, DynamicScopeContext) - Method in class org.webharvest.runtime.processors.AbstractProcessor
 
getBodyTextContent(IElementDef, DynamicScopeContext, boolean) - Method in class org.webharvest.runtime.processors.AbstractProcessor
 
getBodyTextContent(IElementDef, DynamicScopeContext, boolean, KeyValuePair[]) - Method in class org.webharvest.runtime.processors.AbstractProcessor
 
getBooleanAtts() - Method in class org.webharvest.definition.HtmlToXmlDef
 
getBooleanValue(String, Boolean) - Static method in class org.webharvest.utils.CommonUtil
Reads boolean value from string
getCatchValueDef() - Method in class org.webharvest.definition.TryDef
 
getCharset() - Method in class org.webharvest.definition.ConfigDef
Returns default charset for current configuration.
getCharset() - Method in class org.webharvest.definition.FileDef
 
getCharset() - Method in class org.webharvest.definition.HttpDef
 
getCharset() - Method in class org.webharvest.definition.TextDef
 
getCharset() - Method in class org.webharvest.deprecated.runtime.ScraperContext10
Deprecated.
Returns default configuration's charset.
getCharset() - Method in interface org.webharvest.runtime.DynamicScopeContext
Returns default configuration's charset.
getCharset() - Method in class org.webharvest.runtime.ScraperContext
Returns default configuration's charset.
getCharset() - Method in class org.webharvest.runtime.web.HttpResponseWrapper
 
getColumnCount() - Method in class org.webharvest.runtime.processors.plugins.db.DbRowVariable
 
getColumnName(int) - Method in class org.webharvest.runtime.processors.plugins.db.DbRowVariable
 
getColumnNumber() - Method in class org.webharvest.definition.AbstractElementDef
getColumnNumber() - Method in interface org.webharvest.definition.IElementDef
 
getColumnNumber() - Method in class org.webharvest.definition.XmlNode
 
getColumnNumber() - Method in class org.webharvest.utils.XmlValidator
 
getCompiledExpression(String) - Method in class org.webharvest.runtime.processors.XQueryExpressionPool
 
getCondition() - Method in class org.webharvest.definition.ExitDef
 
getCondition() - Method in class org.webharvest.definition.IfDef
 
getCondition() - Method in class org.webharvest.definition.WhileDef
 
getConfig() - Method in class org.webharvest.deprecated.runtime.ScraperContext10
Deprecated.
getConfig() - Method in interface org.webharvest.runtime.DynamicScopeContext
 
getConfig() - Method in class org.webharvest.runtime.ScraperContext
getConfigSource() - Method in interface org.webharvest.definition.Config
Gets reference to configuration resource ConfigSource.
getConfigSource() - Method in class org.webharvest.definition.IncludeVisitor
 
getConfigSource() - Method in class org.webharvest.definition.XMLConfig
Gets reference to configuration resource ConfigSource.
getConnection(String) - Method in interface org.webharvest.runtime.database.ConnectionFactory
Returns SQL Connection instance available under the given name (for example under JNDI name within Java EE container).
getConnection(String) - Method in class org.webharvest.runtime.database.JNDIConnectionFactory
Returns SQL Connection instance available under the given name (for example under JNDI name within Java EE container).
getConnection(String) - Method in class org.webharvest.runtime.database.StandaloneConnectionPool
Returns SQL Connection instance available under the given name (for example under JNDI name within Java EE container).
getConnection(String, String, String, String) - Method in interface org.webharvest.runtime.database.ConnectionFactory
Returns SQL Connection instance configured according to provided parameters.
getConnection(String, String, String, String) - Method in class org.webharvest.runtime.database.JNDIConnectionFactory
Returns SQL Connection instance configured according to provided parameters.
getConnection(String, String, String, String) - Method in class org.webharvest.runtime.database.StandaloneConnectionPool
Returns SQL Connection instance configured according to provided parameters.
getContentLength() - Method in class org.webharvest.runtime.web.HttpResponseWrapper
 
getContenttype() - Method in class org.webharvest.definition.HttpParamDef
 
getContentType() - Method in class org.webharvest.definition.HttpDef
 
getContentType() - Method in class org.webharvest.runtime.web.HttpParamInfo
 
getContext() - Method in interface org.webharvest.runtime.ContextHolder
Returns current DynamicScopeContext which is used by the Scraper.
getContext() - Method in class org.webharvest.runtime.processors.ProcessorExecutionContext
 
getCookiePolicy() - Method in class org.webharvest.definition.HttpDef
 
getDefinitionClass() - Method in class org.webharvest.definition.ElementInfo
 
getDelimiter() - Method in class org.webharvest.definition.TextDef
 
getDependantProcessors() - Method in class org.webharvest.definition.ElementInfo
 
getDependantProcessors() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpPlugin
 
getDependantProcessors() - Method in class org.webharvest.runtime.processors.plugins.mail.MailPlugin
 
getDependantProcessors() - Method in class org.webharvest.runtime.processors.plugins.zip.ZipPlugin
 
getDirectoryFromPath(String) - Static method in class org.webharvest.utils.CommonUtil
Extracts a filename and directory from an absolute path.
getDoubleValue(String, double) - Static method in class org.webharvest.utils.CommonUtil
Reads double value from string
getElementDef() - Method in interface org.webharvest.definition.Config
Gets reference to configuration root element.
getElementDef() - Method in class org.webharvest.definition.XMLConfig
Gets reference to configuration root element.
getElementDef() - Method in class org.webharvest.runtime.processors.AbstractProcessor
getElementDef() - Method in class org.webharvest.runtime.processors.AbstractProcessorDecorator
getElementDef() - Method in interface org.webharvest.runtime.processors.Processor
 
getElementDefs() - Method in class org.webharvest.definition.AbstractElementDef
getElementDefs() - Method in interface org.webharvest.definition.IElementDef
 
getElementInfo(Class<? extends WebHarvestPlugin>) - Static method in class org.webharvest.annotation.ElementInfoFactory
Deprecated.
Creates new ElementInfo object from processor class that is about to be registered.
getElementInfo(String, String) - Method in interface org.webharvest.definition.ConfigurableResolver
Deprecated.
getElementInfo(String, String) - Method in class org.webharvest.definition.DefinitionResolver
 
getElementList() - Method in class org.webharvest.definition.XmlNode
 
getElementNames() - Method in interface org.webharvest.definition.ConfigurableResolver
Deprecated.
getElementNames() - Method in class org.webharvest.definition.DefinitionResolver
Returns names of all known elements.
getElementsRegistry() - Method in class org.webharvest.definition.AbstractRefreshableResolver
Returns current instance of the ElementsRegistry.
getElseDef() - Method in class org.webharvest.definition.CaseDef
 
getEmail() - Method in class org.webharvest.runtime.processors.plugins.mail.MailPlugin
 
getEmpty() - Method in class org.webharvest.definition.LoopDef
 
getEmpty() - Method in class org.webharvest.definition.WhileDef
 
getEngine(ScriptSource) - Method in class org.webharvest.runtime.scripting.jsr.JSRScriptEngineFactory
Returns ScriptEngine capable of executing provided ScriptSource.
getEngine(ScriptSource) - Method in interface org.webharvest.runtime.scripting.ScriptEngineFactory
Returns ScriptEngine capable of executing provided ScriptSource.
getException() - Method in class org.webharvest.events.ScraperExecutionErrorEvent
Return reference to the exception occurred.
getException() - Method in class org.webharvest.utils.XmlValidator
 
getExecutionTime() - Method in class org.webharvest.events.ScraperExecutionEndEvent
Returns how long the execution took in milliseconds.
getExecutionTime() - Method in class org.webharvest.runtime.processors.ProcessorExecutionContext
 
getExpression() - Method in class org.webharvest.definition.XPathDef
 
getExternalParamDefs() - Method in class org.webharvest.definition.XQueryDef
 
getFileFromPath(String) - Static method in class org.webharvest.utils.CommonUtil
Extracts a filename from an absolute path.
getFilename() - Method in class org.webharvest.definition.HttpParamDef
 
getFilename(String) - Method in class org.webharvest.utils.SystemUtilities
 
getFileName() - Method in class org.webharvest.runtime.web.HttpParamInfo
 
getFilter() - Method in class org.webharvest.definition.LoopDef
 
getFirstSubnode(ElementName) - Method in class org.webharvest.definition.XmlNode
 
getFlagCanoneq() - Method in class org.webharvest.definition.RegexpDef
 
getFlagCaseInsensitive() - Method in class org.webharvest.definition.RegexpDef
 
getFlagDotall() - Method in class org.webharvest.definition.RegexpDef
 
getFlagMultiline() - Method in class org.webharvest.definition.RegexpDef
 
getFlagUnicodecase() - Method in class org.webharvest.definition.RegexpDef
 
getFollowRedirects() - Method in class org.webharvest.definition.HttpDef
 
getFtpClient() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpPlugin
 
getFunctionDef(String) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
Deprecated.
getFunctionDef(String) - Method in interface org.webharvest.runtime.DynamicScopeContext
Deprecated.
getFunctionDef(String) - Method in class org.webharvest.runtime.ScraperContext
Deprecated.
getHarvester() - Method in interface org.webharvest.events.HarvesterEvent
Gets reference of event's source/destination.
getHarvester() - Method in class org.webharvest.events.ScraperExecutionContinuedEvent
 
getHarvester() - Method in class org.webharvest.events.ScraperExecutionPausedEvent
 
getHarvester() - Method in class org.webharvest.events.ScraperExecutionStoppedEvent
Gets reference of event's source/destination.
getHarvester(ConfigSource, HarvestLoadCallback) - Method in interface org.webharvest.Harvest
Creates new Harvester object with configuration with configuration represented by ConfigSource object.
getHarvester(ConfigSource, HarvestLoadCallback) - Method in class org.webharvest.runtime.DefaultHarvest
Creates new Harvester object with configuration with configuration represented by ConfigSource object.
getHeader(String) - Method in class org.webharvest.runtime.web.HttpInfo
 
getHeaders() - Method in class org.webharvest.runtime.web.HttpResponseWrapper
 
getHeaders(String) - Method in class org.webharvest.runtime.web.HttpInfo
 
getHostnameVerification() - Method in class org.apache.commons.httpclient.contrib.ssl.StrictSSLProtocolSocketFactory
Gets the status of the host name verification flag.
getHttpClient() - Method in class org.webharvest.runtime.web.HttpClientManager
 
getHttpInfo() - Method in class org.webharvest.runtime.web.HttpClientManager
 
getHyphenReplacement() - Method in class org.webharvest.definition.HtmlToXmlDef
 
getId() - Method in class org.webharvest.definition.AbstractElementDef
 
getId() - Method in interface org.webharvest.definition.IElementDef
 
getIdentifier() - Method in class org.webharvest.runtime.processors.plugins.db.DbColumnDescription
 
getIfDefs() - Method in class org.webharvest.definition.CaseDef
 
getIgnoreResponseBody() - Method in class org.webharvest.definition.HttpDef
 
getIndex() - Method in class org.webharvest.definition.LoopDef
 
getIndex() - Method in class org.webharvest.definition.WhileDef
 
getInjector() - Static method in class org.webharvest.ioc.InjectorHelper
Gets reference to the current Injector.
getIntValue(String, int) - Static method in class org.webharvest.utils.CommonUtil
Reads integer value from string
getIsfile() - Method in class org.webharvest.definition.HttpParamDef
 
getItem() - Method in class org.webharvest.definition.LoopDef
 
getKey() - Method in class org.webharvest.utils.KeyValuePair
 
getLanguage() - Method in class org.webharvest.definition.ScriptDef
 
getLanguage() - Method in class org.webharvest.definition.TemplateDef
 
getLanguage() - Method in class org.webharvest.runtime.scripting.ScriptSource
 
getLineNumber() - Method in class org.webharvest.definition.AbstractElementDef
getLineNumber() - Method in interface org.webharvest.definition.IElementDef
 
getLineNumber() - Method in class org.webharvest.definition.XmlNode
 
getLineNumber() - Method in class org.webharvest.utils.XmlValidator
 
getList() - Method in class org.webharvest.runtime.variables.ListVariable
 
getList() - Method in class org.webharvest.utils.Stack
 
getListDirs() - Method in class org.webharvest.definition.FileDef
 
getListFiles() - Method in class org.webharvest.definition.FileDef
 
getListFilter() - Method in class org.webharvest.definition.FileDef
 
getListRecursive() - Method in class org.webharvest.definition.FileDef
 
getLocation() - Method in class org.webharvest.definition.BufferConfigSource
Gets the reference to physical location of configuration.
getLocation() - Method in interface org.webharvest.definition.ConfigSource
Gets the reference to physical location of configuration.
getLocation() - Method in class org.webharvest.definition.FileConfigSource
Gets the reference to physical location of configuration.
getLocation() - Method in class org.webharvest.definition.URLConfigSource
Gets the reference to physical location of configuration.
getLoopBodyDef() - Method in class org.webharvest.definition.LoopDef
 
getLoopValueDef() - Method in class org.webharvest.definition.LoopDef
 
getMax() - Method in class org.webharvest.definition.RegexpDef
 
getMaxloops() - Method in class org.webharvest.definition.LoopDef
 
getMaxLoops() - Method in class org.webharvest.definition.WhileDef
 
getMessage() - Method in class org.webharvest.definition.ExitDef
 
getMessage() - Method in class org.webharvest.events.ScraperExecutionExitEvent
Returns cause of execution's exit.
getMethod() - Method in class org.webharvest.definition.HttpDef
 
getMimeType() - Method in class org.webharvest.runtime.web.HttpResponseWrapper
 
getName() - Method in class org.webharvest.definition.CallDef
 
getName() - Method in class org.webharvest.definition.CallParamDef
 
getName() - Method in class org.webharvest.definition.ElementInfo
 
getName() - Method in class org.webharvest.definition.ElementName
 
getName() - Method in class org.webharvest.definition.FunctionDef
 
getName() - Method in class org.webharvest.definition.HttpHeaderDef
 
getName() - Method in class org.webharvest.definition.HttpParamDef
 
getName() - Method in class org.webharvest.definition.VarDef
 
getName() - Method in class org.webharvest.definition.VarDefDef
 
getName() - Method in class org.webharvest.definition.XmlAttribute
 
getName() - Method in class org.webharvest.definition.XmlNode
 
getName() - Method in class org.webharvest.definition.XQueryExternalParamDef
 
getName() - Method in class org.webharvest.runtime.processors.plugins.db.AbstractDatabasePlugin
 
getName() - Method in class org.webharvest.runtime.processors.plugins.db.DbColumnDescription
 
getName() - Method in class org.webharvest.runtime.processors.plugins.db.DbParamPlugin
 
getName() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpDelPlugin
 
getName() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpGetPlugin
 
getName() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpListPlugin
 
getName() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpMkdirPlugin
 
getName() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpPlugin
 
getName() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpPutPlugin
 
getName() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpRmdirPlugin
 
getName() - Method in class org.webharvest.runtime.processors.plugins.mail.MailAttachPlugin
 
getName() - Method in class org.webharvest.runtime.processors.plugins.mail.MailPlugin
 
getName() - Method in class org.webharvest.runtime.processors.plugins.TokenizePlugin
 
getName() - Method in class org.webharvest.runtime.processors.plugins.XmlToJsonPlugin
 
getName() - Method in class org.webharvest.runtime.processors.plugins.zip.ZipEntryPlugin
 
getName() - Method in class org.webharvest.runtime.processors.plugins.zip.ZipPlugin
 
getName() - Method in class org.webharvest.runtime.web.HttpParamInfo
 
getNamespacesAware() - Method in class org.webharvest.definition.HtmlToXmlDef
 
getNamespaceURI() - Method in interface org.webharvest.definition.IElementDef
Deprecated. 
getNamespaceURI() - Method in class org.webharvest.definition.WebHarvestPluginDef
 
getNextAttachmentName() - Method in class org.webharvest.runtime.processors.plugins.mail.MailPlugin
 
getNode() - Method in interface org.webharvest.definition.IElementDef
Deprecated.
getNode() - Method in class org.webharvest.definition.WebHarvestPluginDef
Deprecated.
getNsAttsSet() - Method in class org.webharvest.definition.ElementInfo
 
getOmitComments() - Method in class org.webharvest.definition.HtmlToXmlDef
 
getOmitDeprecatedTags() - Method in class org.webharvest.definition.HtmlToXmlDef
 
getOmitHtmlEnvelope() - Method in class org.webharvest.definition.HtmlToXmlDef
 
getOmitUnknownTags() - Method in class org.webharvest.definition.HtmlToXmlDef
 
getOmitXmlDecl() - Method in class org.webharvest.definition.HtmlToXmlDef
 
getOperationDefs() - Method in class org.webharvest.definition.AbstractElementDef
Deprecated.
getOperationDefs() - Method in class org.webharvest.definition.CaseDef
 
getOperationDefs() - Method in interface org.webharvest.definition.IElementDef
Deprecated.
getOperationDefs() - Method in class org.webharvest.definition.LoopDef
 
getOperationDefs() - Method in class org.webharvest.definition.RegexpDef
 
getOperationDefs() - Method in class org.webharvest.definition.TryDef
 
getOperationDefs() - Method in class org.webharvest.definition.XQueryDef
 
getOperationDefs() - Method in class org.webharvest.definition.XsltDef
 
getOutputType() - Method in class org.webharvest.definition.HtmlToXmlDef
 
getOverwrite() - Method in class org.webharvest.definition.VarDefDef
 
getParentProcessor() - Method in class org.webharvest.runtime.processors.AbstractProcessor
Returns reference to parent Processor of this processor.
getParentProcessor() - Method in class org.webharvest.runtime.processors.AbstractProcessorDecorator
Returns reference to parent Processor of this processor.
getParentProcessor() - Method in interface org.webharvest.runtime.processors.Processor
Returns reference to parent Processor of this processor.
getPassword() - Method in class org.webharvest.definition.HttpDef
 
getPath() - Method in class org.webharvest.definition.FileDef
 
getPath() - Method in class org.webharvest.definition.IncludeDef
 
getPhantomTemplateAsString() - Static method in class org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserPlugin
 
getPluginClass(String) - Static method in class org.webharvest.utils.ClassLoaderUtil
 
getProcessor() - Method in class org.webharvest.events.ProcessorStartEvent
Returns reference to Processor which has been started.
getProcessor() - Method in class org.webharvest.events.ProcessorStopEvent
Returns reference to Processor which has been stopped.
getProcessor() - Method in class org.webharvest.runtime.processors.ProcessorExecutionContext
 
getProcessorClass() - Method in class org.webharvest.definition.ElementInfo
 
getProcessorId() - Method in class org.webharvest.runtime.processors.ProcessorExecutionContext
 
getProperties() - Method in class org.webharvest.events.ProcessorStopEvent
Returns properties of the Processor which has been stopped.
getProperties() - Method in class org.webharvest.runtime.processors.ProcessorExecutionContext
 
getPrunetags() - Method in class org.webharvest.definition.HtmlToXmlDef
 
getQName() - Method in class org.webharvest.definition.XmlNode
 
getReader() - Method in class org.webharvest.definition.BufferConfigSource
Gets the Reader for character stream.
getReader() - Method in class org.webharvest.definition.FileConfigSource
Gets the Reader for character stream.
getReader() - Method in class org.webharvest.definition.URLConfigSource
Gets the Reader for character stream.
getReader() - Method in interface org.webharvest.utils.HasReader
Gets the Reader for character stream.
getRecognizeUnicodeChars() - Method in class org.webharvest.definition.HtmlToXmlDef
 
getRegexpPatternDef() - Method in class org.webharvest.definition.RegexpDef
 
getRegexpResultDef() - Method in class org.webharvest.definition.RegexpDef
 
getRegexpSourceDef() - Method in class org.webharvest.definition.RegexpDef
 
getReplace() - Method in class org.webharvest.definition.RegexpDef
 
getReplaceNbspWithSp() - Method in class org.webharvest.definition.HtmlToXmlDef
 
getRequiredAttributes() - Method in class org.webharvest.runtime.processors.plugins.db.DatabasePlugin
 
getRequiredAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpDelPlugin
 
getRequiredAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpGetPlugin
 
getRequiredAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpListPlugin
 
getRequiredAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpMkdirPlugin
 
getRequiredAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpPlugin
 
getRequiredAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpPutPlugin
 
getRequiredAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpRmdirPlugin
 
getRequiredAttributes() - Method in class org.webharvest.runtime.processors.plugins.mail.MailPlugin
 
getRequiredAttributes() - Method in class org.webharvest.runtime.processors.plugins.zip.ZipEntryPlugin
 
getRequiredAttributes() - Method in class org.webharvest.runtime.processors.plugins.zip.ZipPlugin
 
getRequiredAttsSet() - Method in class org.webharvest.definition.ElementInfo
 
getRequiredSubprocessors() - Method in class org.webharvest.runtime.processors.plugins.mail.MailPlugin
 
getRequiredTagsSet() - Method in class org.webharvest.definition.ElementInfo
 
getRetryAttempts() - Method in class org.webharvest.definition.HttpDef
 
getRetryDelay() - Method in class org.webharvest.definition.HttpDef
 
getRetryDelayFactor() - Method in class org.webharvest.definition.HttpDef
 
getReturnExpression() - Method in class org.webharvest.definition.ScriptDef
For backward compatibility with ver.2b1 only!
getRunningLevel() - Method in class org.webharvest.runtime.processors.AbstractProcessor
Returns information how deep in processors hierarchy is this Processor.
getRunningLevel() - Method in class org.webharvest.runtime.processors.AbstractProcessorDecorator
Returns information how deep in processors hierarchy is this Processor.
getRunningLevel() - Method in interface org.webharvest.runtime.processors.Processor
Returns information how deep in processors hierarchy is this Processor.
getSAXParserFactory(boolean, boolean) - Static method in class org.webharvest.utils.XmlUtil
 
getSchema() - Method in interface org.webharvest.definition.validation.SchemaFactory
Returns an instance of Schema which should be used to validate a XML file.
getSchema() - Method in class org.webharvest.definition.validation.SchemaFactoryImpl
Returns an instance of Schema which should be used to validate a XML file.
getSchemaFactory() - Static method in class org.webharvest.definition.validation.SchemaComponentFactory
Returns an instance of SchemaFactory.
getSchemaResolver() - Static method in class org.webharvest.definition.validation.SchemaComponentFactory
Returns an instance of SchemaResolver.
getScraper() - Method in class org.webharvest.events.ScraperExecutionEndEvent
Returns instance of WebScraper which execution has been completed.
getScraper() - Method in class org.webharvest.events.ScraperExecutionStartEvent
 
getScriptingLanguage() - Method in class org.webharvest.definition.ConfigDef
Returns default ScriptingLanguage for current configuration.
getScriptingLanguage() - Method in class org.webharvest.deprecated.runtime.ScraperContext10
Deprecated.
getScriptingLanguage() - Method in interface org.webharvest.runtime.DynamicScopeContext
Deprecated.
getScriptingLanguage() - Method in class org.webharvest.runtime.ScraperContext
Deprecated.
getShortElementName() - Method in class org.webharvest.definition.AbstractElementDef
getShortElementName() - Method in class org.webharvest.definition.CallDef
 
getShortElementName() - Method in class org.webharvest.definition.CallParamDef
 
getShortElementName() - Method in class org.webharvest.definition.CaseDef
 
getShortElementName() - Method in class org.webharvest.definition.ConstantDef
 
getShortElementName() - Method in class org.webharvest.definition.EmptyDef
 
getShortElementName() - Method in class org.webharvest.definition.ExitDef
 
getShortElementName() - Method in class org.webharvest.definition.FileDef
 
getShortElementName() - Method in class org.webharvest.definition.FunctionDef
 
getShortElementName() - Method in class org.webharvest.definition.HtmlToXmlDef
 
getShortElementName() - Method in class org.webharvest.definition.HttpDef
 
getShortElementName() - Method in class org.webharvest.definition.HttpHeaderDef
 
getShortElementName() - Method in class org.webharvest.definition.HttpParamDef
 
getShortElementName() - Method in interface org.webharvest.definition.IElementDef
 
getShortElementName() - Method in class org.webharvest.definition.IfDef
 
getShortElementName() - Method in class org.webharvest.definition.IncludeDef
 
getShortElementName() - Method in class org.webharvest.definition.LoopDef
 
getShortElementName() - Method in class org.webharvest.definition.RegexpDef
 
getShortElementName() - Method in class org.webharvest.definition.ReturnDef
 
getShortElementName() - Method in class org.webharvest.definition.ScriptDef
 
getShortElementName() - Method in class org.webharvest.definition.TemplateDef
 
getShortElementName() - Method in class org.webharvest.definition.TextDef
 
getShortElementName() - Method in class org.webharvest.definition.TryDef
 
getShortElementName() - Method in class org.webharvest.definition.VarDef
 
getShortElementName() - Method in class org.webharvest.definition.VarDefDef
 
getShortElementName() - Method in class org.webharvest.definition.WhileDef
 
getShortElementName() - Method in class org.webharvest.definition.XPathDef
 
getShortElementName() - Method in class org.webharvest.definition.XQueryDef
 
getShortElementName() - Method in class org.webharvest.definition.XQueryExternalParamDef
 
getShortElementName() - Method in class org.webharvest.definition.XsltDef
 
getSource() - Method in class org.webharvest.definition.validation.SchemaSource
Returns an instance of created XML schema's Source.
getSourceCode() - Method in class org.webharvest.runtime.scripting.ScriptSource
 
getStartTime() - Method in class org.webharvest.runtime.processors.ProcessorExecutionContext
 
getStaticQueryContext() - Method in class org.webharvest.runtime.RuntimeConfig
 
getStatus() - Method in class org.webharvest.runtime.EventBasedStatusHolder
Returns current status of being processed configuration.
getStatus() - Method in interface org.webharvest.runtime.StatusHolder
Returns current status of being processed configuration.
getStatusCode() - Method in class org.webharvest.runtime.web.HttpResponseWrapper
 
getStatusText() - Method in class org.webharvest.runtime.web.HttpResponseWrapper
 
getStylesheetDef() - Method in class org.webharvest.definition.XsltDef
 
getSubnodes(ElementName) - Method in class org.webharvest.definition.XmlNode
 
getTagsSet() - Method in class org.webharvest.definition.ElementInfo
 
getTargetConnection() - Method in interface org.webharvest.runtime.database.ConnectionProxy
Returns proxied Connection instance.
getTemplate(boolean) - Method in class org.webharvest.definition.ElementInfo
 
getText() - Method in class org.webharvest.definition.ConstantDef
 
getText() - Method in class org.webharvest.definition.XmlNode
 
getTranslateSpecialEntities() - Method in class org.webharvest.definition.HtmlToXmlDef
 
getTreatDeprecatedTagsAsContent() - Method in class org.webharvest.definition.HtmlToXmlDef
 
getTreatUnknownTagsAsContent() - Method in class org.webharvest.definition.HtmlToXmlDef
 
getTryBodyDef() - Method in class org.webharvest.definition.TryDef
 
getType() - Method in class org.webharvest.definition.FileDef
 
getType() - Method in class org.webharvest.definition.XQueryExternalParamDef
 
getType() - Method in class org.webharvest.runtime.processors.plugins.db.DbColumnDescription
 
getTypesAnnotatedWith(Class<? extends Annotation>) - Method in interface org.webharvest.utils.ClassPathScanner
Returns Set of all types annotated with the given annotation.
getTypesAnnotatedWith(Class<? extends Annotation>) - Method in class org.webharvest.utils.ClassPathScannerImpl
Returns Set of all types annotated with the given annotation.
getUri() - Method in class org.webharvest.definition.ElementName
 
getUri() - Method in class org.webharvest.definition.WebHarvestPluginDef
 
getUri() - Method in class org.webharvest.definition.XmlAttribute
 
getUri() - Method in class org.webharvest.definition.XmlNode
 
getUrl() - Method in class org.webharvest.definition.HttpDef
 
getUseCdataForScriptAndStyle() - Method in class org.webharvest.definition.HtmlToXmlDef
 
getUseEmptyElementTags() - Method in class org.webharvest.definition.HtmlToXmlDef
 
getUsername() - Method in class org.webharvest.definition.HttpDef
 
getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.db.DatabasePlugin
 
getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.db.DbParamPlugin
 
getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpDelPlugin
 
getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpGetPlugin
 
getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpListPlugin
 
getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpMkdirPlugin
 
getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpPlugin
 
getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpPutPlugin
 
getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpRmdirPlugin
 
getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.mail.MailAttachPlugin
 
getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.mail.MailPlugin
 
getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.TokenizePlugin
 
getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.zip.ZipEntryPlugin
 
getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.zip.ZipPlugin
 
getValidIdentifier(String) - Static method in class org.webharvest.utils.CommonUtil
For the given string creates valid identifier name.
getValidSubprocessors() - Method in class org.webharvest.runtime.processors.plugins.mail.MailPlugin
 
getValue() - Method in class org.webharvest.definition.XmlAttribute
 
getValue() - Method in class org.webharvest.runtime.variables.VariableName
Gets Variable value.
getValue() - Method in class org.webharvest.runtime.web.HttpParamInfo
 
getValue() - Method in class org.webharvest.utils.KeyValuePair
 
getVar(String) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
Deprecated.
 
getVar(String) - Method in interface org.webharvest.runtime.DynamicScopeContext
 
getVar(String) - Method in class org.webharvest.runtime.ScraperContext
 
getVar(String) - Method in class org.webharvest.utils.SystemUtilities
Returns variable from scraper context
getVariableMap() - Method in class org.webharvest.definition.XPathDef
 
GetVarPlugin - Class in org.webharvest.runtime.processors.plugins.variable
 
GetVarPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.variable.GetVarPlugin
 
getVersion() - Method in interface org.webharvest.definition.Config
 
getVersion() - Method in class org.webharvest.definition.XMLConfig
getWrappedObject() - Method in class org.webharvest.runtime.processors.plugins.db.DbRowVariable
 
getWrappedObject() - Method in class org.webharvest.runtime.variables.EmptyVariable
 
getWrappedObject() - Method in class org.webharvest.runtime.variables.ListVariable
 
getWrappedObject() - Method in class org.webharvest.runtime.variables.NodeVariable
 
getWrappedObject() - Method in class org.webharvest.runtime.variables.Variable
 
getXmlDef() - Method in class org.webharvest.definition.XsltDef
 
getXqDef() - Method in class org.webharvest.definition.XQueryDef
 
getXQueryExpressionPool() - Method in class org.webharvest.runtime.RuntimeConfig
 
getZipOutStream() - Method in class org.webharvest.runtime.processors.plugins.zip.ZipPlugin
 
GROOVY - org.webharvest.runtime.scripting.ScriptingLanguage
 

H

handle(T) - Method in interface org.webharvest.events.EventHandler
Handle event of supported type..
HandlerHolder - Interface in org.webharvest.events
Represents an object that serves purpose as storage of EventHandler (supporting different types of events).
Harvest - Interface in org.webharvest
Web-Harvest application facade that provides control over creation of scraping processors Harvester and dispatching of scraping events.
Harvester - Interface in org.webharvest
Represents scraping session object that is associated with particular configuration and can be executed multiple times.
Harvester.ContextInitCallback - Interface in org.webharvest
Context initialization callback that is invoked for all newly created context objects shortly before of scraping session.
HarvesterEvent - Interface in org.webharvest.events
Represent an event object that is either addressed to particular Harvester instance or represents state change that happened on it.
HarvesterEventSink - Class in org.webharvest.events
Implementation of EventSink intended to guarantee scraping scope's events delivery.
HarvesterEventSink(Registry<Harvester, EventBus>) - Constructor for class org.webharvest.events.HarvesterEventSink
Default class constructor expecting Registry to be injected possibly by IoC container.
HarvesterFactory - Interface in org.webharvest.ioc
Guice dynamic factory helper interface that help to instantiate Harvester objects.
HarvestLoadCallback - Interface in org.webharvest
Callback interface representing successfully loaded scraping configuration that is projected as collection of IElementDef objects.
hasAttribute(Object) - Method in interface org.webharvest.ioc.AttributeHolder
Decides if this AttributeHolder has an association for the specified key.
hasAttribute(Object) - Method in class org.webharvest.ioc.ScopeAttributeHolder
Decides if this AttributeHolder has an association for the specified key.
hasBody() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpDelPlugin
 
hasBody() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpGetPlugin
 
hasBody() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpListPlugin
 
hasBody() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpMkdirPlugin
 
hasBody() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpRmdirPlugin
 
hashCode() - Method in class org.apache.commons.httpclient.contrib.ssl.EasySSLProtocolSocketFactory
 
hashCode() - Method in class org.apache.commons.httpclient.contrib.ssl.StrictSSLProtocolSocketFactory
 
hashCode() - Method in class org.webharvest.definition.ElementName
 
hashCode() - Method in class org.webharvest.definition.validation.SchemaSource
Returns hash code of system identifier of the schema's Source.
hashCode() - Method in class org.webharvest.runtime.scripting.ScriptSource
 
hasNext() - Method in class org.webharvest.utils.FileListIterator
 
hasOperations() - Method in class org.webharvest.definition.AbstractElementDef
hasOperations() - Method in interface org.webharvest.definition.IElementDef
 
HasReader - Interface in org.webharvest.utils
Represent an object holding character stream Reader.
headers - Variable in class org.webharvest.runtime.web.HttpInfo
 
hear(TypeLiteral<I>, TypeEncounter<I>) - Method in class org.webharvest.ioc.EventBusTypeListener
hear(TypeLiteral<I>, TypeEncounter<I>) - Method in class org.webharvest.ioc.PostConstructListener
hear(TypeLiteral<I>, TypeEncounter<I>) - Method in class org.webharvest.ioc.ScrapingAwareTypeListener
HtmlToXmlDef - Class in org.webharvest.definition
Definition of HTML to XML rensformation task.
HtmlToXmlDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.HtmlToXmlDef
 
HtmlToXmlProcessor - Class in org.webharvest.runtime.processors
Advanced HTML to XML processor using Chain of Responsibility pattern with Strategy pattern for different HTML parsing strategies.
HtmlToXmlProcessor() - Constructor for class org.webharvest.runtime.processors.HtmlToXmlProcessor
 
HttpClientManager - Class in org.webharvest.runtime.web
HTTP client functionality.
HttpClientManager(HttpClientManager.ProxySettings) - Constructor for class org.webharvest.runtime.web.HttpClientManager
 
HttpClientManager.ProxySettings - Class in org.webharvest.runtime.web
 
HttpClientManager.ProxySettings.Builder - Class in org.webharvest.runtime.web
 
HttpDef - Class in org.webharvest.definition
Definition of HTTP processor.
HttpDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.HttpDef
Deprecated.
HttpException - Exception in org.webharvest.exception
Http exception - occures during http requests.
HttpException() - Constructor for exception org.webharvest.exception.HttpException
 
HttpException(String) - Constructor for exception org.webharvest.exception.HttpException
 
HttpException(String, Throwable) - Constructor for exception org.webharvest.exception.HttpException
 
HttpException(Throwable) - Constructor for exception org.webharvest.exception.HttpException
 
HttpHeaderDef - Class in org.webharvest.definition
Definition of HTTP header.
HttpHeaderDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.HttpHeaderDef
 
HttpHeaderProcessor - Class in org.webharvest.runtime.processors
Variable definition http header processor.
HttpHeaderProcessor() - Constructor for class org.webharvest.runtime.processors.HttpHeaderProcessor
 
HttpInfo - Class in org.webharvest.runtime.web
Class offers access to HTTP client and response details to the user.
HttpInfo(HttpClient) - Constructor for class org.webharvest.runtime.web.HttpInfo
 
HttpModule - Class in org.webharvest.ioc
Google Guice module containing bindings for Web-Harvest's HTTP-related components.
HttpModule(HttpClientManager.ProxySettings) - Constructor for class org.webharvest.ioc.HttpModule
Instantiates HttpModule accepting not-null reference to the HTTP HttpClientManager.ProxySettings.
HttpParamDef - Class in org.webharvest.definition
Definition of HTTP parameter.
HttpParamDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.HttpParamDef
 
HttpParamInfo - Class in org.webharvest.runtime.web
Information about http request parameter.
HttpParamInfo(String, boolean, String, String, Variable) - Constructor for class org.webharvest.runtime.web.HttpParamInfo
 
HttpParamProcessor - Class in org.webharvest.runtime.processors
Variable definition http param processor.
HttpParamProcessor() - Constructor for class org.webharvest.runtime.processors.HttpParamProcessor
 
HttpProcessor - Class in org.webharvest.runtime.processors
Http processor.
HttpProcessor() - Constructor for class org.webharvest.runtime.processors.HttpProcessor
 
HttpResponseWrapper - Class in org.webharvest.runtime.web
Class defines http server response.
HttpResponseWrapper(HttpMethodBase) - Constructor for class org.webharvest.runtime.web.HttpResponseWrapper
Constructor - defines response result based on specified HttpMethodBase instance.

I

IElementDef - Interface in org.webharvest.definition
Marker for element definition.
IfDef - Class in org.webharvest.definition
Definition of conditional processor.
IfDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.IfDef
 
IfProcessor - Class in org.webharvest.runtime.processors
 
IfProcessor() - Constructor for class org.webharvest.runtime.processors.IfProcessor
 
include(ConfigSource.Location) - Method in class org.webharvest.definition.AbstractConfigSource
Includes specified ConfigSource.Location from current configuration if possible.
include(ConfigSource.Location) - Method in interface org.webharvest.definition.ConfigSource
Includes specified ConfigSource.Location from current configuration if possible.
IncludeDef - Class in org.webharvest.definition
Definition of include element.
IncludeDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.IncludeDef
 
IncludeProcessor - Class in org.webharvest.runtime.processors
Include processor.
IncludeProcessor() - Constructor for class org.webharvest.runtime.processors.IncludeProcessor
 
IncludeVisitor - Class in org.webharvest.definition
 
IncludeVisitor(String) - Constructor for class org.webharvest.definition.IncludeVisitor
 
indent(int) - Static method in class org.webharvest.utils.CommonUtil
Deprecated.
Provides unnecessary complexity for logging facility.
informListenersAboutError(Exception) - Method in class org.webharvest.runtime.Scraper
Inform all scraper listeners that an error has occured during scraper execution.
informListenersAboutError(Exception) - Method in interface org.webharvest.runtime.WebScraper
Deprecated.
initContext() - Method in class org.webharvest.deprecated.runtime.ScraperContext10
Deprecated.
 
initContext() - Method in class org.webharvest.runtime.ScraperContext
 
InjectorHelper - Class in org.webharvest.ioc
Guice static injector helper.
INSTANCE - org.webharvest.runtime.database.DefaultDriverManager
Singleton instance reference
INSTANCE - Static variable in class org.webharvest.definition.validation.SchemaFactoryImpl
Reference to singleton instance.
INSTANCE - Static variable in interface org.webharvest.runtime.processors.ProcessorFactory
 
INSTANCE - Static variable in class org.webharvest.runtime.variables.EmptyVariable
 
internal() - Method in annotation type org.webharvest.annotation.Definition
 
IntPair() - Constructor for class org.webharvest.utils.CommonUtil.IntPair
 
IntPair(int, int) - Constructor for class org.webharvest.utils.CommonUtil.IntPair
 
invalidate(K) - Method in interface org.webharvest.Cache
Discard value cached under the specified key.
invalidate(K) - Method in class org.webharvest.ThreadLocalCache
Discard value cached under the specified key.
invalidAttribute(String, String) - Static method in class org.webharvest.exception.ErrMsg
 
invalidTag(String, String) - Static method in class org.webharvest.exception.ErrMsg
 
invoke(MethodInvocation) - Method in class org.webharvest.ioc.ScrapingInterceptor
isBooleanTrue(String) - Static method in class org.webharvest.utils.CommonUtil
Checks if specified string value represents boolean true value.
isEmpty() - Method in class org.webharvest.runtime.variables.EmptyVariable
 
isEmpty() - Method in class org.webharvest.runtime.variables.ListVariable
 
isEmpty() - Method in class org.webharvest.runtime.variables.NodeVariable
 
isEmpty() - Method in class org.webharvest.runtime.variables.Variable
 
isEmpty() - Method in class org.webharvest.utils.Stack
 
isEmpty() - Method in class org.webharvest.utils.XmlNodeWrapper
 
isEmpty(String) - Static method in class org.webharvest.utils.CommonUtil
 
isEmptyString(Object) - Static method in class org.webharvest.utils.CommonUtil
 
isFalse(boolean, String, Object...) - Static method in class org.webharvest.utils.Assert
 
isFile() - Method in class org.webharvest.runtime.web.HttpParamInfo
 
isFullUrl(String) - Static method in class org.webharvest.utils.CommonUtil
Checks if specified link is full URL.
isInternal() - Method in class org.webharvest.definition.ElementInfo
 
isNull(Object) - Static method in class org.webharvest.utils.Assert
 
isNull(Object, String, Object...) - Static method in class org.webharvest.utils.Assert
 
isPathAbsolute(String) - Static method in class org.webharvest.utils.CommonUtil
Checks if specified file path is absolute.
isPluginRegistered(Class, String) - Method in class org.webharvest.definition.DefinitionResolver
Deprecated.
isPluginRegistered(String, String) - Method in interface org.webharvest.definition.ConfigurableResolver
Deprecated.
isPluginRegistered(String, String) - Method in class org.webharvest.definition.DefinitionResolver
 
isSatisfied() - Method in class org.webharvest.runtime.RunningStatusGuard
isTrue(boolean, String, Object...) - Static method in class org.webharvest.utils.Assert
 
isValidXmlIdentifier(String) - Static method in class org.webharvest.utils.CommonUtil
Checks if given string is valid XML identifier, i.e. it can be valid XML tag or attribute name.
isVariableDefined(String) - Method in class org.webharvest.utils.SystemUtilities
 
iterator() - Method in class org.webharvest.deprecated.runtime.ScraperContext10
Deprecated.
 
iterator() - Method in class org.webharvest.runtime.ScraperContext
 
iterator() - Method in class org.webharvest.runtime.variables.ListVariable
 

J

JAVASCRIPT - org.webharvest.runtime.scripting.ScriptingLanguage
 
JNDI_NAME_ATTRIBUTE - Static variable in class org.webharvest.runtime.processors.plugins.db.DatabaseJNDIPlugin
Name of XML attribute representing JNDI hook name.
JNDIConnectionFactory - Class in org.webharvest.runtime.database
ConnectionFactory implementation supporting JNDI DataSource lookups.
JNDIConnectionFactory() - Constructor for class org.webharvest.runtime.database.JNDIConnectionFactory
Instantiates JNDIConnectionFactory, obtaining reference to the JNDI Context.
JsonToXmlPlugin - Class in org.webharvest.runtime.processors.plugins
Converter from JSON to XML
JsonToXmlPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.JsonToXmlPlugin
 
JSRScriptEngineAdapter - Class in org.webharvest.runtime.scripting.jsr
Adapter design pattern implementation.
JSRScriptEngineAdapter(ScriptEngine) - Constructor for class org.webharvest.runtime.scripting.jsr.JSRScriptEngineAdapter
JSRScriptEngineAdapter constructor accepting reference to the not-null ScriptEngine delegate.
JSRScriptEngineFactory - Class in org.webharvest.runtime.scripting.jsr
ScriptEngineFactory implementation that creates script engines based on JSR-223 specification.
JSRScriptEngineFactory() - Constructor for class org.webharvest.runtime.scripting.jsr.JSRScriptEngineFactory
 

K

KeyValuePair<T> - Class in org.webharvest.utils
 
KeyValuePair(String, T) - Constructor for class org.webharvest.utils.KeyValuePair
 

L

LAYOUT - Static variable in annotation type org.webharvest.ioc.DebugFileLogger
Logger's layout
lf - Static variable in class org.webharvest.utils.SystemUtilities
 
listBound() - Method in class org.webharvest.AbstractRegistry
Lists names of all values bound.
listBound() - Method in class org.webharvest.LockedRegistry
Lists names of all values bound.
listBound() - Method in interface org.webharvest.Registry
Lists names of all values bound.
ListProcessor - Class in org.webharvest.runtime.processors
 
ListProcessor() - Constructor for class org.webharvest.runtime.processors.ListProcessor
 
ListVariable - Class in org.webharvest.runtime.variables
List variable - String wrapper.
ListVariable() - Constructor for class org.webharvest.runtime.variables.ListVariable
 
ListVariable(Iterable) - Constructor for class org.webharvest.runtime.variables.ListVariable
 
LockedRegistry<K,​V> - Class in org.webharvest
Generic locking registry implementation that follows 'decorator' design pattern.
LockedRegistry(Registry<K, V>) - Constructor for class org.webharvest.LockedRegistry
Class constructor expecting Registry to be specified.
log - Static variable in class org.webharvest.definition.XmlNode
 
LOG - Static variable in class org.webharvest.runtime.processors.AbstractProcessor
 
lookup(K) - Method in class org.webharvest.AbstractRegistry
Lookups value bound under the given name.
lookup(K) - Method in interface org.webharvest.Cache
Returns value associated with the specified key.
lookup(K) - Method in class org.webharvest.LockedRegistry
Lookups value bound under the given name.
lookup(K) - Method in interface org.webharvest.Registry
Lookups value bound under the given name.
lookup(K) - Method in class org.webharvest.ThreadLocalCache
Returns value associated with the specified key.
LoopDef - Class in org.webharvest.definition
Definition of loop processor.
LoopDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.LoopDef
 
LoopProcessor - Class in org.webharvest.runtime.processors
Loop list processor.
LoopProcessor() - Constructor for class org.webharvest.runtime.processors.LoopProcessor
 

M

MailAttachPlugin - Class in org.webharvest.runtime.processors.plugins.mail
Mail attachment plugin - can be used only inside mail plugin.
MailAttachPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.mail.MailAttachPlugin
 
MailPlugin - Class in org.webharvest.runtime.processors.plugins.mail
Mail sending processor.
MailPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.mail.MailPlugin
 
MailPluginException - Exception in org.webharvest.runtime.processors.plugins.mail
Runtime exception for MailPlugin
MailPluginException(String) - Constructor for exception org.webharvest.runtime.processors.plugins.mail.MailPluginException
 
MailPluginException(String, Throwable) - Constructor for exception org.webharvest.runtime.processors.plugins.mail.MailPluginException
 
MailPluginException(Throwable) - Constructor for exception org.webharvest.runtime.processors.plugins.mail.MailPluginException
 
main(String[]) - Static method in class org.webharvest.docs.ProcessorReferenceGenerator
 
MIME_TYPES - Static variable in class org.webharvest.WHConstants
 
mimeType - Variable in class org.webharvest.runtime.web.HttpInfo
 
missingAttribute(String, String) - Static method in class org.webharvest.exception.ErrMsg
 
missingTag(String, String) - Static method in class org.webharvest.exception.ErrMsg
 

N

NAME - Static variable in annotation type org.webharvest.ioc.DebugFileLogger
Logger's name
NestedContextFactory - Class in org.webharvest.runtime
Factory capable of creating nested DynamicScopeContext instances.
next() - Method in class org.webharvest.utils.FileListIterator
 
NO_PROXY_SET - Static variable in class org.webharvest.runtime.web.HttpClientManager.ProxySettings
 
NodeVariable - Class in org.webharvest.runtime.variables
Node variable - Single node wrapper.
NodeVariable(Object) - Constructor for class org.webharvest.runtime.variables.NodeVariable
 
notNull(Object) - Static method in class org.webharvest.utils.Assert
 
notNull(Object, String, Object...) - Static method in class org.webharvest.utils.Assert
 
NULL - Static variable in class org.webharvest.definition.XmlNode
 
nvl(Object, String) - Static method in class org.webharvest.utils.CommonUtil
 

O

obtainConnection(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.db.AbstractDatabasePlugin
Returns configured, ready for use database Connection
obtainConnection(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.db.DatabaseJNDIPlugin
Returns configured, ready for use database Connection
obtainConnection(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.db.DatabasePlugin
Returns configured, ready for use database Connection
onAfterScraping(Harvester) - Method in class org.webharvest.events.DefaultHandlerHolder
Receive notification about the scope for particular Harvester that is about to be tore down.
onAfterScraping(Harvester) - Method in class org.webharvest.ioc.ScrapingInterceptor.ScrapingAwareHelper
Notify all registered listeners about scraping scope that is about the end.
onAfterScraping(Harvester) - Method in interface org.webharvest.ScrapingAware
Receive notification about the scope for particular Harvester that is about to be tore down.
onBeforeScraping(Harvester) - Method in class org.webharvest.events.DefaultHandlerHolder
Receive notification about new scope initialized for particular Harvester object.
onBeforeScraping(Harvester) - Method in class org.webharvest.ioc.ScrapingInterceptor.ScrapingAwareHelper
Notify all registered listeners about new scraping scope.
onBeforeScraping(Harvester) - Method in interface org.webharvest.ScrapingAware
Receive notification about new scope initialized for particular Harvester object.
onExecutionEnd(ScraperExecutionEndEvent) - Method in class org.webharvest.runtime.database.StandaloneConnectionPool
Reacts on the end of configuration execution releasing all pooled database connections.
onExecutionError(ScraperExecutionErrorEvent) - Method in class org.webharvest.runtime.database.StandaloneConnectionPool
Reacts on the configuration execution error releasing all pooled database connections.
onExecutionFinished(ScraperExecutionEndEvent) - Method in class org.webharvest.runtime.Scraper
Logs information about time of Scraper's execution on ScraperExecutionEndEvent.
onExecutionStopped(ScraperExecutionStoppedEvent) - Method in class org.webharvest.runtime.Scraper
Logs information that Scraper's execution has been stopped.
onSuccess(List<IElementDef>) - Method in interface org.webharvest.HarvestLoadCallback
Invoked when successfully loaded scraping configuraiton.
onSuccess(DynamicScopeContext) - Method in interface org.webharvest.Harvester.ContextInitCallback
Adjust newly created DynamicScopeContext object shortly before it is used to launch scraping session.
org.apache.commons.httpclient.contrib.ssl - package org.apache.commons.httpclient.contrib.ssl
 
org.webharvest - package org.webharvest
 
org.webharvest.annotation - package org.webharvest.annotation
 
org.webharvest.definition - package org.webharvest.definition
 
org.webharvest.definition.validation - package org.webharvest.definition.validation
 
org.webharvest.deprecated.runtime - package org.webharvest.deprecated.runtime
 
org.webharvest.deprecated.runtime.processors - package org.webharvest.deprecated.runtime.processors
 
org.webharvest.docs - package org.webharvest.docs
 
org.webharvest.events - package org.webharvest.events
 
org.webharvest.exception - package org.webharvest.exception
 
org.webharvest.ioc - package org.webharvest.ioc
 
org.webharvest.runtime - package org.webharvest.runtime
 
org.webharvest.runtime.database - package org.webharvest.runtime.database
 
org.webharvest.runtime.processors - package org.webharvest.runtime.processors
 
org.webharvest.runtime.processors.plugins - package org.webharvest.runtime.processors.plugins
 
org.webharvest.runtime.processors.plugins.db - package org.webharvest.runtime.processors.plugins.db
 
org.webharvest.runtime.processors.plugins.ftp - package org.webharvest.runtime.processors.plugins.ftp
 
org.webharvest.runtime.processors.plugins.mail - package org.webharvest.runtime.processors.plugins.mail
 
org.webharvest.runtime.processors.plugins.variable - package org.webharvest.runtime.processors.plugins.variable
 
org.webharvest.runtime.processors.plugins.webbrowser - package org.webharvest.runtime.processors.plugins.webbrowser
 
org.webharvest.runtime.processors.plugins.zip - package org.webharvest.runtime.processors.plugins.zip
 
org.webharvest.runtime.scripting - package org.webharvest.runtime.scripting
 
org.webharvest.runtime.scripting.jsr - package org.webharvest.runtime.scripting.jsr
 
org.webharvest.runtime.templaters - package org.webharvest.runtime.templaters
 
org.webharvest.runtime.variables - package org.webharvest.runtime.variables
 
org.webharvest.runtime.web - package org.webharvest.runtime.web
 
org.webharvest.utils - package org.webharvest.utils
 

P

parse(ConfigSource) - Method in interface org.webharvest.definition.ConfigParser
 
parse(ConfigSource) - Method in class org.webharvest.definition.SAXConfigParser
parse(InputSource) - Static method in class org.webharvest.utils.XmlUtil
 
parse(InputSource) - Method in class org.webharvest.utils.XmlValidator
 
ParserException - Exception in org.webharvest.exception
General parsing exception.
ParserException() - Constructor for exception org.webharvest.exception.ParserException
 
ParserException(String) - Constructor for exception org.webharvest.exception.ParserException
 
ParserException(String, Throwable) - Constructor for exception org.webharvest.exception.ParserException
 
ParserException(Throwable) - Constructor for exception org.webharvest.exception.ParserException
 
pause(ScraperExecutionPausedEvent) - Method in class org.webharvest.runtime.EventBasedStatusHolder
PAUSED - org.webharvest.runtime.ScraperState
Scraper has been paused.
peek() - Method in class org.webharvest.utils.Stack
 
PluginDef - Interface in org.webharvest.definition
 
PluginDefinitionBuilder - Class in org.webharvest.definition
Builder for creating WebHarvestPluginDef instances.
PluginException - Exception in org.webharvest.exception
Runtime exception occurred during plugin processors registration or creation.
PluginException() - Constructor for exception org.webharvest.exception.PluginException
 
PluginException(String) - Constructor for exception org.webharvest.exception.PluginException
 
PluginException(String, Throwable) - Constructor for exception org.webharvest.exception.PluginException
 
PluginException(Throwable) - Constructor for exception org.webharvest.exception.PluginException
 
PluginFactory - Class in org.webharvest.definition
Factory for creating plugin instances.
PluginFactory() - Constructor for class org.webharvest.definition.PluginFactory
 
PluginRegistrationStrategy - Interface in org.webharvest.definition
Strategy interface for plugin registration.
pop() - Method in class org.webharvest.utils.Stack
 
PostConstructListener - Class in org.webharvest.ioc
TypeListener implementation enabling Guice support for JSR-250 @PostConstruct annotation.
PostConstructListener() - Constructor for class org.webharvest.ioc.PostConstructListener
 
postEvent(E) - Method in interface org.webharvest.Harvest
Post an event with hope it will be routed to target scope.
postEvent(E) - Method in class org.webharvest.runtime.DefaultHarvest
Post an event with hope it will be routed to target scope.
postProcess(ConfigurableResolver) - Method in class org.webharvest.definition.AnnotatedPluginsPostProcessor
Post process provided ConfigurableResolver by registering all Autoscanned web harvest plugins available on classpath under package (and its subpackages) declared in the post processor's constructor.
postProcess(ConfigurableResolver) - Method in interface org.webharvest.definition.ResolverPostProcessor
Triggered on the ConfigurableResolver refresh.
postProcess(SchemaResolver) - Method in interface org.webharvest.definition.validation.SchemaResolverPostProcessor
Triggered on the SchemaResolver refresh.
postProcess(SchemaResolver) - Method in class org.webharvest.definition.validation.SchemaResourcesPostProcessor
Triggered on the SchemaResolver refresh.
prettyPrintXml(String) - Static method in class org.webharvest.utils.XmlUtil
 
prettyPrintXml(Document, Writer) - Static method in class org.webharvest.utils.XmlUtil
 
Processor<TDef extends IElementDef> - Interface in org.webharvest.runtime.processors
 
ProcessorExecutionContext - Class in org.webharvest.runtime.processors
Context object that holds execution state and configuration.
ProcessorExecutionContext(Processor<?>, DynamicScopeContext, Map<String, Object>, long, String) - Constructor for class org.webharvest.runtime.processors.ProcessorExecutionContext
 
ProcessorExecutionStrategy - Interface in org.webharvest.runtime.processors
Strategy interface for processor execution.
ProcessorFactory - Interface in org.webharvest.runtime.processors
Created by rbala on 13.07.2017.
ProcessorReferenceGenerator - Class in org.webharvest.docs
Generator for processor reference documentation from source code annotations.
ProcessorReferenceGenerator() - Constructor for class org.webharvest.docs.ProcessorReferenceGenerator
 
ProcessorStartEvent - Class in org.webharvest.events
Event informing that the specified Processor has been started.
ProcessorStartEvent(Processor) - Constructor for class org.webharvest.events.ProcessorStartEvent
Default class constructor which accepts not null reference to Processor.
ProcessorStopEvent - Class in org.webharvest.events
Event informing that the specified Processor has successfully finished its work.
ProcessorStopEvent(Processor, Map) - Constructor for class org.webharvest.events.ProcessorStopEvent
Default class constructor which accepts not null reference to Processor and its properties.
publish(E) - Method in interface org.webharvest.events.EventSink
Posts an event to all registered handlers.
publish(E) - Method in class org.webharvest.events.HarvesterEventSink
Posts an event to all registered handlers.
push(T) - Method in class org.webharvest.utils.Stack
 
put(K, V) - Method in interface org.webharvest.Cache
Associates the specified value with the specified key in Cache.
put(K, V) - Method in class org.webharvest.ThreadLocalCache
Associates the specified value with the specified key in Cache.
putAttribute(Object, Object) - Method in interface org.webharvest.ioc.AttributeHolder
Stores a new value in this AttributeHolder.
putAttribute(Object, Object) - Method in class org.webharvest.ioc.ScopeAttributeHolder
Stores a new value in this AttributeHolder.

Q

quot - Static variable in class org.webharvest.utils.SystemUtilities
 

R

readBodyAsArray() - Method in class org.webharvest.runtime.web.HttpResponseWrapper
 
readBytesFromFile(File) - Static method in class org.webharvest.utils.CommonUtil
 
readStringFromFile(File, String) - Static method in class org.webharvest.utils.CommonUtil
 
readStringFromUrl(URL) - Static method in class org.webharvest.utils.CommonUtil
Reads content from specified URL
READY - org.webharvest.runtime.ScraperState
Scraper is ready to use.
RealBodyProcessor - Class in org.webharvest.runtime.processors
 
RealBodyProcessor() - Constructor for class org.webharvest.runtime.processors.RealBodyProcessor
 
recognize(String) - Static method in enum org.webharvest.runtime.scripting.ScriptingLanguage
 
recordCount() - Method in class org.webharvest.utils.SystemUtilities
Returns the number of records processed in the current context.
refresh() - Method in class org.webharvest.definition.AbstractRefreshableResolver
Refresh the current resolver instance, causing all configuration element definitions to be reinitialized.
refresh() - Method in interface org.webharvest.definition.ConfigurableResolver
Refresh the current resolver instance, causing all configuration element definitions to be reinitialized.
refresh() - Method in class org.webharvest.definition.validation.SchemaFactoryImpl
Refresh the current resolver instance, causing all XML schema sources to be reinitialized.
refresh() - Method in interface org.webharvest.definition.validation.SchemaResolver
Refresh the current resolver instance, causing all XML schema sources to be reinitialized.
refreshSchemaResolver() - Static method in class org.webharvest.utils.XmlUtil
Force refresh of the schema resolver to pick up XSD changes.
RegexpDef - Class in org.webharvest.definition
Definition of regular expression processor.
RegexpDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.RegexpDef
 
RegexpPatternProcessor - Class in org.webharvest.runtime.processors
 
RegexpPatternProcessor() - Constructor for class org.webharvest.runtime.processors.RegexpPatternProcessor
 
RegexpProcessor - Class in org.webharvest.runtime.processors
Regular expression replace processor.
RegexpProcessor() - Constructor for class org.webharvest.runtime.processors.RegexpProcessor
 
RegexpResultProcessor - Class in org.webharvest.runtime.processors
 
RegexpResultProcessor() - Constructor for class org.webharvest.runtime.processors.RegexpResultProcessor
 
RegexpSourceProcessor - Class in org.webharvest.runtime.processors
 
RegexpSourceProcessor() - Constructor for class org.webharvest.runtime.processors.RegexpSourceProcessor
 
register(EventHandler<?>) - Method in class org.webharvest.events.DefaultHandlerHolder
Register event handler.
register(EventHandler<?>) - Method in interface org.webharvest.events.HandlerHolder
Register event handler.
registerDriver(String) - Method in enum org.webharvest.runtime.database.DefaultDriverManager
Registers driver with the provided class name.
registerDriver(String) - Method in interface org.webharvest.runtime.database.DriverManager
Registers driver with the provided class name.
registerPlugin(Class<? extends WebHarvestPlugin>, ConfigurableResolver) - Method in class org.webharvest.definition.DefaultPluginRegistrationStrategy
 
registerPlugin(Class<? extends WebHarvestPlugin>, ConfigurableResolver) - Method in class org.webharvest.definition.ElementInfoPluginRegistrationStrategy
 
registerPlugin(Class<? extends WebHarvestPlugin>, ConfigurableResolver) - Method in interface org.webharvest.definition.PluginRegistrationStrategy
Registers a plugin in the resolver using the specified strategy.
registerPlugin(Class, String) - Method in interface org.webharvest.definition.ConfigurableResolver
Deprecated.
registerPlugin(Class, String) - Method in class org.webharvest.definition.DefinitionResolver
registerPlugin(String, String) - Method in interface org.webharvest.definition.ConfigurableResolver
Deprecated.
registerPlugin(String, String) - Method in class org.webharvest.definition.DefinitionResolver
Deprecated.
registerPlugin(ElementInfo, String) - Method in interface org.webharvest.definition.ConfigurableResolver
Register provided WebHarvestPlugin based on information provided by specified ElementInfo.
registerPlugin(ElementInfo, String) - Method in class org.webharvest.definition.DefinitionResolver
Register provided WebHarvestPlugin based on information provided by specified ElementInfo.
registerSchemaSource(SchemaSource) - Method in class org.webharvest.definition.validation.SchemaFactoryImpl
Register provided SchemaSource which cannot be null
registerSchemaSource(SchemaSource) - Method in interface org.webharvest.definition.validation.SchemaResolver
Register provided SchemaSource which cannot be null
Registry<K,​V> - Interface in org.webharvest
Generic registry interface following 'registry' design pattern.
reload() - Method in interface org.webharvest.definition.Config
Loads configuration from resource defined by Config.getConfigSource().
reload() - Method in class org.webharvest.definition.XMLConfig
Loads configuration from resource defined by Config.getConfigSource().
remove() - Method in class org.webharvest.utils.FileListIterator
 
removeDriverResource(URI) - Method in enum org.webharvest.runtime.database.DefaultDriverManager
Removes previously added database driver resource.
removeDriverResource(URI) - Method in interface org.webharvest.runtime.database.DriverManager
Removes previously added database driver resource.
replaceExistingVar(String, Variable) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
Deprecated.
 
replaceExistingVar(String, Variable) - Method in interface org.webharvest.runtime.DynamicScopeContext
 
replaceExistingVar(String, Variable) - Method in class org.webharvest.runtime.ScraperContext
 
replaceTop(T) - Method in class org.webharvest.utils.Stack
 
requiredAttributes() - Method in annotation type org.webharvest.annotation.Definition
This method should return all mandatory attribute names for the plugin processor.
requiredSubprocessors() - Method in annotation type org.webharvest.annotation.Definition
This method should return all mandatory subprocessor names, or in other words all mandatory subtags that must be present in the body of this processor plugin.
ResolverPostProcessor - Interface in org.webharvest.definition
Allows for custom modification of the web harvest configuration element definition resolvers.
ResourcePathToURITransformer - Class in org.webharvest.definition.validation
An implementation of Transformer interface which supports transformation from given resource name (resource path as string) to its URI.
ResourcePathToURITransformer() - Constructor for class org.webharvest.definition.validation.ResourcePathToURITransformer
 
resume(ScraperExecutionContinuedEvent) - Method in class org.webharvest.runtime.EventBasedStatusHolder
ReturnDef - Class in org.webharvest.definition
Definition of function's return statement.
ReturnDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.ReturnDef
 
ReturnProcessor - Class in org.webharvest.runtime.processors
Function's return value processor.
ReturnProcessor() - Constructor for class org.webharvest.runtime.processors.ReturnProcessor
 
run(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.AbstractProcessor
Template method that defines the execution flow.
run(DynamicScopeContext) - Method in interface org.webharvest.runtime.processors.Processor
 
run(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.RunningStatusController
run(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.StoppedOrExitedProcessor
RUNNING - org.webharvest.runtime.ScraperState
Scraper is running.
RunningStatusController<TDef extends IElementDef> - Class in org.webharvest.runtime.processors
AbstractProcessorDecorator implementation which decorates Processor#run(Scraper, DynamicScopeContext) method in the way that it enters to the Monitor using Monitor.Guard verifying that processing is not paused.
RunningStatusController(Processor<TDef>) - Constructor for class org.webharvest.runtime.processors.RunningStatusController
Default class constructor which accepts Processor instance which is going to be decorated by this class.
RunningStatusGuard - Class in org.webharvest.runtime
Implementation of Monitor.Guard verifying that current status of configuration's processing is 'running'.
RunningStatusGuard(Monitor) - Constructor for class org.webharvest.runtime.RunningStatusGuard
Default class constructor which accepts reference to Monitor.
RuntimeConfig - Class in org.webharvest.runtime
Facade for runtime objects needed for specific processors' execution.
RuntimeConfig() - Constructor for class org.webharvest.runtime.RuntimeConfig
 

S

saveStringToFile(File, String, String) - Static method in class org.webharvest.utils.CommonUtil
Saves specified content to the file with specified charset.
SAXConfigParser - Class in org.webharvest.definition
Created by rba on 16.07.2017.
SAXConfigParser() - Constructor for class org.webharvest.definition.SAXConfigParser
 
SchemaComponentFactory - Class in org.webharvest.definition.validation
Factory class providing instances of SchemaResolver and SchemaFactory which actually are the same instance of SchemaFactoryImpl.
SchemaComponentFactory() - Constructor for class org.webharvest.definition.validation.SchemaComponentFactory
 
SchemaFactory - Interface in org.webharvest.definition.validation
Factory creating an instance of Schema which is a base of XML validation process.
SchemaFactoryImpl - Class in org.webharvest.definition.validation
Implementation of SchemaFactory and SchemaResolver interfaces which realizes Abstract Factory and Singleton design patterns.
SchemaResolver - Interface in org.webharvest.definition.validation
Interface to be implemented by the web harvest XML schema sources resolvers.
SchemaResolverPostProcessor - Interface in org.webharvest.definition.validation
Allows for custom modification of the web harvest XML schema sources resolvers.
SchemaResourcesPostProcessor<T> - Class in org.webharvest.definition.validation
SchemaResolverPostProcessor implementation capable of transforming specified XML schema resources which could be e.g. a paths to these resources or some resource object.
SchemaResourcesPostProcessor(Transformer<T, SchemaSource>, T...) - Constructor for class org.webharvest.definition.validation.SchemaResourcesPostProcessor
Default class constructor accepting array of XML schema resources and reference to Transformer instance which allows to convert these resources to SchemaSource objects.
SchemaSource - Class in org.webharvest.definition.validation
An POJO object which contains XML schema's Source.
SchemaSource(InputStream, String) - Constructor for class org.webharvest.definition.validation.SchemaSource
Default class constructors accepting not null InputSteam and system identifier of the XML schema source.
scope(Key<T>, Provider<T>) - Method in class org.webharvest.ioc.AttributeHolderScope
ScopeAttributeHolder - Class in org.webharvest.ioc
Represents an object that implements AttributeHolder and is intended to serve purpose as container for Guice scope's beans.
ScopeAttributeHolder() - Constructor for class org.webharvest.ioc.ScopeAttributeHolder
 
Scraper - Class in org.webharvest.runtime
Basic runtime class.
Scraper() - Constructor for class org.webharvest.runtime.Scraper
Default constructor for non-Guice usage.
Scraper(EventBus) - Constructor for class org.webharvest.runtime.Scraper
Constructor for custom EventBus.
ScraperContext - Class in org.webharvest.runtime
Context of scraper execution.
ScraperContext() - Constructor for class org.webharvest.runtime.ScraperContext
 
ScraperContext10 - Class in org.webharvest.deprecated.runtime
Deprecated.
ScraperContext10() - Constructor for class org.webharvest.deprecated.runtime.ScraperContext10
Deprecated.
 
ScraperExecutionContinuedEvent - Class in org.webharvest.events
 
ScraperExecutionContinuedEvent(Harvester) - Constructor for class org.webharvest.events.ScraperExecutionContinuedEvent
 
ScraperExecutionEndEvent - Class in org.webharvest.events
Event informing that the execution of WebScraper has been successfully completed.
ScraperExecutionEndEvent(WebScraper, long) - Constructor for class org.webharvest.events.ScraperExecutionEndEvent
Constructs ScraperExecutionEndEvent accepting reference to WebScraper which execution has been completed
ScraperExecutionErrorEvent - Class in org.webharvest.events
Event informing that during the execution of Scraper some exception has occurred.
ScraperExecutionErrorEvent(Exception) - Constructor for class org.webharvest.events.ScraperExecutionErrorEvent
Event constructor accepting Exception which occurred during the execution.
ScraperExecutionExitEvent - Class in org.webharvest.events
Event informing that the execution of configuration has exited.
ScraperExecutionExitEvent(String) - Constructor for class org.webharvest.events.ScraperExecutionExitEvent
Constructs ScraperExecutionExitEvent accepting cause of of execution's exit.
ScraperExecutionPausedEvent - Class in org.webharvest.events
 
ScraperExecutionPausedEvent(Harvester) - Constructor for class org.webharvest.events.ScraperExecutionPausedEvent
 
ScraperExecutionStartEvent - Class in org.webharvest.events
 
ScraperExecutionStartEvent(WebScraper) - Constructor for class org.webharvest.events.ScraperExecutionStartEvent
 
ScraperExecutionStoppedEvent - Class in org.webharvest.events
Event informing that the execution of Harvester has been stopped.
ScraperExecutionStoppedEvent(Harvester) - Constructor for class org.webharvest.events.ScraperExecutionStoppedEvent
Constructs ScraperExecutionStoppedEvent accepting reference to Harvester which execution has been stopped.
ScraperModule - Class in org.webharvest.ioc
Guice module for Web-Harvest configuration.
ScraperModule(String) - Constructor for class org.webharvest.ioc.ScraperModule
Creates a ScraperModule with default configuration.
ScraperScope - Class in org.webharvest.ioc
Scraping scope container.
ScraperScope() - Constructor for class org.webharvest.ioc.ScraperScope
 
ScraperState - Enum in org.webharvest.runtime
An enum containing all available Scraper's states.
ScraperXPathException - Exception in org.webharvest.exception
XPath exception - occured during executing xpath processor.
ScraperXPathException() - Constructor for exception org.webharvest.exception.ScraperXPathException
 
ScraperXPathException(String) - Constructor for exception org.webharvest.exception.ScraperXPathException
 
ScraperXPathException(String, Throwable) - Constructor for exception org.webharvest.exception.ScraperXPathException
 
ScraperXPathException(Throwable) - Constructor for exception org.webharvest.exception.ScraperXPathException
 
ScraperXQueryException - Exception in org.webharvest.exception
XQuery exception - occured during executing xquery processor.
ScraperXQueryException() - Constructor for exception org.webharvest.exception.ScraperXQueryException
 
ScraperXQueryException(String) - Constructor for exception org.webharvest.exception.ScraperXQueryException
 
ScraperXQueryException(String, Throwable) - Constructor for exception org.webharvest.exception.ScraperXQueryException
 
ScraperXQueryException(Throwable) - Constructor for exception org.webharvest.exception.ScraperXQueryException
 
Scraping - Annotation Type in org.webharvest.ioc
Guice helper annotation used to indicate methods expected to be invoked in exclusive scraping scope.
ScrapingAware - Interface in org.webharvest
Interface to be implemented by any object that wishes to be notified of scraping scope possibly it runs in.
ScrapingAwareHelper() - Constructor for class org.webharvest.ioc.ScrapingInterceptor.ScrapingAwareHelper
 
ScrapingAwareTypeListener - Class in org.webharvest.ioc
Implementation of TypeListener that is responsible for registration of detected ScrapingAware.
ScrapingAwareTypeListener() - Constructor for class org.webharvest.ioc.ScrapingAwareTypeListener
 
ScrapingHarvester - Class in org.webharvest.runtime
Default implementation of Harvester interface aimed to perform data extraction from remote websites.
ScrapingHarvester(Provider<WebScraper>, ContextFactory, Config) - Constructor for class org.webharvest.runtime.ScrapingHarvester
Class constructor expecting Guice WebScraper provider, DynamicScopeContext factory and Config as already loaded configuration.
ScrapingInterceptor - Class in org.webharvest.ioc
Guice AOP interceptor responsible for taking action for method annotated with Scraping annotation.
ScrapingInterceptor() - Constructor for class org.webharvest.ioc.ScrapingInterceptor
 
ScrapingInterceptor.ScrapingAwareHelper - Class in org.webharvest.ioc
Guice aware helper class that maintains collection of registered ScrapingAware listeners.
ScrapingScope - Annotation Type in org.webharvest.ioc
Guice helper annotation used to indicate types to be instantiated and kept in scraping scope.
ScriptDef - Class in org.webharvest.definition
Definition of script processor.
ScriptDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.ScriptDef
 
ScriptEngine - Interface in org.webharvest.runtime.scripting
Interface providing scripting functionality.
ScriptEngineException - Exception in org.webharvest.exception
Script engine exception - thrown when there is a problem with a script engine itself, not a script source.
ScriptEngineException(Throwable) - Constructor for exception org.webharvest.exception.ScriptEngineException
 
scriptEngineFactory - Static variable in class org.webharvest.runtime.templaters.BaseTemplater
 
ScriptEngineFactory - Interface in org.webharvest.runtime.scripting
Factory creating ScriptEngine instances capable of executing provided ScriptSource.
ScriptException - Exception in org.webharvest.exception
Script execution exception - occurred during script compilation or evaluation.
ScriptException(String, Throwable) - Constructor for exception org.webharvest.exception.ScriptException
 
ScriptException(Throwable) - Constructor for exception org.webharvest.exception.ScriptException
 
ScriptingLanguage - Enum in org.webharvest.runtime.scripting
Created by IntelliJ IDEA.
ScriptingVariable - Class in org.webharvest.runtime.variables
This variables are unwrapped when passing into script engines and preserve the mutable collections in the original state when passing over Scraper
ScriptingVariable(Object) - Constructor for class org.webharvest.runtime.variables.ScriptingVariable
 
ScriptProcessor - Class in org.webharvest.runtime.processors
Script processor - executes script defined in the body.
ScriptProcessor() - Constructor for class org.webharvest.runtime.processors.ScriptProcessor
 
ScriptSource - Class in org.webharvest.runtime.scripting
Created by IntelliJ IDEA.
ScriptSource(String, ScriptingLanguage) - Constructor for class org.webharvest.runtime.scripting.ScriptSource
 
serializeItem(Item, Properties) - Static method in class org.webharvest.utils.CommonUtil
Serializes item after XPath or XQuery processor execution using Saxon.
setCharset(String) - Method in class org.webharvest.definition.HttpDef
 
setCharset(String) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
Deprecated.
Sets default configuration's charset.
setCharset(String) - Method in interface org.webharvest.runtime.DynamicScopeContext
Sets default configuration's charset.
setCharset(String) - Method in class org.webharvest.runtime.ScraperContext
Sets default configuration's charset.
setConfig(Config) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
Deprecated.
setConfig(Config) - Method in interface org.webharvest.runtime.DynamicScopeContext
Deprecated.
setConfig(Config) - Method in class org.webharvest.runtime.ScraperContext
Deprecated.
setContentType(String) - Method in class org.webharvest.definition.HttpDef
 
setCookiePolicy(String) - Method in class org.webharvest.definition.HttpDef
 
setCookiePolicy(String) - Method in class org.webharvest.runtime.web.HttpClientManager
 
setElementDef(TDef) - Method in class org.webharvest.runtime.processors.AbstractProcessor
Sets appropriate element definition to the processor.
setElementDef(TDef) - Method in class org.webharvest.runtime.processors.AbstractProcessorDecorator
Sets appropriate element definition to the processor.
setElementDef(TDef) - Method in interface org.webharvest.runtime.processors.Processor
Deprecated.
Find better solution to create processor in valid state
setFollowRedirects(String) - Method in class org.webharvest.definition.HttpDef
 
setFunctionResult(Variable) - Method in class org.webharvest.runtime.processors.CallProcessor
 
setHostnameVerification(boolean) - Method in class org.apache.commons.httpclient.contrib.ssl.StrictSSLProtocolSocketFactory
Set the host name verification flag.
setIgnoreResponseBody(String) - Method in class org.webharvest.definition.HttpDef
 
setLocalVar(String, Object) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
Deprecated.
setLocalVar(String, Object) - Method in interface org.webharvest.runtime.DynamicScopeContext
 
setLocalVar(String, Object) - Method in class org.webharvest.runtime.ScraperContext
 
setLocalVar(String, Variable) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
Deprecated.
 
setLocalVar(String, Variable) - Method in interface org.webharvest.runtime.DynamicScopeContext
 
setLocalVar(String, Variable) - Method in class org.webharvest.runtime.ScraperContext
 
setLocalVar(Map<String, Object>) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
Deprecated.
setLocalVar(Map<String, Object>) - Method in interface org.webharvest.runtime.DynamicScopeContext
 
setLocalVar(Map<String, Object>) - Method in class org.webharvest.runtime.ScraperContext
 
setLocation(int, int) - Method in class org.webharvest.definition.XmlNode
 
setMethod(String) - Method in class org.webharvest.definition.HttpDef
 
setParentProcessor(Processor) - Method in class org.webharvest.runtime.processors.AbstractProcessor
Sets reference to parent Processor.
setParentProcessor(Processor) - Method in class org.webharvest.runtime.processors.AbstractProcessorDecorator
Sets reference to parent Processor.
setParentProcessor(Processor) - Method in class org.webharvest.runtime.processors.BodyProcessor.Builder
Sets reference to the parent Processor.
setParentProcessor(Processor) - Method in interface org.webharvest.runtime.processors.Processor
Sets reference to parent Processor.
setPassword(String) - Method in class org.webharvest.definition.HttpDef
 
setProperty(String, Object) - Method in class org.webharvest.runtime.processors.AbstractProcessor
Defines processor runtime property with specified name and value.
setProxyCredentialsNTDomain(String) - Method in class org.webharvest.runtime.web.HttpClientManager.ProxySettings.Builder
 
setProxyCredentialsNTHost(String) - Method in class org.webharvest.runtime.web.HttpClientManager.ProxySettings.Builder
 
setProxyCredentialsPassword(String) - Method in class org.webharvest.runtime.web.HttpClientManager.ProxySettings.Builder
 
setProxyCredentialsUsername(String) - Method in class org.webharvest.runtime.web.HttpClientManager.ProxySettings.Builder
 
setProxyPort(int) - Method in class org.webharvest.runtime.web.HttpClientManager.ProxySettings.Builder
 
setResponse(HttpResponseWrapper) - Method in class org.webharvest.runtime.web.HttpInfo
 
setRetryAttempts(String) - Method in class org.webharvest.definition.HttpDef
 
setRetryDelay(String) - Method in class org.webharvest.definition.HttpDef
 
setRetryDelayFactor(String) - Method in class org.webharvest.definition.HttpDef
 
setScriptingLanguage(ScriptingLanguage) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
Deprecated.
setScriptingLanguage(ScriptingLanguage) - Method in interface org.webharvest.runtime.DynamicScopeContext
Deprecated.
setScriptingLanguage(ScriptingLanguage) - Method in class org.webharvest.runtime.ScraperContext
Deprecated.
setUrl(String) - Method in class org.webharvest.definition.HttpDef
 
setUsername(String) - Method in class org.webharvest.definition.HttpDef
 
SetVarPlugin - Class in org.webharvest.runtime.processors.plugins.variable
Support for database operations.
SetVarPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.variable.SetVarPlugin
 
shouldNeverHappen(Throwable) - Static method in class org.webharvest.utils.Assert
 
size() - Method in class org.webharvest.utils.Stack
 
SleepPlugin - Class in org.webharvest.runtime.processors.plugins
 
SleepPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.SleepPlugin
 
space - Static variable in class org.webharvest.utils.SystemUtilities
 
SPLITTER_WIDTH - Static variable in class org.webharvest.WHConstants
 
Stack<T> - Class in org.webharvest.utils
Simple Stack (LIFO queue).
Stack() - Constructor for class org.webharvest.utils.Stack
 
StandaloneConnectionPool - Class in org.webharvest.runtime.database
Default, standalone ConnectionFactory interface implementation.
StandaloneConnectionPool() - Constructor for class org.webharvest.runtime.database.StandaloneConnectionPool
 
statusCode - Variable in class org.webharvest.runtime.web.HttpInfo
 
StatusHolder - Interface in org.webharvest.runtime
Component responsible for providing information about current status of being processed configuration.
statusText - Variable in class org.webharvest.runtime.web.HttpInfo
 
stop(ScraperExecutionStoppedEvent) - Method in class org.webharvest.runtime.EventBasedStatusHolder
Changes status to ScraperState.STOPPED.
STOPPED - org.webharvest.runtime.ScraperState
Scraper has been stopped by the user.
StoppedOrExitedProcessor<TDef extends IElementDef> - Class in org.webharvest.runtime.processors
AbstractProcessorDecorator implementation which decorates Processor#run(Scraper, DynamicScopeContext) method in the way that if status of the processing is 'stopped' or 'exit' then returns EmptyVariable.INSTANCE otherwise it delegates invocation to decorated Processor.
StoppedOrExitedProcessor(Processor<TDef>) - Constructor for class org.webharvest.runtime.processors.StoppedOrExitedProcessor
Default class constructor which accepts Processor instance which is going to be decorated by this class.
StrictSSLProtocolSocketFactory - Class in org.apache.commons.httpclient.contrib.ssl
A SecureProtocolSocketFactory that uses JSSE to create SSL sockets.
StrictSSLProtocolSocketFactory() - Constructor for class org.apache.commons.httpclient.contrib.ssl.StrictSSLProtocolSocketFactory
Constructor for StrictSSLProtocolSocketFactory.
StrictSSLProtocolSocketFactory(boolean) - Constructor for class org.apache.commons.httpclient.contrib.ssl.StrictSSLProtocolSocketFactory
Constructor for StrictSSLProtocolSocketFactory.
StylesheetProcessor - Class in org.webharvest.runtime.processors
 
StylesheetProcessor() - Constructor for class org.webharvest.runtime.processors.StylesheetProcessor
 
subclassesOf(Class<?>) - Static method in class org.webharvest.ioc.TypeMatchers
Matcher matches all classes that extends, implements or is the same as baseClass
SystemUtilities - Class in org.webharvest.utils
Collection of useful constants and functions that are available in each scraper context.
SystemUtilities(DynamicScopeContext) - Constructor for class org.webharvest.utils.SystemUtilities
 

T

tab - Static variable in class org.webharvest.utils.SystemUtilities
 
TargetNamespace - Annotation Type in org.webharvest.runtime.processors.plugins
This annotation may be used on the web harvest plugin class to indicate one or more target XML namespaces for the plugin.
TemplateDef - Class in org.webharvest.definition
Definition of template task.
TemplateDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.TemplateDef
 
TemplateException - Exception in org.webharvest.exception
Template exception - occured during executing templete processor.
TemplateException() - Constructor for exception org.webharvest.exception.TemplateException
 
TemplateException(String) - Constructor for exception org.webharvest.exception.TemplateException
 
TemplateException(String, Throwable) - Constructor for exception org.webharvest.exception.TemplateException
 
TemplateException(Throwable) - Constructor for exception org.webharvest.exception.TemplateException
 
TemplateProcessor - Class in org.webharvest.runtime.processors
Template processor.
TemplateProcessor() - Constructor for class org.webharvest.runtime.processors.TemplateProcessor
 
TemplaterException - Exception in org.webharvest.exception
Templater exception.
TemplaterException() - Constructor for exception org.webharvest.exception.TemplaterException
 
TemplaterException(String) - Constructor for exception org.webharvest.exception.TemplaterException
 
TemplaterException(String, Throwable) - Constructor for exception org.webharvest.exception.TemplaterException
 
TemplaterException(Throwable) - Constructor for exception org.webharvest.exception.TemplaterException
 
TextDef - Class in org.webharvest.definition
Definition of text processor.
TextDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.TextDef
 
TextProcessor - Class in org.webharvest.runtime.processors
Text processor.
TextProcessor() - Constructor for class org.webharvest.runtime.processors.TextProcessor
 
ThreadLocalCache<K,​V> - Class in org.webharvest
Cache implementation based on ThreadLocal, that is, allowing each thread to have separate cache bindings.
ThreadLocalCache() - Constructor for class org.webharvest.ThreadLocalCache
 
time() - Method in class org.webharvest.utils.SystemUtilities
 
TIME_TO_LAUNCH_WEB_SERVER - Static variable in class org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserPlugin
 
toArray() - Method in class org.webharvest.runtime.variables.Variable
Safely converts this variable to array of objects.
toBinary() - Method in class org.webharvest.runtime.variables.EmptyVariable
 
toBinary() - Method in class org.webharvest.runtime.variables.ListVariable
 
toBinary() - Method in class org.webharvest.runtime.variables.NodeVariable
 
toBinary() - Method in class org.webharvest.runtime.variables.Variable
 
toBinary(String) - Method in class org.webharvest.runtime.variables.EmptyVariable
 
toBinary(String) - Method in class org.webharvest.runtime.variables.ListVariable
 
toBinary(String) - Method in class org.webharvest.runtime.variables.NodeVariable
 
toBinary(String) - Method in class org.webharvest.runtime.variables.Variable
 
toBoolean() - Method in class org.webharvest.runtime.variables.Variable
Safely converts this variable to boolean value.
toDouble() - Method in class org.webharvest.runtime.variables.Variable
Safely converts this variable to double value.
toInt() - Method in class org.webharvest.runtime.variables.Variable
Safely converts this variable to integer value.
toIterator() - Method in class org.webharvest.runtime.variables.EmptyVariable
 
toIterator() - Method in class org.webharvest.runtime.variables.ListVariable
 
toIterator() - Method in class org.webharvest.runtime.variables.NodeVariable
 
toIterator() - Method in class org.webharvest.runtime.variables.Variable
 
tokenize(String, String) - Static method in class org.webharvest.utils.CommonUtil
Tokenize given string for specified delimiter(s).
tokenize(String, String, boolean, boolean) - Static method in class org.webharvest.utils.CommonUtil
 
TokenizePlugin - Class in org.webharvest.runtime.processors.plugins
Support for database operations.
TokenizePlugin() - Constructor for class org.webharvest.runtime.processors.plugins.TokenizePlugin
 
toList() - Method in class org.webharvest.runtime.processors.plugins.db.DbRowVariable
 
toList() - Method in class org.webharvest.runtime.variables.EmptyVariable
 
toList() - Method in class org.webharvest.runtime.variables.ListVariable
 
toList() - Method in class org.webharvest.runtime.variables.NodeVariable
 
toList() - Method in class org.webharvest.runtime.variables.Variable
 
toLong() - Method in class org.webharvest.runtime.variables.Variable
Safely converts this variable to long value.
toString() - Method in class org.webharvest.definition.ElementName
 
toString() - Method in class org.webharvest.definition.validation.SchemaSource
Returns system identifier of the XML schema's Source.
toString() - Method in class org.webharvest.runtime.processors.plugins.db.DbRowVariable
 
toString() - Method in class org.webharvest.runtime.variables.EmptyVariable
 
toString() - Method in class org.webharvest.runtime.variables.ListVariable
 
toString() - Method in class org.webharvest.runtime.variables.NodeVariable
 
toString() - Method in class org.webharvest.runtime.variables.Variable
 
toString() - Method in class org.webharvest.utils.CommonUtil.IntPair
 
toString() - Method in class org.webharvest.utils.XmlNodeWrapper
 
toString(String) - Method in class org.webharvest.runtime.processors.plugins.db.DbRowVariable
 
toString(String) - Method in class org.webharvest.runtime.variables.EmptyVariable
 
toString(String) - Method in class org.webharvest.runtime.variables.ListVariable
 
toString(String) - Method in class org.webharvest.runtime.variables.NodeVariable
 
toString(String) - Method in class org.webharvest.runtime.variables.Variable
 
toString(String, String) - Method in class org.webharvest.runtime.variables.ListVariable
 
totalLength - Variable in class org.webharvest.runtime.web.HttpInfo
 
totalResponses - Variable in class org.webharvest.runtime.web.HttpInfo
 
transform(I) - Method in class org.webharvest.definition.validation.TransformerPair
Transforms input object into output object.
transform(I) - Method in interface org.webharvest.Transformer
Transforms input object into output object.
transform(String) - Method in class org.webharvest.definition.validation.ResourcePathToURITransformer
Transforms input object into output object.
transform(URI) - Method in class org.webharvest.definition.validation.URIToSchemaSourceTransformer
Transforms input object into output object.
TransformationException - Exception in org.webharvest
Checked exception thrown if transformation process has failed.
TransformationException(Throwable) - Constructor for exception org.webharvest.TransformationException
TransformationException constructor accepting cause of the exception.
Transformer<I,​O> - Interface in org.webharvest
A component which is capable of transforming object from one type to another type.
TransformerPair<I,​T,​O> - Class in org.webharvest.definition.validation
Implementation of Transformer interface connecting two other Transformers where the output type of the first one is the same as input of the second one.
TransformerPair(Transformer<I, T>, Transformer<T, O>) - Constructor for class org.webharvest.definition.validation.TransformerPair
Default class constructor which accepts reference to two Transformers.
TryDef - Class in org.webharvest.definition
Definition of try-catche element.
TryDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.TryDef
 
TryProcessor - Class in org.webharvest.runtime.processors
OnError processor - sets .
TryProcessor() - Constructor for class org.webharvest.runtime.processors.TryProcessor
 
TYPE_BINARY - Static variable in class org.webharvest.runtime.variables.Types
 
TYPE_DEFAULT - Static variable in class org.webharvest.runtime.variables.Types
 
TYPE_EMPTY - Static variable in class org.webharvest.runtime.variables.Types
 
TYPE_LIST - Static variable in class org.webharvest.runtime.variables.Types
 
TYPE_TEXT - Static variable in class org.webharvest.runtime.variables.Types
 
TYPE_XML - Static variable in class org.webharvest.runtime.variables.Types
 
TypeMatchers - Class in org.webharvest.ioc
 
TypeMatchers() - Constructor for class org.webharvest.ioc.TypeMatchers
 
Types - Class in org.webharvest.runtime.variables
Variable types.
Types() - Constructor for class org.webharvest.runtime.variables.Types
 

U

unbind(K) - Method in class org.webharvest.AbstractRegistry
Unbinds name and the associated value from the registry.
unbind(K) - Method in class org.webharvest.LockedRegistry
Unbinds name and the associated value from the registry.
unbind(K) - Method in interface org.webharvest.Registry
Unbinds name and the associated value from the registry.
UNDEFINED_LOCATION - Static variable in interface org.webharvest.definition.ConfigSource
Configuration source undefined location.
UNKNOWN - org.webharvest.runtime.ScraperState
Deprecated.
unregisterPlugin(String, String) - Method in interface org.webharvest.definition.ConfigurableResolver
Deprecated.
unregisterPlugin(String, String) - Method in class org.webharvest.definition.DefinitionResolver
 
URIToSchemaSourceTransformer - Class in org.webharvest.definition.validation
An implementation of Transformer interface which supports transformation from given resource URI to appropriate instance of SchemaSource.
URIToSchemaSourceTransformer() - Constructor for class org.webharvest.definition.validation.URIToSchemaSourceTransformer
 
URLConfigSource - Class in org.webharvest.definition
Implementation of ConfigSource that uses a HTTP protocol as source of XML configurations.
URLConfigSource(URL) - Constructor for class org.webharvest.definition.URLConfigSource
Class constructor expecting URL as configuration source.
UserException - Exception in org.webharvest.exception
Exception explicitly thrown by a user.
UserException(String) - Constructor for exception org.webharvest.exception.UserException
 

V

v1 - org.webharvest.definition.Config.Version
 
v2 - org.webharvest.definition.Config.Version
 
validAttributes() - Method in annotation type org.webharvest.annotation.Definition
This method should return all possible attribute names for the plugin processor.
validSubprocessors() - Method in annotation type org.webharvest.annotation.Definition
This method should return all names of all allowed processors inside the body of this processor plugin.
value() - Method in annotation type org.webharvest.annotation.Definition
Defines name of the processor.
value() - Method in annotation type org.webharvest.runtime.processors.plugins.TargetNamespace
Indicates target XML namespaces for the plugin.
VALUE_PROPERTY_NAME - Static variable in class org.webharvest.WHConstants
 
valueOf(String) - Static method in enum org.webharvest.definition.Config.Version
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.webharvest.runtime.database.DefaultDriverManager
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.webharvest.runtime.ScraperState
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.webharvest.runtime.scripting.ScriptingLanguage
Returns the enum constant of this type with the specified name.
ValueOfPlugin - Class in org.webharvest.runtime.processors.plugins
 
ValueOfPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.ValueOfPlugin
 
values() - Static method in enum org.webharvest.definition.Config.Version
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.webharvest.runtime.database.DefaultDriverManager
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.webharvest.runtime.ScraperState
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.webharvest.runtime.scripting.ScriptingLanguage
Returns an array containing the constants of this enum type, in the order they are declared.
VAR_END - Static variable in class org.webharvest.runtime.templaters.BaseTemplater
 
VAR_START - Static variable in class org.webharvest.runtime.templaters.BaseTemplater
 
VarDef - Class in org.webharvest.definition
Definition of variable call.
VarDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.VarDef
 
VarDefDef - Class in org.webharvest.definition
Definition of variable.
VarDefDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.VarDefDef
 
VarDefProcessor - Class in org.webharvest.deprecated.runtime.processors
Deprecated.
Use either DefProcessor or SetProcessor
VarDefProcessor() - Constructor for class org.webharvest.deprecated.runtime.processors.VarDefProcessor
Deprecated.
 
Variable - Class in org.webharvest.runtime.variables
Variables Interface.
Variable() - Constructor for class org.webharvest.runtime.variables.Variable
 
VariableException - Exception in org.webharvest.exception
Variable processor exception.
VariableException() - Constructor for exception org.webharvest.exception.VariableException
 
VariableException(String) - Constructor for exception org.webharvest.exception.VariableException
 
VariableException(String, Throwable) - Constructor for exception org.webharvest.exception.VariableException
 
VariableException(Throwable) - Constructor for exception org.webharvest.exception.VariableException
 
VariableName - Class in org.webharvest.runtime.variables
Represents name of the Variable that is set on DynamicScopeContext.
VariableName(String) - Constructor for class org.webharvest.runtime.variables.VariableName
Creates new Variable name.
VarProcessor - Class in org.webharvest.deprecated.runtime.processors
Deprecated.
Use or instead
VarProcessor() - Constructor for class org.webharvest.deprecated.runtime.processors.VarProcessor
Deprecated.
 
visit(ConfigLocationVisitor) - Method in class org.webharvest.definition.AbstractConfigSource
Depending on owned ConfigSource.Location allows the ancestor class to accept intercepted ConfigLocationVisitor.
visit(ConfigLocationVisitor) - Method in class org.webharvest.definition.BufferConfigSource
Depending on owned ConfigSource.Location allows the ancestor class to accept intercepted ConfigLocationVisitor.
visit(ConfigLocationVisitor) - Method in class org.webharvest.definition.FileConfigSource
Depending on owned ConfigSource.Location allows the ancestor class to accept intercepted ConfigLocationVisitor.
visit(ConfigLocationVisitor) - Method in class org.webharvest.definition.URLConfigSource
Depending on owned ConfigSource.Location allows the ancestor class to accept intercepted ConfigLocationVisitor.
visit(FileConfigSource.FileLocation) - Method in interface org.webharvest.definition.ConfigLocationVisitor
Takes certain action on FileConfigSource.FileLocation.
visit(FileConfigSource.FileLocation) - Method in class org.webharvest.definition.IncludeVisitor
Takes certain action on FileConfigSource.FileLocation.
visit(URLConfigSource.URLLocation) - Method in interface org.webharvest.definition.ConfigLocationVisitor
Takes certain action on URLConfigSource.URLLocation.
visit(URLConfigSource.URLLocation) - Method in class org.webharvest.definition.IncludeVisitor
Takes certain action on URLConfigSource.URLLocation.

W

WebBrowserJavascriptPlugin - Class in org.webharvest.runtime.processors.plugins.webbrowser
Evaluates javascript on the page inside headless web browser.
WebBrowserJavascriptPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserJavascriptPlugin
 
WebBrowserLoadPlugin - Class in org.webharvest.runtime.processors.plugins.webbrowser
Load page inside headless web browser.
WebBrowserLoadPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserLoadPlugin
 
WebBrowserlPluginException - Exception in org.webharvest.runtime.processors.plugins.webbrowser
Runtime exception for MailPlugin
WebBrowserlPluginException(String) - Constructor for exception org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserlPluginException
 
WebBrowserlPluginException(String, Throwable) - Constructor for exception org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserlPluginException
 
WebBrowserlPluginException(Throwable) - Constructor for exception org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserlPluginException
 
WebBrowserPlugin - Class in org.webharvest.runtime.processors.plugins.webbrowser
Support headless web browser supported by PhantomJS open source project.
WebBrowserPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserPlugin
 
WebBrowserRenderPlugin - Class in org.webharvest.runtime.processors.plugins.webbrowser
Evaluates javascript on the page inside headless web browser.
WebBrowserRenderPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserRenderPlugin
 
WebHarvestPlugin - Class in org.webharvest.runtime.processors
Base for all user-defined plugins.
WebHarvestPlugin() - Constructor for class org.webharvest.runtime.processors.WebHarvestPlugin
 
WebHarvestPluginDef - Class in org.webharvest.definition
Definition of all plugin processors.
WebHarvestPluginDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.WebHarvestPluginDef
 
WebScraper - Interface in org.webharvest.runtime
 
WHConstants - Class in org.webharvest
Created by IntelliJ IDEA.
WHConstants() - Constructor for class org.webharvest.WHConstants
 
WhileDef - Class in org.webharvest.definition
Definition of while loop processor (while-empty and while-not-empty).
WhileDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.WhileDef
 
WhileProcessor - Class in org.webharvest.runtime.processors
Conditional processor.
WhileProcessor() - Constructor for class org.webharvest.runtime.processors.WhileProcessor
 
withPackageName(String) - Method in class org.webharvest.definition.AnnotatedPluginsPostProcessorBuilder
Sets the package name to scan for plugins.
withPluginClass(Class<? extends Processor>) - Method in class org.webharvest.definition.PluginDefinitionBuilder
Sets the plugin class for the definition.
withRegistrationStrategy(PluginRegistrationStrategy) - Method in class org.webharvest.definition.AnnotatedPluginsPostProcessorBuilder
Sets the registration strategy to use.
withXmlNode(XmlNode) - Method in class org.webharvest.definition.PluginDefinitionBuilder
Sets the XML node for the plugin definition.
WorkingDir - Annotation Type in org.webharvest.ioc
Guice binder helper annotation for scraper's working directory path (indicates where temporary files are kept).

X

x - Variable in class org.webharvest.utils.CommonUtil.IntPair
 
XmlAttribute - Class in org.webharvest.definition
Information about single xml attribute
XmlAttribute(String, String, String) - Constructor for class org.webharvest.definition.XmlAttribute
 
XMLConfig - Class in org.webharvest.definition
Implementation of ConfigSource capable to work with XML based configurations.
XMLConfig(ConfigSource, ConfigParser) - Constructor for class org.webharvest.definition.XMLConfig
Constructor for both Guice and non-Guice usage.
xmlNode - Variable in class org.webharvest.definition.AbstractElementDef
 
XmlNode - Class in org.webharvest.definition
 
XmlNode(String, String, String, XmlNode) - Constructor for class org.webharvest.definition.XmlNode
Constructor that defines name and connects to specified parent element.
XmlNodeWrapper - Class in org.webharvest.utils
 
XmlNodeWrapper(Item, Properties) - Constructor for class org.webharvest.utils.XmlNodeWrapper
 
XMLNS_CORE - Static variable in class org.webharvest.WHConstants
 
XMLNS_CORE_10 - Static variable in class org.webharvest.WHConstants
 
XMLNS_CORE_10_ALIASES - Static variable in class org.webharvest.WHConstants
 
XMLNS_PARAM - Static variable in class org.webharvest.WHConstants
 
XMLNS_VAR - Static variable in class org.webharvest.WHConstants
 
XMLProcessor - Class in org.webharvest.runtime.processors
 
XMLProcessor() - Constructor for class org.webharvest.runtime.processors.XMLProcessor
 
XmlToJsonPlugin - Class in org.webharvest.runtime.processors.plugins
Converter from XML to JSON
XmlToJsonPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.XmlToJsonPlugin
 
XmlUtil - Class in org.webharvest.utils
XML utils - contains common logic for XML handling
XmlUtil() - Constructor for class org.webharvest.utils.XmlUtil
 
XmlValidator - Class in org.webharvest.utils
 
XmlValidator() - Constructor for class org.webharvest.utils.XmlValidator
 
XPathDef - Class in org.webharvest.definition
Definition of XPath processor.
XPathDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.XPathDef
 
XPathProcessor - Class in org.webharvest.runtime.processors
XQuery processor.
XPathProcessor() - Constructor for class org.webharvest.runtime.processors.XPathProcessor
 
XQExpression - Class in org.webharvest.runtime.processors
 
XQExpression() - Constructor for class org.webharvest.runtime.processors.XQExpression
 
XQParamProcessor - Class in org.webharvest.runtime.processors
 
XQParamProcessor() - Constructor for class org.webharvest.runtime.processors.XQParamProcessor
 
XQueryDef - Class in org.webharvest.definition
Definition of XQuery processor.
XQueryDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.XQueryDef
 
XQueryExpressionPool - Class in org.webharvest.runtime.processors
Class represnts simple pool for XQuery expressions.
XQueryExpressionPool(StaticQueryContext) - Constructor for class org.webharvest.runtime.processors.XQueryExpressionPool
 
XQueryExternalParamDef - Class in org.webharvest.definition
Definition of XQuery external parameter.
XQueryExternalParamDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.XQueryExternalParamDef
 
XQueryProcessor - Class in org.webharvest.runtime.processors
XQuery processor.
XQueryProcessor() - Constructor for class org.webharvest.runtime.processors.XQueryProcessor
 
XsltDef - Class in org.webharvest.definition
Definition of user-defined function.
XsltDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.XsltDef
 
XsltException - Exception in org.webharvest.exception
Template exception - occured during executing XSLT processor.
XsltException() - Constructor for exception org.webharvest.exception.XsltException
 
XsltException(String) - Constructor for exception org.webharvest.exception.XsltException
 
XsltException(String, Throwable) - Constructor for exception org.webharvest.exception.XsltException
 
XsltException(Throwable) - Constructor for exception org.webharvest.exception.XsltException
 
XsltProcessor - Class in org.webharvest.runtime.processors
XSLT processor.
XsltProcessor() - Constructor for class org.webharvest.runtime.processors.XsltProcessor
 

Y

y - Variable in class org.webharvest.utils.CommonUtil.IntPair
 

Z

ZipEntryPlugin - Class in org.webharvest.runtime.processors.plugins.zip
Zip entry plugin - can be used only inside zip plugin.
ZipEntryPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.zip.ZipEntryPlugin
 
ZipPlugin - Class in org.webharvest.runtime.processors.plugins.zip
ZIP processor
ZipPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.zip.ZipPlugin
 
ZipPluginException - Exception in org.webharvest.runtime.processors.plugins.zip
Runtime exception for ZipPlugin
ZipPluginException(String) - Constructor for exception org.webharvest.runtime.processors.plugins.zip.ZipPluginException
 
ZipPluginException(String, Throwable) - Constructor for exception org.webharvest.runtime.processors.plugins.zip.ZipPluginException
 
ZipPluginException(Throwable) - Constructor for exception org.webharvest.runtime.processors.plugins.zip.ZipPluginException
 
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z 
All Classes All Packages