A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
All Classes All Packages
All Classes All Packages
All Classes All Packages
A
- AbstractConfigSource - Class in org.webharvest.definition
-
Base class for other more specialized implementations of
ConfigSource
interface. - AbstractConfigSource() - Constructor for class org.webharvest.definition.AbstractConfigSource
- AbstractDatabasePlugin - Class in org.webharvest.runtime.processors.plugins.db
- AbstractDatabasePlugin() - Constructor for class org.webharvest.runtime.processors.plugins.db.AbstractDatabasePlugin
- AbstractElementDef - Class in org.webharvest.definition
- AbstractElementDef(XmlNode) - Constructor for class org.webharvest.definition.AbstractElementDef
- AbstractProcessor<TDef extends IElementDef> - Class in org.webharvest.runtime.processors
-
Base processor that contains common processor logic.
- AbstractProcessor() - Constructor for class org.webharvest.runtime.processors.AbstractProcessor
- AbstractProcessor(ProcessorExecutionStrategy) - Constructor for class org.webharvest.runtime.processors.AbstractProcessor
-
Constructor with custom execution strategy (for testing).
- AbstractProcessorDecorator<TDef extends IElementDef> - Class in org.webharvest.runtime.processors
- AbstractProcessorDecorator(Processor<TDef>) - Constructor for class org.webharvest.runtime.processors.AbstractProcessorDecorator
- AbstractRefreshableResolver - Class in org.webharvest.definition
-
Abstract
ConfigurableResolver
implementation serving as a base class for definition resolvers supposed to support multiple refresh operations. - AbstractRefreshableResolver(Provider<ElementsRegistry>) - Constructor for class org.webharvest.definition.AbstractRefreshableResolver
- AbstractRegistry<K,V> - Class in org.webharvest
-
Abstract class implementing
Registry
interface. - AbstractRegistry() - Constructor for class org.webharvest.AbstractRegistry
- accept(ConfigLocationVisitor) - Method in interface org.webharvest.definition.ConfigLocationVisitor.VisitableLocation
-
Accepts
ConfigLocationVisitor
- adaptFilename(String) - Static method in class org.webharvest.utils.CommonUtil
- add(IElementDef) - Method in class org.webharvest.definition.AbstractElementDef
- addAttribute(String, String, String) - Method in class org.webharvest.definition.XmlNode
-
Adds new attribute with specified name and value.
- addDriverResource(URI) - Method in enum org.webharvest.runtime.database.DefaultDriverManager
-
Adds resource containing database driver (in most cases it is a JAR file).
- addDriverResource(URI) - Method in interface org.webharvest.runtime.database.DriverManager
-
Adds resource containing database driver (in most cases it is a JAR file).
- addElement(String) - Method in class org.webharvest.definition.XmlNode
-
Adds new textBuff to element list
- addElement(XmlNode) - Method in class org.webharvest.definition.XmlNode
-
Adds new subelement.
- addEventHandler(EventHandler<?>) - Method in interface org.webharvest.Harvest
-
Register event handler.
- addEventHandler(EventHandler<?>) - Method in class org.webharvest.runtime.DefaultHarvest
-
Register event handler.
- addFunctionDef(FunctionDef) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
-
Deprecated.
- addFunctionDef(FunctionDef) - Method in interface org.webharvest.runtime.DynamicScopeContext
-
Deprecated.
- addFunctionDef(FunctionDef) - Method in class org.webharvest.runtime.ScraperContext
-
Deprecated.
- addFunctionParam(String, Variable) - Method in class org.webharvest.runtime.processors.CallProcessor
-
Adds parameter of function which is going to call.
- addHttpHeader(String, String) - Method in class org.webharvest.runtime.processors.HttpProcessor
- addHttpParam(String, boolean, String, String, Variable) - Method in class org.webharvest.runtime.processors.HttpProcessor
- addListener(ScrapingAware) - Method in class org.webharvest.ioc.ScrapingInterceptor.ScrapingAwareHelper
-
Register
ScrapingAware
listener. - addPostProcessor(ResolverPostProcessor) - Method in class org.webharvest.definition.AbstractRefreshableResolver
-
Adds new
ResolverPostProcessor
which is going to be applied on the current resolver instance on the resolver refresh. - addPostProcessor(ResolverPostProcessor) - Method in interface org.webharvest.definition.ConfigurableResolver
-
Adds new
ResolverPostProcessor
which is going to be applied on the current resolver instance on the resolver refresh. - addPostProcessor(SchemaResolverPostProcessor) - Method in class org.webharvest.definition.validation.SchemaFactoryImpl
-
Adds new
SchemaResolverPostProcessor
which is going to be applied on the current resolver instance on the resolver refresh. - addPostProcessor(SchemaResolverPostProcessor) - Method in interface org.webharvest.definition.validation.SchemaResolver
-
Adds new
SchemaResolverPostProcessor
which is going to be applied on the current resolver instance on the resolver refresh. - addVariable(Variable) - Method in class org.webharvest.runtime.variables.ListVariable
- ALLOWED_PARAM_TYPES - Static variable in class org.webharvest.runtime.processors.XQueryProcessor
- AlreadyBoundException - Exception in org.webharvest
-
Checked exception thrown if object with the given name is already bound in the registry.
- AlreadyBoundException(String) - Constructor for exception org.webharvest.AlreadyBoundException
-
AlreadyBoundException
constructor accepting explanation message. - AnnotatedPluginsPostProcessor - Class in org.webharvest.definition
-
ResolverPostProcessor
implementation capable of scanning Java class path looking forAutoscanned
web harvest plugins. - AnnotatedPluginsPostProcessor(String) - Constructor for class org.webharvest.definition.AnnotatedPluginsPostProcessor
-
Constructs
AnnotatedPluginsPostProcessor
accepting name of java package which is going to be scanned for the web harvest plugins. - AnnotatedPluginsPostProcessor(String, PluginRegistrationStrategy) - Constructor for class org.webharvest.definition.AnnotatedPluginsPostProcessor
-
Constructs
AnnotatedPluginsPostProcessor
with custom registration strategy. - AnnotatedPluginsPostProcessorBuilder - Class in org.webharvest.definition
-
Builder for creating AnnotatedPluginsPostProcessor instances.
- annotatedWith(Class<? extends Annotation>) - Static method in class org.webharvest.ioc.TypeMatchers
- apos - Static variable in class org.webharvest.utils.SystemUtilities
- appendBinary(Variable) - Method in class org.webharvest.runtime.processors.FileProcessor
- areAllTagsAllowed() - Method in class org.webharvest.definition.ElementInfo
- Assert - Class in org.webharvest.utils
-
Created by IntelliJ IDEA.
- Assert() - Constructor for class org.webharvest.utils.Assert
- AttributeHolder - Interface in org.webharvest.ioc
-
Implementors of this interface can serve as the backing store for Objects that are scoped within an (subclass of)
AttributeHolderScope
. - AttributeHolderScope<AHT extends AttributeHolder> - Class in org.webharvest.ioc
-
A
Scope
that uses anAttributeHolder
as the backing store for its scoped objects. - AttributeHolderScope() - Constructor for class org.webharvest.ioc.AttributeHolderScope
- AuthSSLInitializationError - Error in org.apache.commons.httpclient.contrib.ssl
-
Signals fatal error in initialization of
AuthSSLProtocolSocketFactory
. - AuthSSLInitializationError() - Constructor for error org.apache.commons.httpclient.contrib.ssl.AuthSSLInitializationError
-
Creates a new AuthSSLInitializationError.
- AuthSSLInitializationError(String) - Constructor for error org.apache.commons.httpclient.contrib.ssl.AuthSSLInitializationError
-
Creates a new AuthSSLInitializationError with the specified message.
- AuthSSLProtocolSocketFactory - Class in org.apache.commons.httpclient.contrib.ssl
-
AuthSSLProtocolSocketFactory can be used to validate the identity of the HTTPS server against a list of trusted certificates and to authenticate to the HTTPS server using a private key.
- AuthSSLProtocolSocketFactory(URL, String, URL, String) - Constructor for class org.apache.commons.httpclient.contrib.ssl.AuthSSLProtocolSocketFactory
-
Constructor for AuthSSLProtocolSocketFactory.
- AuthSSLX509TrustManager - Class in org.apache.commons.httpclient.contrib.ssl
-
AuthSSLX509TrustManager can be used to extend the default
X509TrustManager
with additional trust decisions. - AuthSSLX509TrustManager(X509TrustManager) - Constructor for class org.apache.commons.httpclient.contrib.ssl.AuthSSLX509TrustManager
-
Constructor for AuthSSLX509TrustManager.
- Autoscanned - Annotation Type in org.webharvest.runtime.processors.plugins
-
Indicates that the annotated plugin should be automatically registered in system.
B
- backspace - Static variable in class org.webharvest.utils.SystemUtilities
- BaseException - Exception in org.webharvest.exception
-
Basic Scraper exception.
- BaseException() - Constructor for exception org.webharvest.exception.BaseException
- BaseException(String) - Constructor for exception org.webharvest.exception.BaseException
- BaseException(String, Throwable) - Constructor for exception org.webharvest.exception.BaseException
- BaseException(Throwable) - Constructor for exception org.webharvest.exception.BaseException
- BaseTemplater - Class in org.webharvest.runtime.templaters
-
Simple templater - replaces ${expression} sequences in string with evaluated expressions.
- BaseTemplater() - Constructor for class org.webharvest.runtime.templaters.BaseTemplater
- BEANSHELL - org.webharvest.runtime.scripting.ScriptingLanguage
- bind(K, V) - Method in class org.webharvest.AbstractRegistry
-
Binds value under the given name.
- bind(K, V) - Method in class org.webharvest.LockedRegistry
-
Binds value under the given name.
- bind(K, V) - Method in interface org.webharvest.Registry
-
Binds value under the given name.
- bindConfigParser() - Method in class org.webharvest.ioc.ScraperModule
- bindDBConnectionFactory() - Method in class org.webharvest.ioc.ScraperModule
- bindDebugFileLogger() - Method in class org.webharvest.ioc.ScraperModule
- bindScraperContext() - Method in class org.webharvest.ioc.ScraperModule
- bindStatusHolder() - Method in class org.webharvest.ioc.ScraperModule
- body() - Method in annotation type org.webharvest.annotation.Definition
- BodyProcessor - Class in org.webharvest.runtime.processors
-
Processor which executes only body and returns variables list.
- BodyProcessor() - Constructor for class org.webharvest.runtime.processors.BodyProcessor
- BodyProcessor.Builder - Class in org.webharvest.runtime.processors
-
A builder responsible for creating instance of
BodyProcessor
and completing it with appropriateIElementDef
. - BufferConfigSource - Class in org.webharvest.definition
-
Implementation of
ConfigSource
that uses plain old string as source of configuration/ - BufferConfigSource(String) - Constructor for class org.webharvest.definition.BufferConfigSource
-
Class constructor expecting plain old string configuration.
- BufferConfigSource(String, ConfigSource.Location) - Constructor for class org.webharvest.definition.BufferConfigSource
-
Class constructor expecting both the XML configuration as well as configuration location (eg. in situation when the original
ConfigSource
object has been modified and such a change should be reflected in new immutable state. - build() - Method in class org.webharvest.definition.AnnotatedPluginsPostProcessorBuilder
-
Builds the AnnotatedPluginsPostProcessor instance.
- build() - Method in class org.webharvest.definition.PluginDefinitionBuilder
-
Builds the WebHarvestPluginDef instance.
- build() - Method in class org.webharvest.runtime.processors.BodyProcessor.Builder
-
Returns an instance of
BodyProcessor
which is completed with element definition. - build() - Method in class org.webharvest.runtime.web.HttpClientManager.ProxySettings.Builder
- Builder(String) - Constructor for class org.webharvest.runtime.web.HttpClientManager.ProxySettings.Builder
- Builder(IElementDef) - Constructor for class org.webharvest.runtime.processors.BodyProcessor.Builder
-
Default builder constructor which accepts
IElementDef
forBodyProcessor
.
C
- Cache<K,V> - Interface in org.webharvest
-
Cache containing semi-persistent, key-value mappings.
- CallDef - Class in org.webharvest.definition
-
Definition of function call.
- CallDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.CallDef
- CallParamDef - Class in org.webharvest.definition
-
Definition of function call parameter.
- CallParamDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.CallParamDef
- CallParamProcessor - Class in org.webharvest.runtime.processors
-
Variable definition http param processor.
- CallParamProcessor() - Constructor for class org.webharvest.runtime.processors.CallParamProcessor
- CallProcessor - Class in org.webharvest.runtime.processors
-
Function call processor.
- CallProcessor() - Constructor for class org.webharvest.runtime.processors.CallProcessor
- CallProcessor10 - Class in org.webharvest.deprecated.runtime.processors
-
Function call processor.
- CallProcessor10() - Constructor for class org.webharvest.deprecated.runtime.processors.CallProcessor10
- CaseDef - Class in org.webharvest.definition
-
Definition of case processor.
- CaseDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.CaseDef
- CaseProcessor - Class in org.webharvest.runtime.processors
-
Conditional processor.
- CaseProcessor() - Constructor for class org.webharvest.runtime.processors.CaseProcessor
- CatchProcessor - Class in org.webharvest.runtime.processors
- CatchProcessor() - Constructor for class org.webharvest.runtime.processors.CatchProcessor
- charset - Variable in class org.webharvest.runtime.web.HttpInfo
- checkClientTrusted(X509Certificate[], String) - Method in class org.apache.commons.httpclient.contrib.ssl.AuthSSLX509TrustManager
- checkClientTrusted(X509Certificate[], String) - Method in class org.apache.commons.httpclient.contrib.ssl.EasyX509TrustManager
- checkServerTrusted(X509Certificate[], String) - Method in class org.apache.commons.httpclient.contrib.ssl.AuthSSLX509TrustManager
- checkServerTrusted(X509Certificate[], String) - Method in class org.apache.commons.httpclient.contrib.ssl.EasyX509TrustManager
- ClassLoaderUtil - Class in org.webharvest.utils
-
Class loading utility - used for loading JDBC driver classes and plugin classes.
- ClassPathScanner - Interface in org.webharvest.utils
-
Component capable of scanning types available on Java class path that meet certain criteria.
- ClassPathScannerImpl - Class in org.webharvest.utils
-
ClassPathScanner
interface implementation using Reflections library under the hood. - ClassPathScannerImpl(String) - Constructor for class org.webharvest.utils.ClassPathScannerImpl
-
Constructor accepting name of the java package.
- client - Variable in class org.webharvest.runtime.web.HttpInfo
- close() - Method in class org.webharvest.runtime.web.HttpResponseWrapper
- CommonUtil - Class in org.webharvest.utils
-
Basic evaluation utilities
- CommonUtil() - Constructor for class org.webharvest.utils.CommonUtil
- CommonUtil.IntPair - Class in org.webharvest.utils
-
Contains pair of integer values
- compareTo(Object) - Method in class org.webharvest.definition.ElementName
- Config - Interface in org.webharvest.definition
-
Represents lazily loaded configuration object that is loaded from associated
ConfigSource
on demand with call toConfig.reload()
method. - Config.Version - Enum in org.webharvest.definition
- ConfigDef - Class in org.webharvest.definition
-
Web Harvest's definition of the
<config>
element. - ConfigDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.ConfigDef
-
Default class constructor which obtains from given
XmlNode
attributes such as charset and scripting language. - ConfigFactory - Interface in org.webharvest.definition
-
Represents simple
Config
factory. - ConfigLocationVisitor - Interface in org.webharvest.definition
-
Represents object implementing Visitor patter.
- ConfigLocationVisitor.VisitableLocation - Interface in org.webharvest.definition
-
Extended version of
ConfigSource.Location
interface that closely cooperate withConfigLocationVisitor
in order to take special action on certainConfigSource.Location
instance of which we haev lost a type. - ConfigParser - Interface in org.webharvest.definition
-
Created by rba on 16.07.2017.
- ConfigProcessor - Class in org.webharvest.runtime.processors
- ConfigProcessor() - Constructor for class org.webharvest.runtime.processors.ConfigProcessor
- ConfigSource - Interface in org.webharvest.definition
-
Represents source of XML configuration object.
- ConfigSource.Location - Interface in org.webharvest.definition
-
Just a marker interface to indicate the actual type of location eg. file, url or any other
- ConfigSourceFactory - Interface in org.webharvest.definition
-
Represents factory object capable to instantiate
ConfigSource
objects either fromURL
,File
or just raw XML configuration. - ConfigurableResolver - Interface in org.webharvest.definition
-
Interface to be implemented by the web harvest configuration elements definition resolvers.
- ConfigurationException - Exception in org.webharvest.exception
-
Configuration exception - occurred during parsing configuration file etc.
- ConfigurationException() - Constructor for exception org.webharvest.exception.ConfigurationException
- ConfigurationException(String) - Constructor for exception org.webharvest.exception.ConfigurationException
- ConfigurationException(String, Throwable) - Constructor for exception org.webharvest.exception.ConfigurationException
- ConfigurationException(Throwable) - Constructor for exception org.webharvest.exception.ConfigurationException
- configure() - Method in class org.webharvest.ioc.HttpModule
- configure() - Method in class org.webharvest.ioc.ScraperModule
- ConnectionFactory - Interface in org.webharvest.runtime.database
-
Factory responsible for creating SQL
Connection
s basis on the provided parameters. - ConnectionProxy - Interface in org.webharvest.runtime.database
-
Connection
subinterface implemented byConnection
proxies. - ConstantDef - Class in org.webharvest.definition
-
Definition of constant processor.
- ConstantDef(String, Class<? extends Processor>) - Constructor for class org.webharvest.definition.ConstantDef
- ConstantProcessor - Class in org.webharvest.runtime.processors
-
Constant processor.
- ConstantProcessor() - Constructor for class org.webharvest.runtime.processors.ConstantProcessor
- CONTAINER_PREFIX - Static variable in class org.webharvest.runtime.database.JNDIConnectionFactory
-
Environment-related component's binding subtree used as default by JavaEE container.
- contains(Object) - Method in class org.webharvest.runtime.variables.ListVariable
-
Checks if list contains specified object's string representation
- contains(K) - Method in interface org.webharvest.Cache
-
Returns {@link true} if
Cache
contains object identified by the given key. - contains(K) - Method in class org.webharvest.ThreadLocalCache
-
Returns {@link true} if
Cache
contains object identified by the given key. - containsVar(String) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
-
Deprecated.
- containsVar(String) - Method in interface org.webharvest.runtime.DynamicScopeContext
- containsVar(String) - Method in class org.webharvest.runtime.ScraperContext
- contentLength - Variable in class org.webharvest.runtime.web.HttpInfo
- ContextFactory - Interface in org.webharvest.ioc
-
Factory creating brand new instances of
DynamicScopeContext
. - ContextHolder - Interface in org.webharvest.runtime
-
Component responsible for providing current
DynamicScopeContext
which is used by the Scraper. - convertIntoJndiName(String) - Method in class org.webharvest.runtime.database.JNDIConnectionFactory
-
Converts provided name to conform java enterprise JNDI naming policy.
- countChars(String, char, int, int) - Static method in class org.webharvest.utils.CommonUtil
-
Counts number of specified characters in give text.
- cr - Static variable in class org.webharvest.utils.SystemUtilities
- create() - Static method in class org.webharvest.definition.AnnotatedPluginsPostProcessorBuilder
-
Creates a new builder instance.
- create() - Static method in class org.webharvest.definition.PluginDefinitionBuilder
-
Creates a new builder instance.
- create(File) - Method in interface org.webharvest.definition.ConfigSourceFactory
-
Create new instance of
ConfigSource
fromFile
. - create(String) - Method in interface org.webharvest.definition.ConfigSourceFactory
-
Create new instance of
ConfigSource
from raw XML content. - create(URL) - Method in interface org.webharvest.definition.ConfigSourceFactory
-
Create new instance of
ConfigSource
fromURL
. - create(Config) - Method in interface org.webharvest.ioc.ContextFactory
-
Returns brand new instance of
DynamicScopeContext
. - create(Config) - Method in interface org.webharvest.ioc.HarvesterFactory
- create(ConfigSource) - Method in interface org.webharvest.definition.ConfigFactory
-
Produces new instance of
Config
for specifiedConfigSource
resource. - create(DynamicScopeContext) - Static method in class org.webharvest.runtime.NestedContextFactory
-
Creates new instance of
DynamicScopeContext
. - createContext() - Method in class org.webharvest.runtime.database.JNDIConnectionFactory
- createDataSourceOfVariable(Variable, String, String) - Static method in class org.webharvest.runtime.processors.plugins.mail.MailPlugin
- createElementDefinition(XmlNode) - Method in interface org.webharvest.definition.ConfigurableResolver
-
Creates proper element definition instance based on given xml node from input configuration.
- createElementDefinition(XmlNode) - Method in class org.webharvest.definition.DefinitionResolver
-
Creates proper element definition instance based on given xml node from input configuration.
- createListOfXmlNodes(XQueryExpression, DynamicQueryContext) - Static method in class org.webharvest.utils.XmlUtil
-
Creates list variable of resulting XML nodes.
- createPlugin() - Method in interface org.webharvest.definition.PluginDef
- createPlugin() - Method in class org.webharvest.definition.WebHarvestPluginDef
- createPlugin(Class<? extends Processor>, IElementDef) - Static method in class org.webharvest.definition.PluginFactory
-
Creates a new plugin instance with proper dependency injection.
- createProcessor(DynamicScopeContext, TDef) - Method in class org.webharvest.runtime.processors.DefaultProcessorFactory
- createProcessor(DynamicScopeContext, TDef) - Method in interface org.webharvest.runtime.processors.ProcessorFactory
- createSocket(String, int) - Method in class org.apache.commons.httpclient.contrib.ssl.AuthSSLProtocolSocketFactory
- createSocket(String, int) - Method in class org.apache.commons.httpclient.contrib.ssl.EasySSLProtocolSocketFactory
- createSocket(String, int) - Method in class org.apache.commons.httpclient.contrib.ssl.StrictSSLProtocolSocketFactory
- createSocket(String, int, InetAddress, int) - Method in class org.apache.commons.httpclient.contrib.ssl.AuthSSLProtocolSocketFactory
- createSocket(String, int, InetAddress, int) - Method in class org.apache.commons.httpclient.contrib.ssl.EasySSLProtocolSocketFactory
- createSocket(String, int, InetAddress, int) - Method in class org.apache.commons.httpclient.contrib.ssl.StrictSSLProtocolSocketFactory
- createSocket(String, int, InetAddress, int, HttpConnectionParams) - Method in class org.apache.commons.httpclient.contrib.ssl.AuthSSLProtocolSocketFactory
-
Attempts to get a new socket connection to the given host within the given time limit.
- createSocket(String, int, InetAddress, int, HttpConnectionParams) - Method in class org.apache.commons.httpclient.contrib.ssl.EasySSLProtocolSocketFactory
-
Attempts to get a new socket connection to the given host within the given time limit.
- createSocket(String, int, InetAddress, int, HttpConnectionParams) - Method in class org.apache.commons.httpclient.contrib.ssl.StrictSSLProtocolSocketFactory
-
Attempts to get a new socket connection to the given host within the given time limit.
- createSocket(Socket, String, int, boolean) - Method in class org.apache.commons.httpclient.contrib.ssl.AuthSSLProtocolSocketFactory
- createSocket(Socket, String, int, boolean) - Method in class org.apache.commons.httpclient.contrib.ssl.EasySSLProtocolSocketFactory
- createSocket(Socket, String, int, boolean) - Method in class org.apache.commons.httpclient.contrib.ssl.StrictSSLProtocolSocketFactory
- createVariable(Object) - Static method in class org.webharvest.utils.CommonUtil
-
Creates appropriate AbstractVariable instance for the specified object.
D
- DatabaseException - Exception in org.webharvest.exception
-
Database execution exception - occured during executing database processor.
- DatabaseException() - Constructor for exception org.webharvest.exception.DatabaseException
- DatabaseException(String) - Constructor for exception org.webharvest.exception.DatabaseException
- DatabaseException(String, Throwable) - Constructor for exception org.webharvest.exception.DatabaseException
- DatabaseException(Throwable) - Constructor for exception org.webharvest.exception.DatabaseException
- DatabaseJNDIPlugin - Class in org.webharvest.runtime.processors.plugins.db
-
Web Harvest plugin supporting database operations within JNDI - enabled environment.
- DatabaseJNDIPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.db.DatabaseJNDIPlugin
- DatabasePlugin - Class in org.webharvest.runtime.processors.plugins.db
-
Support for database operations.
- DatabasePlugin() - Constructor for class org.webharvest.runtime.processors.plugins.db.DatabasePlugin
- date() - Method in class org.webharvest.utils.SystemUtilities
-
Returns current date formatted as "yyyyMMdd".
- datetime(Object) - Method in class org.webharvest.utils.SystemUtilities
-
Returns formatted date/time for specified format string.
- DbColumnDescription - Class in org.webharvest.runtime.processors.plugins.db
-
Information about database record columns.
- DbColumnDescription(String, int) - Constructor for class org.webharvest.runtime.processors.plugins.db.DbColumnDescription
- DbParamPlugin - Class in org.webharvest.runtime.processors.plugins.db
-
DB param plugin - can be used only inside database plugin.
- DbParamPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.db.DbParamPlugin
- DbRowVariable - Class in org.webharvest.runtime.processors.plugins.db
-
Special variable used for database records
- DbRowVariable(DbColumnDescription[], Object[]) - Constructor for class org.webharvest.runtime.processors.plugins.db.DbRowVariable
- debug(IElementDef, DynamicScopeContext, Variable) - Method in class org.webharvest.runtime.processors.AbstractProcessor
- DebugFileLogger - Annotation Type in org.webharvest.ioc
- decoratedProcessor - Variable in class org.webharvest.runtime.processors.AbstractProcessorDecorator
- DEFAULT_CONTENT_TYPE - Static variable in class org.webharvest.definition.HttpDef
- DEFAULT_MAX_LOOPS - Static variable in class org.webharvest.WHConstants
- DEFAULT_METHOD - Static variable in class org.webharvest.definition.HttpDef
- DEFAULT_PARAM_TYPE - Static variable in class org.webharvest.runtime.processors.XQueryProcessor
- DEFAULT_USER_AGENT - Static variable in class org.webharvest.runtime.web.HttpClientManager
- DefaultDriverManager - Enum in org.webharvest.runtime.database
-
Default implementation of the
DriverManger
interface. - DefaultHandlerHolder - Class in org.webharvest.events
-
Default implementation of
HandlerHolder
storing references to registeredEventHandler
listeners. - DefaultHandlerHolder(Registry<Harvester, EventBus>, Provider<EventBus>) - Constructor for class org.webharvest.events.DefaultHandlerHolder
-
Default class constructor specifying the
Registry
storing associations between Harvester's scope and particularEventBus
that is bound to it. - DefaultHarvest - Class in org.webharvest.runtime
-
Default implementation of
Harvest
interface. - DefaultHarvest(ConfigFactory, HarvesterFactory, HandlerHolder, EventSink) - Constructor for class org.webharvest.runtime.DefaultHarvest
-
Default class constructor specifying
HarvesterFactory
,HandlerHolder
andEventSink
that are expected to be Guice injected. - DefaultPluginRegistrationStrategy - Class in org.webharvest.definition
-
Default implementation of PluginRegistrationStrategy.
- DefaultPluginRegistrationStrategy() - Constructor for class org.webharvest.definition.DefaultPluginRegistrationStrategy
- DefaultProcessorExecutionStrategy - Class in org.webharvest.runtime.processors
-
Default implementation of processor execution strategy.
- DefaultProcessorExecutionStrategy() - Constructor for class org.webharvest.runtime.processors.DefaultProcessorExecutionStrategy
- DefaultProcessorFactory - Class in org.webharvest.runtime.processors
-
Created by rbala on 13.07.2017.
- DefaultProcessorFactory() - Constructor for class org.webharvest.runtime.processors.DefaultProcessorFactory
- defineFromString(String, char, int) - Method in class org.webharvest.utils.CommonUtil.IntPair
- defineVariable(String, Object) - Method in class org.webharvest.utils.SystemUtilities
-
Adds or/replaces variable in scraper's context.
- defineVariable(String, Object, boolean) - Method in class org.webharvest.utils.SystemUtilities
-
Adds or/replaces variable in scraper's context.
- Definition - Annotation Type in org.webharvest.annotation
-
Annotation on
Processor
classes specifying theIElementDef
this particular processor is described by. - definitionClass() - Method in annotation type org.webharvest.annotation.Definition
-
The definition type that the processor is mapped to.
- DefinitionResolver - Class in org.webharvest.definition
-
Class contains information and logic to validate and crate definition classes for parsed xml nodes from Web-Harvest configurations.
- DefinitionResolver() - Constructor for class org.webharvest.definition.DefinitionResolver
-
Constructor for non-Guice usage.
- DefinitionResolver(Provider<ElementsRegistry>) - Constructor for class org.webharvest.definition.DefinitionResolver
- DefinitionResolver(ElementsRegistry) - Constructor for class org.webharvest.definition.DefinitionResolver
-
Constructor for non-Guice usage with custom ElementsRegistry.
- DefinitionResolverProvider - Class in org.webharvest.ioc
-
Created by rba on 22.07.2017.
- DefinitionResolverProvider(List<? extends ResolverPostProcessor>, Provider<ElementsRegistry>) - Constructor for class org.webharvest.ioc.DefinitionResolverProvider
- DefVarPlugin - Class in org.webharvest.runtime.processors.plugins.variable
-
Support for database operations.
- DefVarPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.variable.DefVarPlugin
- dependantProcessors() - Method in annotation type org.webharvest.annotation.Definition
-
Defines dependant subprocessors that are used inside this plugin and that will automatically be registered with this plugin.
- doCall(DynamicScopeContext, Callable<Object>) - Method in class org.webharvest.deprecated.runtime.processors.CallProcessor10
- doCall(DynamicScopeContext, Callable<Object>) - Method in class org.webharvest.runtime.processors.CallProcessor
- doExecute(DynamicScopeContext, String, Variable) - Method in class org.webharvest.runtime.processors.plugins.variable.DefVarPlugin
- doExecute(DynamicScopeContext, String, Variable) - Method in class org.webharvest.runtime.processors.plugins.variable.SetVarPlugin
- DriverManager - Interface in org.webharvest.runtime.database
-
DriverManager
allows to register database drivers placed in the arbitrary locations. - DynamicScopeContext - Interface in org.webharvest.runtime
-
Created by IntelliJ IDEA.
E
- EasySSLProtocolSocketFactory - Class in org.apache.commons.httpclient.contrib.ssl
-
EasySSLProtocolSocketFactory can be used to creats SSL
Socket
s that accept self-signed certificates. - EasySSLProtocolSocketFactory() - Constructor for class org.apache.commons.httpclient.contrib.ssl.EasySSLProtocolSocketFactory
-
Constructor for EasySSLProtocolSocketFactory.
- EasyX509TrustManager - Class in org.apache.commons.httpclient.contrib.ssl
-
EasyX509TrustManager unlike default
X509TrustManager
accepts self-signed certificates. - EasyX509TrustManager(KeyStore) - Constructor for class org.apache.commons.httpclient.contrib.ssl.EasyX509TrustManager
-
Constructor for EasyX509TrustManager.
- elementDef - Variable in class org.webharvest.runtime.processors.AbstractProcessor
- ElementInfo - Class in org.webharvest.definition
- ElementInfo(String, Class<? extends IElementDef>, Class<? extends Processor>, String, String, boolean, Class[]) - Constructor for class org.webharvest.definition.ElementInfo
- ElementInfoFactory - Class in org.webharvest.annotation
-
Deprecated.
- ElementInfoPluginRegistrationStrategy - Class in org.webharvest.definition
-
ElementInfo-based implementation of PluginRegistrationStrategy.
- ElementInfoPluginRegistrationStrategy() - Constructor for class org.webharvest.definition.ElementInfoPluginRegistrationStrategy
- ElementName - Class in org.webharvest.definition
-
Name of the single element (tag in configuraton xml).
- ElementName(String, String) - Constructor for class org.webharvest.definition.ElementName
- ElementsRegistry - Interface in org.webharvest.definition
-
Registry containing all valid Web Harvest definition elements.
- ElementsRegistryImpl - Class in org.webharvest.definition
-
Default implementation of
ElementsRegistry
interface. - ElementsRegistryImpl() - Constructor for class org.webharvest.definition.ElementsRegistryImpl
- ElseProcessor - Class in org.webharvest.runtime.processors
- ElseProcessor() - Constructor for class org.webharvest.runtime.processors.ElseProcessor
- EMPTY - Static variable in class org.webharvest.utils.Stack
- EmptyDef - Class in org.webharvest.definition
-
Definition of empty element.
- EmptyDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.EmptyDef
- EmptyProcessor - Class in org.webharvest.runtime.processors
-
Empty processor = executes body and returns empty variable.
- EmptyProcessor() - Constructor for class org.webharvest.runtime.processors.EmptyProcessor
- EmptyVariable - Class in org.webharvest.runtime.variables
-
Empty variable.
- encodeUrl(String, String) - Static method in class org.webharvest.utils.CommonUtil
- enter(AHT) - Method in class org.webharvest.ioc.AttributeHolderScope
-
Lets the current
Thread
enter thisScope
. - EnvironmentException - Exception in org.webharvest.exception
-
Thrown when current JVM does not support a specific required feature.
- EnvironmentException(Exception) - Constructor for exception org.webharvest.exception.EnvironmentException
- equals(Object) - Method in class org.apache.commons.httpclient.contrib.ssl.EasySSLProtocolSocketFactory
- equals(Object) - Method in class org.apache.commons.httpclient.contrib.ssl.StrictSSLProtocolSocketFactory
- equals(Object) - Method in class org.webharvest.definition.ElementName
- equals(Object) - Method in class org.webharvest.definition.validation.SchemaSource
-
Equals system identifiers of
SchemaSource
specified as parameter and thisSchemaSource
. - equals(Object) - Method in class org.webharvest.runtime.scripting.ScriptSource
- ErrMsg - Class in org.webharvest.exception
-
Error messages utility
- ErrMsg() - Constructor for class org.webharvest.exception.ErrMsg
- error(String) - Method in class org.webharvest.utils.SystemUtilities
- ERROR - org.webharvest.runtime.ScraperState
-
Scraper has finished its work with error.
- escapeXml(Object) - Method in class org.webharvest.utils.SystemUtilities
-
Escapes XML string - special characters: &'"<> are replaced with XML escape sequences: & ' " < >
- escapeXml(String) - Static method in class org.webharvest.utils.CommonUtil
-
Escapes XML string - special characters: &'"<> are replaced with XML escape sequences: & ' " < >
- evaluate(DynamicScopeContext, ScriptSource) - Method in class org.webharvest.runtime.scripting.jsr.JSRScriptEngineAdapter
-
Executes script provided in
ScriptSource
. - evaluate(DynamicScopeContext, ScriptSource) - Method in interface org.webharvest.runtime.scripting.ScriptEngine
-
Executes script provided in
ScriptSource
. - evaluateAttribute(String, DynamicScopeContext) - Method in class org.webharvest.runtime.processors.WebHarvestPlugin
- evaluateAttributeAsBoolean(String, boolean, DynamicScopeContext) - Method in class org.webharvest.runtime.processors.WebHarvestPlugin
- evaluateAttributeAsDouble(String, double, DynamicScopeContext) - Method in class org.webharvest.runtime.processors.WebHarvestPlugin
- evaluateAttributeAsInteger(String, int, DynamicScopeContext) - Method in class org.webharvest.runtime.processors.WebHarvestPlugin
- evaluateToString(String, ScriptingLanguage, DynamicScopeContext) - Static method in class org.webharvest.runtime.templaters.BaseTemplater
- evaluateToVariable(String, ScriptingLanguage, DynamicScopeContext) - Static method in class org.webharvest.runtime.templaters.BaseTemplater
- evaluateXPath(String, String, RuntimeConfig) - Static method in class org.webharvest.utils.XmlUtil
-
Evaluates specified XPath expression against given XML text and using given runtime configuration.
- evaluateXPath(String, Document) - Static method in class org.webharvest.utils.XmlUtil
- EventBasedStatusHolder - Class in org.webharvest.runtime
-
StatusHolder
implementation which bases on Scraper's events. - EventBasedStatusHolder(Monitor) - Constructor for class org.webharvest.runtime.EventBasedStatusHolder
-
Default class constructor which creates
Monitor.Guard
instances using givenMonitor
. - EventBusTypeListener - Class in org.webharvest.ioc
-
Guice
TypeListener
implementation that is responsible for registration of objects managed by Guice in singletonEventBus
. - EventBusTypeListener() - Constructor for class org.webharvest.ioc.EventBusTypeListener
- EventHandler<T> - Interface in org.webharvest.events
-
Represents an object that is handler for particular type of event.
- EventSink - Interface in org.webharvest.events
-
Dispatches events to registered listeners.
- execute(String, Boolean, String, String, String, String, String, Variable, Map<String, HttpParamInfo>, Map, int, long, double) - Method in class org.webharvest.runtime.web.HttpClientManager
- execute(Harvester.ContextInitCallback) - Method in interface org.webharvest.Harvester
-
Execute scraping session in its own scope.
- execute(Harvester.ContextInitCallback) - Method in class org.webharvest.runtime.ScrapingHarvester
-
Execute scraping session in its own scope.
- execute(DynamicScopeContext) - Method in class org.webharvest.deprecated.runtime.processors.VarDefProcessor
-
Deprecated.
- execute(DynamicScopeContext) - Method in class org.webharvest.deprecated.runtime.processors.VarProcessor
-
Deprecated.
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.AbstractProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.BodyProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.CallParamProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.CallProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.CaseProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.ConfigProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.ConstantProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.EmptyProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.ExitProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.FileProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.FunctionProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.HtmlToXmlProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.HttpHeaderProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.HttpParamProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.HttpProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.IncludeProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.LoopProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.RegexpProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.ReturnProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.ScriptProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.TemplateProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.TextProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.TryProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.WebHarvestPlugin
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.WhileProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.XPathProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.XQueryProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.XsltProcessor
- execute(DynamicScopeContext) - Method in class org.webharvest.runtime.Scraper
- execute(DynamicScopeContext) - Method in interface org.webharvest.runtime.WebScraper
- execute(Processor<?>, DynamicScopeContext) - Method in class org.webharvest.runtime.processors.DefaultProcessorExecutionStrategy
- execute(Processor<?>, DynamicScopeContext) - Method in interface org.webharvest.runtime.processors.ProcessorExecutionStrategy
-
Executes the processor with the given context.
- executeBody(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.WebHarvestPlugin
-
Executes body of plugin processor
- executeFunctionCall(Callable<R>) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
-
Deprecated.
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.CatchProcessor
-
Mathod that actually executes processor.
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.ElseProcessor
-
Mathod that actually executes processor.
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.IfProcessor
-
Mathod that actually executes processor.
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.ListProcessor
-
Mathod that actually executes processor.
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.db.AbstractDatabasePlugin
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.db.DbParamPlugin
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpDelPlugin
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpGetPlugin
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpListPlugin
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpMkdirPlugin
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpPlugin
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpPutPlugin
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpRmdirPlugin
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.JsonToXmlPlugin
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.mail.MailAttachPlugin
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.mail.MailPlugin
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.SleepPlugin
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.TokenizePlugin
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.ValueOfPlugin
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.variable.GetVarPlugin
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserJavascriptPlugin
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserLoadPlugin
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserPlugin
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserRenderPlugin
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.XmlToJsonPlugin
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.zip.ZipEntryPlugin
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.zip.ZipPlugin
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.RealBodyProcessor
-
Mathod that actually executes processor.
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.RegexpPatternProcessor
-
Mathod that actually executes processor.
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.RegexpResultProcessor
-
Mathod that actually executes processor.
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.RegexpSourceProcessor
-
Mathod that actually executes processor.
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.StylesheetProcessor
-
Mathod that actually executes processor.
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.WebHarvestPlugin
-
Deprecated.Use execute(...) instead
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.XMLProcessor
-
Mathod that actually executes processor.
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.XQExpression
-
Mathod that actually executes processor.
- executePlugin(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.XQParamProcessor
-
Mathod that actually executes processor.
- executeWithinNewContext(Callable<R>) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
-
Deprecated.
- executeWithinNewContext(Callable<R>) - Method in interface org.webharvest.runtime.DynamicScopeContext
- executeWithinNewContext(Callable<R>) - Method in class org.webharvest.runtime.ScraperContext
- EXECUTION_TIME_PROPERTY_NAME - Static variable in class org.webharvest.WHConstants
- existsInStringArray(String[], String, boolean) - Static method in class org.webharvest.utils.CommonUtil
-
Checks if specified string exists in given array
- exit() - Method in class org.webharvest.ioc.AttributeHolderScope
-
Lets the current
Thread
leave thisScope
. - exit(ScraperExecutionExitEvent) - Method in class org.webharvest.runtime.EventBasedStatusHolder
-
Changes status to
ScraperState.EXIT
. - EXIT - org.webharvest.runtime.ScraperState
-
Scraper's execution has been stopped by exit command.
- ExitDef - Class in org.webharvest.definition
-
Definition of exit processor.
- ExitDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.ExitDef
- ExitProcessor - Class in org.webharvest.runtime.processors
-
Exit processor.
- ExitProcessor() - Constructor for class org.webharvest.runtime.processors.ExitProcessor
F
- FileConfigSource - Class in org.webharvest.definition
-
Implementation of
ConfigSource
that uses a file system as source of XML configurations. - FileConfigSource(File) - Constructor for class org.webharvest.definition.FileConfigSource
-
Class constructor expecting
File
as configuration source. - FileDef - Class in org.webharvest.definition
-
Definition of file proessor.
- FileDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.FileDef
- FileException - Exception in org.webharvest.exception
-
File management exception.
- FileException() - Constructor for exception org.webharvest.exception.FileException
- FileException(String) - Constructor for exception org.webharvest.exception.FileException
- FileException(String, Throwable) - Constructor for exception org.webharvest.exception.FileException
- FileException(Throwable) - Constructor for exception org.webharvest.exception.FileException
- FileListIterator - Class in org.webharvest.utils
- FileListIterator(File, boolean) - Constructor for class org.webharvest.utils.FileListIterator
- FileProcessor - Class in org.webharvest.runtime.processors
-
File processor.
- FileProcessor() - Constructor for class org.webharvest.runtime.processors.FileProcessor
- findParentPlugin(AbstractProcessor) - Static method in class org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserPlugin
- findValueInCollection(Collection, Object) - Static method in class org.webharvest.utils.CommonUtil
-
Searches specified value in given collection
- FINISHED - org.webharvest.runtime.ScraperState
-
Scraper has successfully finished its work.
- FtpDelPlugin - Class in org.webharvest.runtime.processors.plugins.ftp
-
Ftp Del plugin - can be used only inside ftp plugin for deleting file on remote directory.
- FtpDelPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.ftp.FtpDelPlugin
- FtpGetPlugin - Class in org.webharvest.runtime.processors.plugins.ftp
-
Ftp Get plugin - can be used only inside ftp plugin for retrieving file from remote directory.
- FtpGetPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.ftp.FtpGetPlugin
- FtpListPlugin - Class in org.webharvest.runtime.processors.plugins.ftp
-
Ftp List plugin - can be used only inside ftp plugin for listing file in working remote directory.
- FtpListPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.ftp.FtpListPlugin
- FtpMkdirPlugin - Class in org.webharvest.runtime.processors.plugins.ftp
-
Ftp Mkdir plugin - can be used only inside ftp plugin for creating directory on remote directory.
- FtpMkdirPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.ftp.FtpMkdirPlugin
- FtpPlugin - Class in org.webharvest.runtime.processors.plugins.ftp
-
FTP processor
- FtpPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.ftp.FtpPlugin
- FtpPluginException - Exception in org.webharvest.runtime.processors.plugins.ftp
-
Runtime exception for FtpPlugin
- FtpPluginException() - Constructor for exception org.webharvest.runtime.processors.plugins.ftp.FtpPluginException
- FtpPluginException(String) - Constructor for exception org.webharvest.runtime.processors.plugins.ftp.FtpPluginException
- FtpPluginException(String, Throwable) - Constructor for exception org.webharvest.runtime.processors.plugins.ftp.FtpPluginException
- FtpPluginException(Throwable) - Constructor for exception org.webharvest.runtime.processors.plugins.ftp.FtpPluginException
- FtpPutPlugin - Class in org.webharvest.runtime.processors.plugins.ftp
-
Ftp Put plugin - can be used only inside ftp plugin for storing file to remote directory.
- FtpPutPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.ftp.FtpPutPlugin
- FtpRmdirPlugin - Class in org.webharvest.runtime.processors.plugins.ftp
-
Ftp Mkdir plugin - can be used only inside ftp plugin for removing subdirectory on remote directory.
- FtpRmdirPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.ftp.FtpRmdirPlugin
- fullUrl(Object, Object) - Method in class org.webharvest.utils.SystemUtilities
-
Calculates full URL for specified page URL and link which could be full, absolute or relative like there can be found in A or IMG tags.
- fullUrl(String, String) - Static method in class org.webharvest.utils.CommonUtil
-
Calculates full URL for specified page URL and link which could be full, absolute or relative like there can be found in A or IMG tags.
- FunctionDef - Class in org.webharvest.definition
-
Definition of user-defined function.
- FunctionDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.FunctionDef
- FunctionException - Exception in org.webharvest.exception
-
Function processor exception.
- FunctionException() - Constructor for exception org.webharvest.exception.FunctionException
- FunctionException(String) - Constructor for exception org.webharvest.exception.FunctionException
- FunctionException(String, Throwable) - Constructor for exception org.webharvest.exception.FunctionException
- FunctionException(Throwable) - Constructor for exception org.webharvest.exception.FunctionException
- FunctionProcessor - Class in org.webharvest.runtime.processors
-
Function definition processor.
- FunctionProcessor() - Constructor for class org.webharvest.runtime.processors.FunctionProcessor
G
- generateProcessorReference(String) - Method in class org.webharvest.docs.ProcessorReferenceGenerator
- get() - Method in class org.webharvest.ioc.AttributeHolderScope
- get() - Method in class org.webharvest.ioc.DefinitionResolverProvider
- get(int) - Method in class org.webharvest.runtime.processors.plugins.db.DbRowVariable
- get(int) - Method in class org.webharvest.runtime.variables.ListVariable
- get(Object) - Method in class org.webharvest.runtime.processors.plugins.db.DbRowVariable
- get(String) - Method in class org.webharvest.runtime.processors.plugins.db.DbRowVariable
- getAbsoluteFilename(String, String) - Static method in class org.webharvest.utils.CommonUtil
-
For the given working path and file path returns absolute file path.
- getAcceptedIssuers() - Method in class org.apache.commons.httpclient.contrib.ssl.AuthSSLX509TrustManager
- getAcceptedIssuers() - Method in class org.apache.commons.httpclient.contrib.ssl.EasyX509TrustManager
- getAction() - Method in class org.webharvest.definition.FileDef
- getAdvancedXmlEscape() - Method in class org.webharvest.definition.HtmlToXmlDef
- getAllowHtmlInsideAttributes() - Method in class org.webharvest.definition.HtmlToXmlDef
- getAllowMultiWordAttributes() - Method in class org.webharvest.definition.HtmlToXmlDef
- getAttribute(Object) - Method in interface org.webharvest.ioc.AttributeHolder
-
Extracts the
Object
memorized for the specified key from thisAttributeHolder
. - getAttribute(Object) - Method in class org.webharvest.ioc.ScopeAttributeHolder
-
Extracts the
Object
memorized for the specified key from thisAttributeHolder
. - getAttribute(String) - Method in class org.webharvest.definition.XmlNode
- getAttribute(String, String) - Method in class org.webharvest.definition.XmlNode
- getAttributeLock() - Method in interface org.webharvest.ioc.AttributeHolder
-
Returns an object on which to lock when access to multiple methods of the
AttributeHolder
are to be made atomic. - getAttributeLock() - Method in class org.webharvest.ioc.ScopeAttributeHolder
-
Returns an object on which to lock when access to multiple methods of the
AttributeHolder
are to be made atomic. - getAttributes() - Method in class org.webharvest.definition.WebHarvestPluginDef
- getAttributes() - Method in class org.webharvest.runtime.processors.WebHarvestPlugin
-
Deprecated.
- getAttributes(String) - Method in class org.webharvest.definition.WebHarvestPluginDef
- getAttributes(String) - Method in class org.webharvest.definition.XmlNode
- getAttributeValueSuggestions(String) - Method in class org.webharvest.runtime.processors.plugins.db.DatabasePlugin
- getAttributeValueSuggestions(String) - Method in class org.webharvest.runtime.processors.plugins.db.DbParamPlugin
- getAttributeValueSuggestions(String) - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpListPlugin
- getAttributeValueSuggestions(String) - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpPutPlugin
- getAttributeValueSuggestions(String) - Method in class org.webharvest.runtime.processors.plugins.mail.MailAttachPlugin
- getAttributeValueSuggestions(String) - Method in class org.webharvest.runtime.processors.plugins.mail.MailPlugin
- getAttributeValueSuggestions(String) - Method in class org.webharvest.runtime.processors.plugins.TokenizePlugin
- getAttributeValueSuggestions(String) - Method in class org.webharvest.runtime.processors.plugins.zip.ZipEntryPlugin
- getAttsSet() - Method in class org.webharvest.definition.ElementInfo
- getAttValues(String) - Method in class org.webharvest.definition.ElementInfo
- getBody() - Method in class org.webharvest.runtime.web.HttpResponseWrapper
-
Deprecated.Left only for backward compatibility. Use
HttpResponseWrapper.readBodyAsArray()
orHttpResponseWrapper.getBodyAsInputStream()
- getBodyAsInputStream() - Method in class org.webharvest.runtime.web.HttpResponseWrapper
- getBodyTextContent(IElementDef, DynamicScopeContext) - Method in class org.webharvest.runtime.processors.AbstractProcessor
- getBodyTextContent(IElementDef, DynamicScopeContext, boolean) - Method in class org.webharvest.runtime.processors.AbstractProcessor
- getBodyTextContent(IElementDef, DynamicScopeContext, boolean, KeyValuePair[]) - Method in class org.webharvest.runtime.processors.AbstractProcessor
- getBooleanAtts() - Method in class org.webharvest.definition.HtmlToXmlDef
- getBooleanValue(String, Boolean) - Static method in class org.webharvest.utils.CommonUtil
-
Reads boolean value from string
- getCatchValueDef() - Method in class org.webharvest.definition.TryDef
- getCharset() - Method in class org.webharvest.definition.ConfigDef
-
Returns default charset for current configuration.
- getCharset() - Method in class org.webharvest.definition.FileDef
- getCharset() - Method in class org.webharvest.definition.HttpDef
- getCharset() - Method in class org.webharvest.definition.TextDef
- getCharset() - Method in class org.webharvest.deprecated.runtime.ScraperContext10
-
Deprecated.Returns default configuration's charset.
- getCharset() - Method in interface org.webharvest.runtime.DynamicScopeContext
-
Returns default configuration's charset.
- getCharset() - Method in class org.webharvest.runtime.ScraperContext
-
Returns default configuration's charset.
- getCharset() - Method in class org.webharvest.runtime.web.HttpResponseWrapper
- getColumnCount() - Method in class org.webharvest.runtime.processors.plugins.db.DbRowVariable
- getColumnName(int) - Method in class org.webharvest.runtime.processors.plugins.db.DbRowVariable
- getColumnNumber() - Method in class org.webharvest.definition.AbstractElementDef
- getColumnNumber() - Method in interface org.webharvest.definition.IElementDef
- getColumnNumber() - Method in class org.webharvest.definition.XmlNode
- getColumnNumber() - Method in class org.webharvest.utils.XmlValidator
- getCompiledExpression(String) - Method in class org.webharvest.runtime.processors.XQueryExpressionPool
- getCondition() - Method in class org.webharvest.definition.ExitDef
- getCondition() - Method in class org.webharvest.definition.IfDef
- getCondition() - Method in class org.webharvest.definition.WhileDef
- getConfig() - Method in class org.webharvest.deprecated.runtime.ScraperContext10
-
Deprecated.
- getConfig() - Method in interface org.webharvest.runtime.DynamicScopeContext
- getConfig() - Method in class org.webharvest.runtime.ScraperContext
- getConfigSource() - Method in interface org.webharvest.definition.Config
-
Gets reference to configuration resource
ConfigSource
. - getConfigSource() - Method in class org.webharvest.definition.IncludeVisitor
- getConfigSource() - Method in class org.webharvest.definition.XMLConfig
-
Gets reference to configuration resource
ConfigSource
. - getConnection(String) - Method in interface org.webharvest.runtime.database.ConnectionFactory
-
Returns SQL
Connection
instance available under the given name (for example under JNDI name within Java EE container). - getConnection(String) - Method in class org.webharvest.runtime.database.JNDIConnectionFactory
-
Returns SQL
Connection
instance available under the given name (for example under JNDI name within Java EE container). - getConnection(String) - Method in class org.webharvest.runtime.database.StandaloneConnectionPool
-
Returns SQL
Connection
instance available under the given name (for example under JNDI name within Java EE container). - getConnection(String, String, String, String) - Method in interface org.webharvest.runtime.database.ConnectionFactory
-
Returns SQL
Connection
instance configured according to provided parameters. - getConnection(String, String, String, String) - Method in class org.webharvest.runtime.database.JNDIConnectionFactory
-
Returns SQL
Connection
instance configured according to provided parameters. - getConnection(String, String, String, String) - Method in class org.webharvest.runtime.database.StandaloneConnectionPool
-
Returns SQL
Connection
instance configured according to provided parameters. - getContentLength() - Method in class org.webharvest.runtime.web.HttpResponseWrapper
- getContenttype() - Method in class org.webharvest.definition.HttpParamDef
- getContentType() - Method in class org.webharvest.definition.HttpDef
- getContentType() - Method in class org.webharvest.runtime.web.HttpParamInfo
- getContext() - Method in interface org.webharvest.runtime.ContextHolder
-
Returns current
DynamicScopeContext
which is used by the Scraper. - getContext() - Method in class org.webharvest.runtime.processors.ProcessorExecutionContext
- getCookiePolicy() - Method in class org.webharvest.definition.HttpDef
- getDefinitionClass() - Method in class org.webharvest.definition.ElementInfo
- getDelimiter() - Method in class org.webharvest.definition.TextDef
- getDependantProcessors() - Method in class org.webharvest.definition.ElementInfo
- getDependantProcessors() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpPlugin
- getDependantProcessors() - Method in class org.webharvest.runtime.processors.plugins.mail.MailPlugin
- getDependantProcessors() - Method in class org.webharvest.runtime.processors.plugins.zip.ZipPlugin
- getDirectoryFromPath(String) - Static method in class org.webharvest.utils.CommonUtil
-
Extracts a filename and directory from an absolute path.
- getDoubleValue(String, double) - Static method in class org.webharvest.utils.CommonUtil
-
Reads double value from string
- getElementDef() - Method in interface org.webharvest.definition.Config
-
Gets reference to configuration root element.
- getElementDef() - Method in class org.webharvest.definition.XMLConfig
-
Gets reference to configuration root element.
- getElementDef() - Method in class org.webharvest.runtime.processors.AbstractProcessor
- getElementDef() - Method in class org.webharvest.runtime.processors.AbstractProcessorDecorator
- getElementDef() - Method in interface org.webharvest.runtime.processors.Processor
- getElementDefs() - Method in class org.webharvest.definition.AbstractElementDef
- getElementDefs() - Method in interface org.webharvest.definition.IElementDef
- getElementInfo(Class<? extends WebHarvestPlugin>) - Static method in class org.webharvest.annotation.ElementInfoFactory
-
Deprecated.Creates new
ElementInfo
object from processor class that is about to be registered. - getElementInfo(String, String) - Method in interface org.webharvest.definition.ConfigurableResolver
-
Deprecated.
- getElementInfo(String, String) - Method in class org.webharvest.definition.DefinitionResolver
- getElementList() - Method in class org.webharvest.definition.XmlNode
- getElementNames() - Method in interface org.webharvest.definition.ConfigurableResolver
-
Deprecated.
- getElementNames() - Method in class org.webharvest.definition.DefinitionResolver
-
Returns names of all known elements.
- getElementsRegistry() - Method in class org.webharvest.definition.AbstractRefreshableResolver
-
Returns current instance of the
ElementsRegistry
. - getElseDef() - Method in class org.webharvest.definition.CaseDef
- getEmail() - Method in class org.webharvest.runtime.processors.plugins.mail.MailPlugin
- getEmpty() - Method in class org.webharvest.definition.LoopDef
- getEmpty() - Method in class org.webharvest.definition.WhileDef
- getEngine(ScriptSource) - Method in class org.webharvest.runtime.scripting.jsr.JSRScriptEngineFactory
-
Returns
ScriptEngine
capable of executing providedScriptSource
. - getEngine(ScriptSource) - Method in interface org.webharvest.runtime.scripting.ScriptEngineFactory
-
Returns
ScriptEngine
capable of executing providedScriptSource
. - getException() - Method in class org.webharvest.events.ScraperExecutionErrorEvent
-
Return reference to the exception occurred.
- getException() - Method in class org.webharvest.utils.XmlValidator
- getExecutionTime() - Method in class org.webharvest.events.ScraperExecutionEndEvent
-
Returns how long the execution took in milliseconds.
- getExecutionTime() - Method in class org.webharvest.runtime.processors.ProcessorExecutionContext
- getExpression() - Method in class org.webharvest.definition.XPathDef
- getExternalParamDefs() - Method in class org.webharvest.definition.XQueryDef
- getFileFromPath(String) - Static method in class org.webharvest.utils.CommonUtil
-
Extracts a filename from an absolute path.
- getFilename() - Method in class org.webharvest.definition.HttpParamDef
- getFilename(String) - Method in class org.webharvest.utils.SystemUtilities
- getFileName() - Method in class org.webharvest.runtime.web.HttpParamInfo
- getFilter() - Method in class org.webharvest.definition.LoopDef
- getFirstSubnode(ElementName) - Method in class org.webharvest.definition.XmlNode
- getFlagCanoneq() - Method in class org.webharvest.definition.RegexpDef
- getFlagCaseInsensitive() - Method in class org.webharvest.definition.RegexpDef
- getFlagDotall() - Method in class org.webharvest.definition.RegexpDef
- getFlagMultiline() - Method in class org.webharvest.definition.RegexpDef
- getFlagUnicodecase() - Method in class org.webharvest.definition.RegexpDef
- getFollowRedirects() - Method in class org.webharvest.definition.HttpDef
- getFtpClient() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpPlugin
- getFunctionDef(String) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
-
Deprecated.
- getFunctionDef(String) - Method in interface org.webharvest.runtime.DynamicScopeContext
-
Deprecated.
- getFunctionDef(String) - Method in class org.webharvest.runtime.ScraperContext
-
Deprecated.
- getHarvester() - Method in interface org.webharvest.events.HarvesterEvent
-
Gets reference of event's source/destination.
- getHarvester() - Method in class org.webharvest.events.ScraperExecutionContinuedEvent
- getHarvester() - Method in class org.webharvest.events.ScraperExecutionPausedEvent
- getHarvester() - Method in class org.webharvest.events.ScraperExecutionStoppedEvent
-
Gets reference of event's source/destination.
- getHarvester(ConfigSource, HarvestLoadCallback) - Method in interface org.webharvest.Harvest
-
Creates new
Harvester
object with configuration with configuration represented byConfigSource
object. - getHarvester(ConfigSource, HarvestLoadCallback) - Method in class org.webharvest.runtime.DefaultHarvest
-
Creates new
Harvester
object with configuration with configuration represented byConfigSource
object. - getHeader(String) - Method in class org.webharvest.runtime.web.HttpInfo
- getHeaders() - Method in class org.webharvest.runtime.web.HttpResponseWrapper
- getHeaders(String) - Method in class org.webharvest.runtime.web.HttpInfo
- getHostnameVerification() - Method in class org.apache.commons.httpclient.contrib.ssl.StrictSSLProtocolSocketFactory
-
Gets the status of the host name verification flag.
- getHttpClient() - Method in class org.webharvest.runtime.web.HttpClientManager
- getHttpInfo() - Method in class org.webharvest.runtime.web.HttpClientManager
- getHyphenReplacement() - Method in class org.webharvest.definition.HtmlToXmlDef
- getId() - Method in class org.webharvest.definition.AbstractElementDef
- getId() - Method in interface org.webharvest.definition.IElementDef
- getIdentifier() - Method in class org.webharvest.runtime.processors.plugins.db.DbColumnDescription
- getIfDefs() - Method in class org.webharvest.definition.CaseDef
- getIgnoreResponseBody() - Method in class org.webharvest.definition.HttpDef
- getIndex() - Method in class org.webharvest.definition.LoopDef
- getIndex() - Method in class org.webharvest.definition.WhileDef
- getInjector() - Static method in class org.webharvest.ioc.InjectorHelper
-
Gets reference to the current
Injector
. - getIntValue(String, int) - Static method in class org.webharvest.utils.CommonUtil
-
Reads integer value from string
- getIsfile() - Method in class org.webharvest.definition.HttpParamDef
- getItem() - Method in class org.webharvest.definition.LoopDef
- getKey() - Method in class org.webharvest.utils.KeyValuePair
- getLanguage() - Method in class org.webharvest.definition.ScriptDef
- getLanguage() - Method in class org.webharvest.definition.TemplateDef
- getLanguage() - Method in class org.webharvest.runtime.scripting.ScriptSource
- getLineNumber() - Method in class org.webharvest.definition.AbstractElementDef
- getLineNumber() - Method in interface org.webharvest.definition.IElementDef
- getLineNumber() - Method in class org.webharvest.definition.XmlNode
- getLineNumber() - Method in class org.webharvest.utils.XmlValidator
- getList() - Method in class org.webharvest.runtime.variables.ListVariable
- getList() - Method in class org.webharvest.utils.Stack
- getListDirs() - Method in class org.webharvest.definition.FileDef
- getListFiles() - Method in class org.webharvest.definition.FileDef
- getListFilter() - Method in class org.webharvest.definition.FileDef
- getListRecursive() - Method in class org.webharvest.definition.FileDef
- getLocation() - Method in class org.webharvest.definition.BufferConfigSource
-
Gets the reference to physical location of configuration.
- getLocation() - Method in interface org.webharvest.definition.ConfigSource
-
Gets the reference to physical location of configuration.
- getLocation() - Method in class org.webharvest.definition.FileConfigSource
-
Gets the reference to physical location of configuration.
- getLocation() - Method in class org.webharvest.definition.URLConfigSource
-
Gets the reference to physical location of configuration.
- getLoopBodyDef() - Method in class org.webharvest.definition.LoopDef
- getLoopValueDef() - Method in class org.webharvest.definition.LoopDef
- getMax() - Method in class org.webharvest.definition.RegexpDef
- getMaxloops() - Method in class org.webharvest.definition.LoopDef
- getMaxLoops() - Method in class org.webharvest.definition.WhileDef
- getMessage() - Method in class org.webharvest.definition.ExitDef
- getMessage() - Method in class org.webharvest.events.ScraperExecutionExitEvent
-
Returns cause of execution's exit.
- getMethod() - Method in class org.webharvest.definition.HttpDef
- getMimeType() - Method in class org.webharvest.runtime.web.HttpResponseWrapper
- getName() - Method in class org.webharvest.definition.CallDef
- getName() - Method in class org.webharvest.definition.CallParamDef
- getName() - Method in class org.webharvest.definition.ElementInfo
- getName() - Method in class org.webharvest.definition.ElementName
- getName() - Method in class org.webharvest.definition.FunctionDef
- getName() - Method in class org.webharvest.definition.HttpHeaderDef
- getName() - Method in class org.webharvest.definition.HttpParamDef
- getName() - Method in class org.webharvest.definition.VarDef
- getName() - Method in class org.webharvest.definition.VarDefDef
- getName() - Method in class org.webharvest.definition.XmlAttribute
- getName() - Method in class org.webharvest.definition.XmlNode
- getName() - Method in class org.webharvest.definition.XQueryExternalParamDef
- getName() - Method in class org.webharvest.runtime.processors.plugins.db.AbstractDatabasePlugin
- getName() - Method in class org.webharvest.runtime.processors.plugins.db.DbColumnDescription
- getName() - Method in class org.webharvest.runtime.processors.plugins.db.DbParamPlugin
- getName() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpDelPlugin
- getName() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpGetPlugin
- getName() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpListPlugin
- getName() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpMkdirPlugin
- getName() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpPlugin
- getName() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpPutPlugin
- getName() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpRmdirPlugin
- getName() - Method in class org.webharvest.runtime.processors.plugins.mail.MailAttachPlugin
- getName() - Method in class org.webharvest.runtime.processors.plugins.mail.MailPlugin
- getName() - Method in class org.webharvest.runtime.processors.plugins.TokenizePlugin
- getName() - Method in class org.webharvest.runtime.processors.plugins.XmlToJsonPlugin
- getName() - Method in class org.webharvest.runtime.processors.plugins.zip.ZipEntryPlugin
- getName() - Method in class org.webharvest.runtime.processors.plugins.zip.ZipPlugin
- getName() - Method in class org.webharvest.runtime.web.HttpParamInfo
- getNamespacesAware() - Method in class org.webharvest.definition.HtmlToXmlDef
- getNamespaceURI() - Method in interface org.webharvest.definition.IElementDef
-
Deprecated.
- getNamespaceURI() - Method in class org.webharvest.definition.WebHarvestPluginDef
- getNextAttachmentName() - Method in class org.webharvest.runtime.processors.plugins.mail.MailPlugin
- getNode() - Method in interface org.webharvest.definition.IElementDef
-
Deprecated.
- getNode() - Method in class org.webharvest.definition.WebHarvestPluginDef
-
Deprecated.
- getNsAttsSet() - Method in class org.webharvest.definition.ElementInfo
- getOmitComments() - Method in class org.webharvest.definition.HtmlToXmlDef
- getOmitDeprecatedTags() - Method in class org.webharvest.definition.HtmlToXmlDef
- getOmitHtmlEnvelope() - Method in class org.webharvest.definition.HtmlToXmlDef
- getOmitUnknownTags() - Method in class org.webharvest.definition.HtmlToXmlDef
- getOmitXmlDecl() - Method in class org.webharvest.definition.HtmlToXmlDef
- getOperationDefs() - Method in class org.webharvest.definition.AbstractElementDef
-
Deprecated.
- getOperationDefs() - Method in class org.webharvest.definition.CaseDef
- getOperationDefs() - Method in interface org.webharvest.definition.IElementDef
-
Deprecated.
- getOperationDefs() - Method in class org.webharvest.definition.LoopDef
- getOperationDefs() - Method in class org.webharvest.definition.RegexpDef
- getOperationDefs() - Method in class org.webharvest.definition.TryDef
- getOperationDefs() - Method in class org.webharvest.definition.XQueryDef
- getOperationDefs() - Method in class org.webharvest.definition.XsltDef
- getOutputType() - Method in class org.webharvest.definition.HtmlToXmlDef
- getOverwrite() - Method in class org.webharvest.definition.VarDefDef
- getParentProcessor() - Method in class org.webharvest.runtime.processors.AbstractProcessor
-
Returns reference to parent
Processor
of this processor. - getParentProcessor() - Method in class org.webharvest.runtime.processors.AbstractProcessorDecorator
-
Returns reference to parent
Processor
of this processor. - getParentProcessor() - Method in interface org.webharvest.runtime.processors.Processor
-
Returns reference to parent
Processor
of this processor. - getPassword() - Method in class org.webharvest.definition.HttpDef
- getPath() - Method in class org.webharvest.definition.FileDef
- getPath() - Method in class org.webharvest.definition.IncludeDef
- getPhantomTemplateAsString() - Static method in class org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserPlugin
- getPluginClass(String) - Static method in class org.webharvest.utils.ClassLoaderUtil
- getProcessor() - Method in class org.webharvest.events.ProcessorStartEvent
-
Returns reference to
Processor
which has been started. - getProcessor() - Method in class org.webharvest.events.ProcessorStopEvent
-
Returns reference to
Processor
which has been stopped. - getProcessor() - Method in class org.webharvest.runtime.processors.ProcessorExecutionContext
- getProcessorClass() - Method in class org.webharvest.definition.ElementInfo
- getProcessorId() - Method in class org.webharvest.runtime.processors.ProcessorExecutionContext
- getProperties() - Method in class org.webharvest.events.ProcessorStopEvent
-
Returns properties of the
Processor
which has been stopped. - getProperties() - Method in class org.webharvest.runtime.processors.ProcessorExecutionContext
- getPrunetags() - Method in class org.webharvest.definition.HtmlToXmlDef
- getQName() - Method in class org.webharvest.definition.XmlNode
- getReader() - Method in class org.webharvest.definition.BufferConfigSource
-
Gets the
Reader
for character stream. - getReader() - Method in class org.webharvest.definition.FileConfigSource
-
Gets the
Reader
for character stream. - getReader() - Method in class org.webharvest.definition.URLConfigSource
-
Gets the
Reader
for character stream. - getReader() - Method in interface org.webharvest.utils.HasReader
-
Gets the
Reader
for character stream. - getRecognizeUnicodeChars() - Method in class org.webharvest.definition.HtmlToXmlDef
- getRegexpPatternDef() - Method in class org.webharvest.definition.RegexpDef
- getRegexpResultDef() - Method in class org.webharvest.definition.RegexpDef
- getRegexpSourceDef() - Method in class org.webharvest.definition.RegexpDef
- getReplace() - Method in class org.webharvest.definition.RegexpDef
- getReplaceNbspWithSp() - Method in class org.webharvest.definition.HtmlToXmlDef
- getRequiredAttributes() - Method in class org.webharvest.runtime.processors.plugins.db.DatabasePlugin
- getRequiredAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpDelPlugin
- getRequiredAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpGetPlugin
- getRequiredAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpListPlugin
- getRequiredAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpMkdirPlugin
- getRequiredAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpPlugin
- getRequiredAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpPutPlugin
- getRequiredAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpRmdirPlugin
- getRequiredAttributes() - Method in class org.webharvest.runtime.processors.plugins.mail.MailPlugin
- getRequiredAttributes() - Method in class org.webharvest.runtime.processors.plugins.zip.ZipEntryPlugin
- getRequiredAttributes() - Method in class org.webharvest.runtime.processors.plugins.zip.ZipPlugin
- getRequiredAttsSet() - Method in class org.webharvest.definition.ElementInfo
- getRequiredSubprocessors() - Method in class org.webharvest.runtime.processors.plugins.mail.MailPlugin
- getRequiredTagsSet() - Method in class org.webharvest.definition.ElementInfo
- getRetryAttempts() - Method in class org.webharvest.definition.HttpDef
- getRetryDelay() - Method in class org.webharvest.definition.HttpDef
- getRetryDelayFactor() - Method in class org.webharvest.definition.HttpDef
- getReturnExpression() - Method in class org.webharvest.definition.ScriptDef
-
For backward compatibility with ver.2b1 only!
- getRunningLevel() - Method in class org.webharvest.runtime.processors.AbstractProcessor
-
Returns information how deep in processors hierarchy is this
Processor
. - getRunningLevel() - Method in class org.webharvest.runtime.processors.AbstractProcessorDecorator
-
Returns information how deep in processors hierarchy is this
Processor
. - getRunningLevel() - Method in interface org.webharvest.runtime.processors.Processor
-
Returns information how deep in processors hierarchy is this
Processor
. - getSAXParserFactory(boolean, boolean) - Static method in class org.webharvest.utils.XmlUtil
- getSchema() - Method in interface org.webharvest.definition.validation.SchemaFactory
-
Returns an instance of
Schema
which should be used to validate a XML file. - getSchema() - Method in class org.webharvest.definition.validation.SchemaFactoryImpl
-
Returns an instance of
Schema
which should be used to validate a XML file. - getSchemaFactory() - Static method in class org.webharvest.definition.validation.SchemaComponentFactory
-
Returns an instance of
SchemaFactory
. - getSchemaResolver() - Static method in class org.webharvest.definition.validation.SchemaComponentFactory
-
Returns an instance of
SchemaResolver
. - getScraper() - Method in class org.webharvest.events.ScraperExecutionEndEvent
-
Returns instance of
WebScraper
which execution has been completed. - getScraper() - Method in class org.webharvest.events.ScraperExecutionStartEvent
- getScriptingLanguage() - Method in class org.webharvest.definition.ConfigDef
-
Returns default
ScriptingLanguage
for current configuration. - getScriptingLanguage() - Method in class org.webharvest.deprecated.runtime.ScraperContext10
-
Deprecated.
- getScriptingLanguage() - Method in interface org.webharvest.runtime.DynamicScopeContext
-
Deprecated.
- getScriptingLanguage() - Method in class org.webharvest.runtime.ScraperContext
-
Deprecated.
- getShortElementName() - Method in class org.webharvest.definition.AbstractElementDef
- getShortElementName() - Method in class org.webharvest.definition.CallDef
- getShortElementName() - Method in class org.webharvest.definition.CallParamDef
- getShortElementName() - Method in class org.webharvest.definition.CaseDef
- getShortElementName() - Method in class org.webharvest.definition.ConstantDef
- getShortElementName() - Method in class org.webharvest.definition.EmptyDef
- getShortElementName() - Method in class org.webharvest.definition.ExitDef
- getShortElementName() - Method in class org.webharvest.definition.FileDef
- getShortElementName() - Method in class org.webharvest.definition.FunctionDef
- getShortElementName() - Method in class org.webharvest.definition.HtmlToXmlDef
- getShortElementName() - Method in class org.webharvest.definition.HttpDef
- getShortElementName() - Method in class org.webharvest.definition.HttpHeaderDef
- getShortElementName() - Method in class org.webharvest.definition.HttpParamDef
- getShortElementName() - Method in interface org.webharvest.definition.IElementDef
- getShortElementName() - Method in class org.webharvest.definition.IfDef
- getShortElementName() - Method in class org.webharvest.definition.IncludeDef
- getShortElementName() - Method in class org.webharvest.definition.LoopDef
- getShortElementName() - Method in class org.webharvest.definition.RegexpDef
- getShortElementName() - Method in class org.webharvest.definition.ReturnDef
- getShortElementName() - Method in class org.webharvest.definition.ScriptDef
- getShortElementName() - Method in class org.webharvest.definition.TemplateDef
- getShortElementName() - Method in class org.webharvest.definition.TextDef
- getShortElementName() - Method in class org.webharvest.definition.TryDef
- getShortElementName() - Method in class org.webharvest.definition.VarDef
- getShortElementName() - Method in class org.webharvest.definition.VarDefDef
- getShortElementName() - Method in class org.webharvest.definition.WhileDef
- getShortElementName() - Method in class org.webharvest.definition.XPathDef
- getShortElementName() - Method in class org.webharvest.definition.XQueryDef
- getShortElementName() - Method in class org.webharvest.definition.XQueryExternalParamDef
- getShortElementName() - Method in class org.webharvest.definition.XsltDef
- getSource() - Method in class org.webharvest.definition.validation.SchemaSource
-
Returns an instance of created XML schema's
Source
. - getSourceCode() - Method in class org.webharvest.runtime.scripting.ScriptSource
- getStartTime() - Method in class org.webharvest.runtime.processors.ProcessorExecutionContext
- getStaticQueryContext() - Method in class org.webharvest.runtime.RuntimeConfig
- getStatus() - Method in class org.webharvest.runtime.EventBasedStatusHolder
-
Returns current status of being processed configuration.
- getStatus() - Method in interface org.webharvest.runtime.StatusHolder
-
Returns current status of being processed configuration.
- getStatusCode() - Method in class org.webharvest.runtime.web.HttpResponseWrapper
- getStatusText() - Method in class org.webharvest.runtime.web.HttpResponseWrapper
- getStylesheetDef() - Method in class org.webharvest.definition.XsltDef
- getSubnodes(ElementName) - Method in class org.webharvest.definition.XmlNode
- getTagsSet() - Method in class org.webharvest.definition.ElementInfo
- getTargetConnection() - Method in interface org.webharvest.runtime.database.ConnectionProxy
-
Returns proxied
Connection
instance. - getTemplate(boolean) - Method in class org.webharvest.definition.ElementInfo
- getText() - Method in class org.webharvest.definition.ConstantDef
- getText() - Method in class org.webharvest.definition.XmlNode
- getTranslateSpecialEntities() - Method in class org.webharvest.definition.HtmlToXmlDef
- getTreatDeprecatedTagsAsContent() - Method in class org.webharvest.definition.HtmlToXmlDef
- getTreatUnknownTagsAsContent() - Method in class org.webharvest.definition.HtmlToXmlDef
- getTryBodyDef() - Method in class org.webharvest.definition.TryDef
- getType() - Method in class org.webharvest.definition.FileDef
- getType() - Method in class org.webharvest.definition.XQueryExternalParamDef
- getType() - Method in class org.webharvest.runtime.processors.plugins.db.DbColumnDescription
- getTypesAnnotatedWith(Class<? extends Annotation>) - Method in interface org.webharvest.utils.ClassPathScanner
-
Returns
Set
of all types annotated with the given annotation. - getTypesAnnotatedWith(Class<? extends Annotation>) - Method in class org.webharvest.utils.ClassPathScannerImpl
-
Returns
Set
of all types annotated with the given annotation. - getUri() - Method in class org.webharvest.definition.ElementName
- getUri() - Method in class org.webharvest.definition.WebHarvestPluginDef
- getUri() - Method in class org.webharvest.definition.XmlAttribute
- getUri() - Method in class org.webharvest.definition.XmlNode
- getUrl() - Method in class org.webharvest.definition.HttpDef
- getUseCdataForScriptAndStyle() - Method in class org.webharvest.definition.HtmlToXmlDef
- getUseEmptyElementTags() - Method in class org.webharvest.definition.HtmlToXmlDef
- getUsername() - Method in class org.webharvest.definition.HttpDef
- getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.db.DatabasePlugin
- getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.db.DbParamPlugin
- getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpDelPlugin
- getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpGetPlugin
- getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpListPlugin
- getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpMkdirPlugin
- getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpPlugin
- getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpPutPlugin
- getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpRmdirPlugin
- getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.mail.MailAttachPlugin
- getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.mail.MailPlugin
- getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.TokenizePlugin
- getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.zip.ZipEntryPlugin
- getValidAttributes() - Method in class org.webharvest.runtime.processors.plugins.zip.ZipPlugin
- getValidIdentifier(String) - Static method in class org.webharvest.utils.CommonUtil
-
For the given string creates valid identifier name.
- getValidSubprocessors() - Method in class org.webharvest.runtime.processors.plugins.mail.MailPlugin
- getValue() - Method in class org.webharvest.definition.XmlAttribute
- getValue() - Method in class org.webharvest.runtime.variables.VariableName
-
Gets
Variable
value. - getValue() - Method in class org.webharvest.runtime.web.HttpParamInfo
- getValue() - Method in class org.webharvest.utils.KeyValuePair
- getVar(String) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
-
Deprecated.
- getVar(String) - Method in interface org.webharvest.runtime.DynamicScopeContext
- getVar(String) - Method in class org.webharvest.runtime.ScraperContext
- getVar(String) - Method in class org.webharvest.utils.SystemUtilities
-
Returns variable from scraper context
- getVariableMap() - Method in class org.webharvest.definition.XPathDef
- GetVarPlugin - Class in org.webharvest.runtime.processors.plugins.variable
- GetVarPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.variable.GetVarPlugin
- getVersion() - Method in interface org.webharvest.definition.Config
- getVersion() - Method in class org.webharvest.definition.XMLConfig
- getWrappedObject() - Method in class org.webharvest.runtime.processors.plugins.db.DbRowVariable
- getWrappedObject() - Method in class org.webharvest.runtime.variables.EmptyVariable
- getWrappedObject() - Method in class org.webharvest.runtime.variables.ListVariable
- getWrappedObject() - Method in class org.webharvest.runtime.variables.NodeVariable
- getWrappedObject() - Method in class org.webharvest.runtime.variables.Variable
- getXmlDef() - Method in class org.webharvest.definition.XsltDef
- getXqDef() - Method in class org.webharvest.definition.XQueryDef
- getXQueryExpressionPool() - Method in class org.webharvest.runtime.RuntimeConfig
- getZipOutStream() - Method in class org.webharvest.runtime.processors.plugins.zip.ZipPlugin
- GROOVY - org.webharvest.runtime.scripting.ScriptingLanguage
H
- handle(T) - Method in interface org.webharvest.events.EventHandler
-
Handle event of supported type..
- HandlerHolder - Interface in org.webharvest.events
-
Represents an object that serves purpose as storage of
EventHandler
(supporting different types of events). - Harvest - Interface in org.webharvest
-
Web-Harvest application facade that provides control over creation of scraping processors
Harvester
and dispatching of scraping events. - Harvester - Interface in org.webharvest
-
Represents scraping session object that is associated with particular configuration and can be executed multiple times.
- Harvester.ContextInitCallback - Interface in org.webharvest
-
Context initialization callback that is invoked for all newly created context objects shortly before of scraping session.
- HarvesterEvent - Interface in org.webharvest.events
-
Represent an event object that is either addressed to particular
Harvester
instance or represents state change that happened on it. - HarvesterEventSink - Class in org.webharvest.events
-
Implementation of
EventSink
intended to guarantee scraping scope's events delivery. - HarvesterEventSink(Registry<Harvester, EventBus>) - Constructor for class org.webharvest.events.HarvesterEventSink
-
Default class constructor expecting
Registry
to be injected possibly by IoC container. - HarvesterFactory - Interface in org.webharvest.ioc
-
Guice dynamic factory helper interface that help to instantiate
Harvester
objects. - HarvestLoadCallback - Interface in org.webharvest
-
Callback interface representing successfully loaded scraping configuration that is projected as collection of
IElementDef
objects. - hasAttribute(Object) - Method in interface org.webharvest.ioc.AttributeHolder
-
Decides if this
AttributeHolder
has an association for the specified key. - hasAttribute(Object) - Method in class org.webharvest.ioc.ScopeAttributeHolder
-
Decides if this
AttributeHolder
has an association for the specified key. - hasBody() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpDelPlugin
- hasBody() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpGetPlugin
- hasBody() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpListPlugin
- hasBody() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpMkdirPlugin
- hasBody() - Method in class org.webharvest.runtime.processors.plugins.ftp.FtpRmdirPlugin
- hashCode() - Method in class org.apache.commons.httpclient.contrib.ssl.EasySSLProtocolSocketFactory
- hashCode() - Method in class org.apache.commons.httpclient.contrib.ssl.StrictSSLProtocolSocketFactory
- hashCode() - Method in class org.webharvest.definition.ElementName
- hashCode() - Method in class org.webharvest.definition.validation.SchemaSource
-
Returns hash code of system identifier of the schema's
Source
. - hashCode() - Method in class org.webharvest.runtime.scripting.ScriptSource
- hasNext() - Method in class org.webharvest.utils.FileListIterator
- hasOperations() - Method in class org.webharvest.definition.AbstractElementDef
- hasOperations() - Method in interface org.webharvest.definition.IElementDef
- HasReader - Interface in org.webharvest.utils
-
Represent an object holding character stream
Reader
. - headers - Variable in class org.webharvest.runtime.web.HttpInfo
- hear(TypeLiteral<I>, TypeEncounter<I>) - Method in class org.webharvest.ioc.EventBusTypeListener
- hear(TypeLiteral<I>, TypeEncounter<I>) - Method in class org.webharvest.ioc.PostConstructListener
- hear(TypeLiteral<I>, TypeEncounter<I>) - Method in class org.webharvest.ioc.ScrapingAwareTypeListener
- HtmlToXmlDef - Class in org.webharvest.definition
-
Definition of HTML to XML rensformation task.
- HtmlToXmlDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.HtmlToXmlDef
- HtmlToXmlProcessor - Class in org.webharvest.runtime.processors
-
Advanced HTML to XML processor using Chain of Responsibility pattern with Strategy pattern for different HTML parsing strategies.
- HtmlToXmlProcessor() - Constructor for class org.webharvest.runtime.processors.HtmlToXmlProcessor
- HttpClientManager - Class in org.webharvest.runtime.web
-
HTTP client functionality.
- HttpClientManager(HttpClientManager.ProxySettings) - Constructor for class org.webharvest.runtime.web.HttpClientManager
- HttpClientManager.ProxySettings - Class in org.webharvest.runtime.web
- HttpClientManager.ProxySettings.Builder - Class in org.webharvest.runtime.web
- HttpDef - Class in org.webharvest.definition
-
Definition of HTTP processor.
- HttpDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.HttpDef
-
Deprecated.
- HttpException - Exception in org.webharvest.exception
-
Http exception - occures during http requests.
- HttpException() - Constructor for exception org.webharvest.exception.HttpException
- HttpException(String) - Constructor for exception org.webharvest.exception.HttpException
- HttpException(String, Throwable) - Constructor for exception org.webharvest.exception.HttpException
- HttpException(Throwable) - Constructor for exception org.webharvest.exception.HttpException
- HttpHeaderDef - Class in org.webharvest.definition
-
Definition of HTTP header.
- HttpHeaderDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.HttpHeaderDef
- HttpHeaderProcessor - Class in org.webharvest.runtime.processors
-
Variable definition http header processor.
- HttpHeaderProcessor() - Constructor for class org.webharvest.runtime.processors.HttpHeaderProcessor
- HttpInfo - Class in org.webharvest.runtime.web
-
Class offers access to HTTP client and response details to the user.
- HttpInfo(HttpClient) - Constructor for class org.webharvest.runtime.web.HttpInfo
- HttpModule - Class in org.webharvest.ioc
-
Google Guice module containing bindings for Web-Harvest's HTTP-related components.
- HttpModule(HttpClientManager.ProxySettings) - Constructor for class org.webharvest.ioc.HttpModule
- HttpParamDef - Class in org.webharvest.definition
-
Definition of HTTP parameter.
- HttpParamDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.HttpParamDef
- HttpParamInfo - Class in org.webharvest.runtime.web
-
Information about http request parameter.
- HttpParamInfo(String, boolean, String, String, Variable) - Constructor for class org.webharvest.runtime.web.HttpParamInfo
- HttpParamProcessor - Class in org.webharvest.runtime.processors
-
Variable definition http param processor.
- HttpParamProcessor() - Constructor for class org.webharvest.runtime.processors.HttpParamProcessor
- HttpProcessor - Class in org.webharvest.runtime.processors
-
Http processor.
- HttpProcessor() - Constructor for class org.webharvest.runtime.processors.HttpProcessor
- HttpResponseWrapper - Class in org.webharvest.runtime.web
-
Class defines http server response.
- HttpResponseWrapper(HttpMethodBase) - Constructor for class org.webharvest.runtime.web.HttpResponseWrapper
-
Constructor - defines response result based on specified HttpMethodBase instance.
I
- IElementDef - Interface in org.webharvest.definition
-
Marker for element definition.
- IfDef - Class in org.webharvest.definition
-
Definition of conditional processor.
- IfDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.IfDef
- IfProcessor - Class in org.webharvest.runtime.processors
- IfProcessor() - Constructor for class org.webharvest.runtime.processors.IfProcessor
- include(ConfigSource.Location) - Method in class org.webharvest.definition.AbstractConfigSource
-
Includes specified
ConfigSource.Location
from current configuration if possible. - include(ConfigSource.Location) - Method in interface org.webharvest.definition.ConfigSource
-
Includes specified
ConfigSource.Location
from current configuration if possible. - IncludeDef - Class in org.webharvest.definition
-
Definition of include element.
- IncludeDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.IncludeDef
- IncludeProcessor - Class in org.webharvest.runtime.processors
-
Include processor.
- IncludeProcessor() - Constructor for class org.webharvest.runtime.processors.IncludeProcessor
- IncludeVisitor - Class in org.webharvest.definition
- IncludeVisitor(String) - Constructor for class org.webharvest.definition.IncludeVisitor
- indent(int) - Static method in class org.webharvest.utils.CommonUtil
-
Deprecated.Provides unnecessary complexity for logging facility.
- informListenersAboutError(Exception) - Method in class org.webharvest.runtime.Scraper
-
Inform all scraper listeners that an error has occured during scraper execution.
- informListenersAboutError(Exception) - Method in interface org.webharvest.runtime.WebScraper
-
Deprecated.
- initContext() - Method in class org.webharvest.deprecated.runtime.ScraperContext10
-
Deprecated.
- initContext() - Method in class org.webharvest.runtime.ScraperContext
- InjectorHelper - Class in org.webharvest.ioc
-
Guice static injector helper.
- INSTANCE - org.webharvest.runtime.database.DefaultDriverManager
-
Singleton instance reference
- INSTANCE - Static variable in class org.webharvest.definition.validation.SchemaFactoryImpl
-
Reference to singleton instance.
- INSTANCE - Static variable in interface org.webharvest.runtime.processors.ProcessorFactory
- INSTANCE - Static variable in class org.webharvest.runtime.variables.EmptyVariable
- internal() - Method in annotation type org.webharvest.annotation.Definition
- IntPair() - Constructor for class org.webharvest.utils.CommonUtil.IntPair
- IntPair(int, int) - Constructor for class org.webharvest.utils.CommonUtil.IntPair
- invalidate(K) - Method in interface org.webharvest.Cache
-
Discard value cached under the specified key.
- invalidate(K) - Method in class org.webharvest.ThreadLocalCache
-
Discard value cached under the specified key.
- invalidAttribute(String, String) - Static method in class org.webharvest.exception.ErrMsg
- invalidTag(String, String) - Static method in class org.webharvest.exception.ErrMsg
- invoke(MethodInvocation) - Method in class org.webharvest.ioc.ScrapingInterceptor
- isBooleanTrue(String) - Static method in class org.webharvest.utils.CommonUtil
-
Checks if specified string value represents boolean true value.
- isEmpty() - Method in class org.webharvest.runtime.variables.EmptyVariable
- isEmpty() - Method in class org.webharvest.runtime.variables.ListVariable
- isEmpty() - Method in class org.webharvest.runtime.variables.NodeVariable
- isEmpty() - Method in class org.webharvest.runtime.variables.Variable
- isEmpty() - Method in class org.webharvest.utils.Stack
- isEmpty() - Method in class org.webharvest.utils.XmlNodeWrapper
- isEmpty(String) - Static method in class org.webharvest.utils.CommonUtil
- isEmptyString(Object) - Static method in class org.webharvest.utils.CommonUtil
- isFalse(boolean, String, Object...) - Static method in class org.webharvest.utils.Assert
- isFile() - Method in class org.webharvest.runtime.web.HttpParamInfo
- isFullUrl(String) - Static method in class org.webharvest.utils.CommonUtil
-
Checks if specified link is full URL.
- isInternal() - Method in class org.webharvest.definition.ElementInfo
- isNull(Object) - Static method in class org.webharvest.utils.Assert
- isNull(Object, String, Object...) - Static method in class org.webharvest.utils.Assert
- isPathAbsolute(String) - Static method in class org.webharvest.utils.CommonUtil
-
Checks if specified file path is absolute.
- isPluginRegistered(Class, String) - Method in class org.webharvest.definition.DefinitionResolver
-
Deprecated.
- isPluginRegistered(String, String) - Method in interface org.webharvest.definition.ConfigurableResolver
-
Deprecated.
- isPluginRegistered(String, String) - Method in class org.webharvest.definition.DefinitionResolver
- isSatisfied() - Method in class org.webharvest.runtime.RunningStatusGuard
- isTrue(boolean, String, Object...) - Static method in class org.webharvest.utils.Assert
- isValidXmlIdentifier(String) - Static method in class org.webharvest.utils.CommonUtil
-
Checks if given string is valid XML identifier, i.e. it can be valid XML tag or attribute name.
- isVariableDefined(String) - Method in class org.webharvest.utils.SystemUtilities
- iterator() - Method in class org.webharvest.deprecated.runtime.ScraperContext10
-
Deprecated.
- iterator() - Method in class org.webharvest.runtime.ScraperContext
- iterator() - Method in class org.webharvest.runtime.variables.ListVariable
J
- JAVASCRIPT - org.webharvest.runtime.scripting.ScriptingLanguage
- JNDI_NAME_ATTRIBUTE - Static variable in class org.webharvest.runtime.processors.plugins.db.DatabaseJNDIPlugin
-
Name of XML attribute representing JNDI hook name.
- JNDIConnectionFactory - Class in org.webharvest.runtime.database
-
ConnectionFactory
implementation supporting JNDIDataSource
lookups. - JNDIConnectionFactory() - Constructor for class org.webharvest.runtime.database.JNDIConnectionFactory
-
Instantiates
JNDIConnectionFactory
, obtaining reference to the JNDIContext
. - JsonToXmlPlugin - Class in org.webharvest.runtime.processors.plugins
-
Converter from JSON to XML
- JsonToXmlPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.JsonToXmlPlugin
- JSRScriptEngineAdapter - Class in org.webharvest.runtime.scripting.jsr
-
Adapter design pattern implementation.
- JSRScriptEngineAdapter(ScriptEngine) - Constructor for class org.webharvest.runtime.scripting.jsr.JSRScriptEngineAdapter
- JSRScriptEngineFactory - Class in org.webharvest.runtime.scripting.jsr
-
ScriptEngineFactory
implementation that creates script engines based on JSR-223 specification. - JSRScriptEngineFactory() - Constructor for class org.webharvest.runtime.scripting.jsr.JSRScriptEngineFactory
K
- KeyValuePair<T> - Class in org.webharvest.utils
- KeyValuePair(String, T) - Constructor for class org.webharvest.utils.KeyValuePair
L
- LAYOUT - Static variable in annotation type org.webharvest.ioc.DebugFileLogger
-
Logger's layout
- lf - Static variable in class org.webharvest.utils.SystemUtilities
- listBound() - Method in class org.webharvest.AbstractRegistry
-
Lists names of all values bound.
- listBound() - Method in class org.webharvest.LockedRegistry
-
Lists names of all values bound.
- listBound() - Method in interface org.webharvest.Registry
-
Lists names of all values bound.
- ListProcessor - Class in org.webharvest.runtime.processors
- ListProcessor() - Constructor for class org.webharvest.runtime.processors.ListProcessor
- ListVariable - Class in org.webharvest.runtime.variables
-
List variable - String wrapper.
- ListVariable() - Constructor for class org.webharvest.runtime.variables.ListVariable
- ListVariable(Iterable) - Constructor for class org.webharvest.runtime.variables.ListVariable
- LockedRegistry<K,V> - Class in org.webharvest
-
Generic locking registry implementation that follows 'decorator' design pattern.
- LockedRegistry(Registry<K, V>) - Constructor for class org.webharvest.LockedRegistry
-
Class constructor expecting
Registry
to be specified. - log - Static variable in class org.webharvest.definition.XmlNode
- LOG - Static variable in class org.webharvest.runtime.processors.AbstractProcessor
- lookup(K) - Method in class org.webharvest.AbstractRegistry
-
Lookups value bound under the given name.
- lookup(K) - Method in interface org.webharvest.Cache
-
Returns value associated with the specified key.
- lookup(K) - Method in class org.webharvest.LockedRegistry
-
Lookups value bound under the given name.
- lookup(K) - Method in interface org.webharvest.Registry
-
Lookups value bound under the given name.
- lookup(K) - Method in class org.webharvest.ThreadLocalCache
-
Returns value associated with the specified key.
- LoopDef - Class in org.webharvest.definition
-
Definition of loop processor.
- LoopDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.LoopDef
- LoopProcessor - Class in org.webharvest.runtime.processors
-
Loop list processor.
- LoopProcessor() - Constructor for class org.webharvest.runtime.processors.LoopProcessor
M
- MailAttachPlugin - Class in org.webharvest.runtime.processors.plugins.mail
-
Mail attachment plugin - can be used only inside mail plugin.
- MailAttachPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.mail.MailAttachPlugin
- MailPlugin - Class in org.webharvest.runtime.processors.plugins.mail
-
Mail sending processor.
- MailPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.mail.MailPlugin
- MailPluginException - Exception in org.webharvest.runtime.processors.plugins.mail
-
Runtime exception for MailPlugin
- MailPluginException(String) - Constructor for exception org.webharvest.runtime.processors.plugins.mail.MailPluginException
- MailPluginException(String, Throwable) - Constructor for exception org.webharvest.runtime.processors.plugins.mail.MailPluginException
- MailPluginException(Throwable) - Constructor for exception org.webharvest.runtime.processors.plugins.mail.MailPluginException
- main(String[]) - Static method in class org.webharvest.docs.ProcessorReferenceGenerator
- MIME_TYPES - Static variable in class org.webharvest.WHConstants
- mimeType - Variable in class org.webharvest.runtime.web.HttpInfo
- missingAttribute(String, String) - Static method in class org.webharvest.exception.ErrMsg
- missingTag(String, String) - Static method in class org.webharvest.exception.ErrMsg
N
- NAME - Static variable in annotation type org.webharvest.ioc.DebugFileLogger
-
Logger's name
- NestedContextFactory - Class in org.webharvest.runtime
-
Factory capable of creating nested
DynamicScopeContext
instances. - next() - Method in class org.webharvest.utils.FileListIterator
- NO_PROXY_SET - Static variable in class org.webharvest.runtime.web.HttpClientManager.ProxySettings
- NodeVariable - Class in org.webharvest.runtime.variables
-
Node variable - Single node wrapper.
- NodeVariable(Object) - Constructor for class org.webharvest.runtime.variables.NodeVariable
- notNull(Object) - Static method in class org.webharvest.utils.Assert
- notNull(Object, String, Object...) - Static method in class org.webharvest.utils.Assert
- NULL - Static variable in class org.webharvest.definition.XmlNode
- nvl(Object, String) - Static method in class org.webharvest.utils.CommonUtil
O
- obtainConnection(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.db.AbstractDatabasePlugin
-
Returns configured, ready for use database
Connection
- obtainConnection(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.db.DatabaseJNDIPlugin
-
Returns configured, ready for use database
Connection
- obtainConnection(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.plugins.db.DatabasePlugin
-
Returns configured, ready for use database
Connection
- onAfterScraping(Harvester) - Method in class org.webharvest.events.DefaultHandlerHolder
-
Receive notification about the scope for particular
Harvester
that is about to be tore down. - onAfterScraping(Harvester) - Method in class org.webharvest.ioc.ScrapingInterceptor.ScrapingAwareHelper
-
Notify all registered listeners about scraping scope that is about the end.
- onAfterScraping(Harvester) - Method in interface org.webharvest.ScrapingAware
-
Receive notification about the scope for particular
Harvester
that is about to be tore down. - onBeforeScraping(Harvester) - Method in class org.webharvest.events.DefaultHandlerHolder
-
Receive notification about new scope initialized for particular
Harvester
object. - onBeforeScraping(Harvester) - Method in class org.webharvest.ioc.ScrapingInterceptor.ScrapingAwareHelper
-
Notify all registered listeners about new scraping scope.
- onBeforeScraping(Harvester) - Method in interface org.webharvest.ScrapingAware
-
Receive notification about new scope initialized for particular
Harvester
object. - onExecutionEnd(ScraperExecutionEndEvent) - Method in class org.webharvest.runtime.database.StandaloneConnectionPool
-
Reacts on the end of configuration execution releasing all pooled database connections.
- onExecutionError(ScraperExecutionErrorEvent) - Method in class org.webharvest.runtime.database.StandaloneConnectionPool
-
Reacts on the configuration execution error releasing all pooled database connections.
- onExecutionFinished(ScraperExecutionEndEvent) - Method in class org.webharvest.runtime.Scraper
-
Logs information about time of Scraper's execution on
ScraperExecutionEndEvent
. - onExecutionStopped(ScraperExecutionStoppedEvent) - Method in class org.webharvest.runtime.Scraper
-
Logs information that Scraper's execution has been stopped.
- onSuccess(List<IElementDef>) - Method in interface org.webharvest.HarvestLoadCallback
-
Invoked when successfully loaded scraping configuraiton.
- onSuccess(DynamicScopeContext) - Method in interface org.webharvest.Harvester.ContextInitCallback
-
Adjust newly created
DynamicScopeContext
object shortly before it is used to launch scraping session. - org.apache.commons.httpclient.contrib.ssl - package org.apache.commons.httpclient.contrib.ssl
- org.webharvest - package org.webharvest
- org.webharvest.annotation - package org.webharvest.annotation
- org.webharvest.definition - package org.webharvest.definition
- org.webharvest.definition.validation - package org.webharvest.definition.validation
- org.webharvest.deprecated.runtime - package org.webharvest.deprecated.runtime
- org.webharvest.deprecated.runtime.processors - package org.webharvest.deprecated.runtime.processors
- org.webharvest.docs - package org.webharvest.docs
- org.webharvest.events - package org.webharvest.events
- org.webharvest.exception - package org.webharvest.exception
- org.webharvest.ioc - package org.webharvest.ioc
- org.webharvest.runtime - package org.webharvest.runtime
- org.webharvest.runtime.database - package org.webharvest.runtime.database
- org.webharvest.runtime.processors - package org.webharvest.runtime.processors
- org.webharvest.runtime.processors.plugins - package org.webharvest.runtime.processors.plugins
- org.webharvest.runtime.processors.plugins.db - package org.webharvest.runtime.processors.plugins.db
- org.webharvest.runtime.processors.plugins.ftp - package org.webharvest.runtime.processors.plugins.ftp
- org.webharvest.runtime.processors.plugins.mail - package org.webharvest.runtime.processors.plugins.mail
- org.webharvest.runtime.processors.plugins.variable - package org.webharvest.runtime.processors.plugins.variable
- org.webharvest.runtime.processors.plugins.webbrowser - package org.webharvest.runtime.processors.plugins.webbrowser
- org.webharvest.runtime.processors.plugins.zip - package org.webharvest.runtime.processors.plugins.zip
- org.webharvest.runtime.scripting - package org.webharvest.runtime.scripting
- org.webharvest.runtime.scripting.jsr - package org.webharvest.runtime.scripting.jsr
- org.webharvest.runtime.templaters - package org.webharvest.runtime.templaters
- org.webharvest.runtime.variables - package org.webharvest.runtime.variables
- org.webharvest.runtime.web - package org.webharvest.runtime.web
- org.webharvest.utils - package org.webharvest.utils
P
- parse(ConfigSource) - Method in interface org.webharvest.definition.ConfigParser
- parse(ConfigSource) - Method in class org.webharvest.definition.SAXConfigParser
- parse(InputSource) - Static method in class org.webharvest.utils.XmlUtil
- parse(InputSource) - Method in class org.webharvest.utils.XmlValidator
- ParserException - Exception in org.webharvest.exception
-
General parsing exception.
- ParserException() - Constructor for exception org.webharvest.exception.ParserException
- ParserException(String) - Constructor for exception org.webharvest.exception.ParserException
- ParserException(String, Throwable) - Constructor for exception org.webharvest.exception.ParserException
- ParserException(Throwable) - Constructor for exception org.webharvest.exception.ParserException
- pause(ScraperExecutionPausedEvent) - Method in class org.webharvest.runtime.EventBasedStatusHolder
-
Changes status from
ScraperState.RUNNING
toScraperState.PAUSED
. - PAUSED - org.webharvest.runtime.ScraperState
-
Scraper has been paused.
- peek() - Method in class org.webharvest.utils.Stack
- PluginDef - Interface in org.webharvest.definition
- PluginDefinitionBuilder - Class in org.webharvest.definition
-
Builder for creating WebHarvestPluginDef instances.
- PluginException - Exception in org.webharvest.exception
-
Runtime exception occurred during plugin processors registration or creation.
- PluginException() - Constructor for exception org.webharvest.exception.PluginException
- PluginException(String) - Constructor for exception org.webharvest.exception.PluginException
- PluginException(String, Throwable) - Constructor for exception org.webharvest.exception.PluginException
- PluginException(Throwable) - Constructor for exception org.webharvest.exception.PluginException
- PluginFactory - Class in org.webharvest.definition
-
Factory for creating plugin instances.
- PluginFactory() - Constructor for class org.webharvest.definition.PluginFactory
- PluginRegistrationStrategy - Interface in org.webharvest.definition
-
Strategy interface for plugin registration.
- pop() - Method in class org.webharvest.utils.Stack
- PostConstructListener - Class in org.webharvest.ioc
-
TypeListener
implementation enabling Guice support for JSR-250@PostConstruct
annotation. - PostConstructListener() - Constructor for class org.webharvest.ioc.PostConstructListener
- postEvent(E) - Method in interface org.webharvest.Harvest
-
Post an event with hope it will be routed to target scope.
- postEvent(E) - Method in class org.webharvest.runtime.DefaultHarvest
-
Post an event with hope it will be routed to target scope.
- postProcess(ConfigurableResolver) - Method in class org.webharvest.definition.AnnotatedPluginsPostProcessor
-
Post process provided
ConfigurableResolver
by registering allAutoscanned
web harvest plugins available on classpath under package (and its subpackages) declared in the post processor's constructor. - postProcess(ConfigurableResolver) - Method in interface org.webharvest.definition.ResolverPostProcessor
-
Triggered on the
ConfigurableResolver
refresh. - postProcess(SchemaResolver) - Method in interface org.webharvest.definition.validation.SchemaResolverPostProcessor
-
Triggered on the
SchemaResolver
refresh. - postProcess(SchemaResolver) - Method in class org.webharvest.definition.validation.SchemaResourcesPostProcessor
-
Triggered on the
SchemaResolver
refresh. - prettyPrintXml(String) - Static method in class org.webharvest.utils.XmlUtil
- prettyPrintXml(Document, Writer) - Static method in class org.webharvest.utils.XmlUtil
- Processor<TDef extends IElementDef> - Interface in org.webharvest.runtime.processors
- ProcessorExecutionContext - Class in org.webharvest.runtime.processors
-
Context object that holds execution state and configuration.
- ProcessorExecutionContext(Processor<?>, DynamicScopeContext, Map<String, Object>, long, String) - Constructor for class org.webharvest.runtime.processors.ProcessorExecutionContext
- ProcessorExecutionStrategy - Interface in org.webharvest.runtime.processors
-
Strategy interface for processor execution.
- ProcessorFactory - Interface in org.webharvest.runtime.processors
-
Created by rbala on 13.07.2017.
- ProcessorReferenceGenerator - Class in org.webharvest.docs
-
Generator for processor reference documentation from source code annotations.
- ProcessorReferenceGenerator() - Constructor for class org.webharvest.docs.ProcessorReferenceGenerator
- ProcessorStartEvent - Class in org.webharvest.events
-
Event informing that the specified
Processor
has been started. - ProcessorStartEvent(Processor) - Constructor for class org.webharvest.events.ProcessorStartEvent
-
Default class constructor which accepts not
null
reference toProcessor
. - ProcessorStopEvent - Class in org.webharvest.events
-
Event informing that the specified
Processor
has successfully finished its work. - ProcessorStopEvent(Processor, Map) - Constructor for class org.webharvest.events.ProcessorStopEvent
-
Default class constructor which accepts not
null
reference toProcessor
and its properties. - publish(E) - Method in interface org.webharvest.events.EventSink
-
Posts an event to all registered handlers.
- publish(E) - Method in class org.webharvest.events.HarvesterEventSink
-
Posts an event to all registered handlers.
- push(T) - Method in class org.webharvest.utils.Stack
- put(K, V) - Method in interface org.webharvest.Cache
-
Associates the specified value with the specified key in
Cache
. - put(K, V) - Method in class org.webharvest.ThreadLocalCache
-
Associates the specified value with the specified key in
Cache
. - putAttribute(Object, Object) - Method in interface org.webharvest.ioc.AttributeHolder
-
Stores a new value in this
AttributeHolder
. - putAttribute(Object, Object) - Method in class org.webharvest.ioc.ScopeAttributeHolder
-
Stores a new value in this
AttributeHolder
.
Q
- quot - Static variable in class org.webharvest.utils.SystemUtilities
R
- readBodyAsArray() - Method in class org.webharvest.runtime.web.HttpResponseWrapper
- readBytesFromFile(File) - Static method in class org.webharvest.utils.CommonUtil
- readStringFromFile(File, String) - Static method in class org.webharvest.utils.CommonUtil
- readStringFromUrl(URL) - Static method in class org.webharvest.utils.CommonUtil
-
Reads content from specified URL
- READY - org.webharvest.runtime.ScraperState
-
Scraper is ready to use.
- RealBodyProcessor - Class in org.webharvest.runtime.processors
- RealBodyProcessor() - Constructor for class org.webharvest.runtime.processors.RealBodyProcessor
- recognize(String) - Static method in enum org.webharvest.runtime.scripting.ScriptingLanguage
- recordCount() - Method in class org.webharvest.utils.SystemUtilities
-
Returns the number of records processed in the current context.
- refresh() - Method in class org.webharvest.definition.AbstractRefreshableResolver
-
Refresh the current resolver instance, causing all configuration element definitions to be reinitialized.
- refresh() - Method in interface org.webharvest.definition.ConfigurableResolver
-
Refresh the current resolver instance, causing all configuration element definitions to be reinitialized.
- refresh() - Method in class org.webharvest.definition.validation.SchemaFactoryImpl
-
Refresh the current resolver instance, causing all XML schema sources to be reinitialized.
- refresh() - Method in interface org.webharvest.definition.validation.SchemaResolver
-
Refresh the current resolver instance, causing all XML schema sources to be reinitialized.
- refreshSchemaResolver() - Static method in class org.webharvest.utils.XmlUtil
-
Force refresh of the schema resolver to pick up XSD changes.
- RegexpDef - Class in org.webharvest.definition
-
Definition of regular expression processor.
- RegexpDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.RegexpDef
- RegexpPatternProcessor - Class in org.webharvest.runtime.processors
- RegexpPatternProcessor() - Constructor for class org.webharvest.runtime.processors.RegexpPatternProcessor
- RegexpProcessor - Class in org.webharvest.runtime.processors
-
Regular expression replace processor.
- RegexpProcessor() - Constructor for class org.webharvest.runtime.processors.RegexpProcessor
- RegexpResultProcessor - Class in org.webharvest.runtime.processors
- RegexpResultProcessor() - Constructor for class org.webharvest.runtime.processors.RegexpResultProcessor
- RegexpSourceProcessor - Class in org.webharvest.runtime.processors
- RegexpSourceProcessor() - Constructor for class org.webharvest.runtime.processors.RegexpSourceProcessor
- register(EventHandler<?>) - Method in class org.webharvest.events.DefaultHandlerHolder
-
Register event handler.
- register(EventHandler<?>) - Method in interface org.webharvest.events.HandlerHolder
-
Register event handler.
- registerDriver(String) - Method in enum org.webharvest.runtime.database.DefaultDriverManager
-
Registers driver with the provided class name.
- registerDriver(String) - Method in interface org.webharvest.runtime.database.DriverManager
-
Registers driver with the provided class name.
- registerPlugin(Class<? extends WebHarvestPlugin>, ConfigurableResolver) - Method in class org.webharvest.definition.DefaultPluginRegistrationStrategy
- registerPlugin(Class<? extends WebHarvestPlugin>, ConfigurableResolver) - Method in class org.webharvest.definition.ElementInfoPluginRegistrationStrategy
- registerPlugin(Class<? extends WebHarvestPlugin>, ConfigurableResolver) - Method in interface org.webharvest.definition.PluginRegistrationStrategy
-
Registers a plugin in the resolver using the specified strategy.
- registerPlugin(Class, String) - Method in interface org.webharvest.definition.ConfigurableResolver
-
Deprecated.
- registerPlugin(Class, String) - Method in class org.webharvest.definition.DefinitionResolver
- registerPlugin(String, String) - Method in interface org.webharvest.definition.ConfigurableResolver
-
Deprecated.
- registerPlugin(String, String) - Method in class org.webharvest.definition.DefinitionResolver
-
Deprecated.
- registerPlugin(ElementInfo, String) - Method in interface org.webharvest.definition.ConfigurableResolver
-
Register provided
WebHarvestPlugin
based on information provided by specifiedElementInfo
. - registerPlugin(ElementInfo, String) - Method in class org.webharvest.definition.DefinitionResolver
-
Register provided
WebHarvestPlugin
based on information provided by specifiedElementInfo
. - registerSchemaSource(SchemaSource) - Method in class org.webharvest.definition.validation.SchemaFactoryImpl
-
Register provided
SchemaSource
which cannot benull
- registerSchemaSource(SchemaSource) - Method in interface org.webharvest.definition.validation.SchemaResolver
-
Register provided
SchemaSource
which cannot benull
- Registry<K,V> - Interface in org.webharvest
-
Generic registry interface following 'registry' design pattern.
- reload() - Method in interface org.webharvest.definition.Config
-
Loads configuration from resource defined by
Config.getConfigSource()
. - reload() - Method in class org.webharvest.definition.XMLConfig
-
Loads configuration from resource defined by
Config.getConfigSource()
. - remove() - Method in class org.webharvest.utils.FileListIterator
- removeDriverResource(URI) - Method in enum org.webharvest.runtime.database.DefaultDriverManager
-
Removes previously added database driver resource.
- removeDriverResource(URI) - Method in interface org.webharvest.runtime.database.DriverManager
-
Removes previously added database driver resource.
- replaceExistingVar(String, Variable) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
-
Deprecated.
- replaceExistingVar(String, Variable) - Method in interface org.webharvest.runtime.DynamicScopeContext
- replaceExistingVar(String, Variable) - Method in class org.webharvest.runtime.ScraperContext
- replaceTop(T) - Method in class org.webharvest.utils.Stack
- requiredAttributes() - Method in annotation type org.webharvest.annotation.Definition
-
This method should return all mandatory attribute names for the plugin processor.
- requiredSubprocessors() - Method in annotation type org.webharvest.annotation.Definition
-
This method should return all mandatory subprocessor names, or in other words all mandatory subtags that must be present in the body of this processor plugin.
- ResolverPostProcessor - Interface in org.webharvest.definition
-
Allows for custom modification of the web harvest configuration element definition resolvers.
- ResourcePathToURITransformer - Class in org.webharvest.definition.validation
-
An implementation of
Transformer
interface which supports transformation from given resource name (resource path as string) to itsURI
. - ResourcePathToURITransformer() - Constructor for class org.webharvest.definition.validation.ResourcePathToURITransformer
- resume(ScraperExecutionContinuedEvent) - Method in class org.webharvest.runtime.EventBasedStatusHolder
-
Changes status from
ScraperState.PAUSED
toScraperState.RUNNING
. - ReturnDef - Class in org.webharvest.definition
-
Definition of function's return statement.
- ReturnDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.ReturnDef
- ReturnProcessor - Class in org.webharvest.runtime.processors
-
Function's return value processor.
- ReturnProcessor() - Constructor for class org.webharvest.runtime.processors.ReturnProcessor
- run(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.AbstractProcessor
-
Template method that defines the execution flow.
- run(DynamicScopeContext) - Method in interface org.webharvest.runtime.processors.Processor
- run(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.RunningStatusController
- run(DynamicScopeContext) - Method in class org.webharvest.runtime.processors.StoppedOrExitedProcessor
- RUNNING - org.webharvest.runtime.ScraperState
-
Scraper is running.
- RunningStatusController<TDef extends IElementDef> - Class in org.webharvest.runtime.processors
-
AbstractProcessorDecorator
implementation which decoratesProcessor#run(Scraper, DynamicScopeContext)
method in the way that it enters to theMonitor
usingMonitor.Guard
verifying that processing is not paused. - RunningStatusController(Processor<TDef>) - Constructor for class org.webharvest.runtime.processors.RunningStatusController
-
Default class constructor which accepts
Processor
instance which is going to be decorated by this class. - RunningStatusGuard - Class in org.webharvest.runtime
-
Implementation of
Monitor.Guard
verifying that current status of configuration's processing is 'running'. - RunningStatusGuard(Monitor) - Constructor for class org.webharvest.runtime.RunningStatusGuard
-
Default class constructor which accepts reference to
Monitor
. - RuntimeConfig - Class in org.webharvest.runtime
-
Facade for runtime objects needed for specific processors' execution.
- RuntimeConfig() - Constructor for class org.webharvest.runtime.RuntimeConfig
S
- saveStringToFile(File, String, String) - Static method in class org.webharvest.utils.CommonUtil
-
Saves specified content to the file with specified charset.
- SAXConfigParser - Class in org.webharvest.definition
-
Created by rba on 16.07.2017.
- SAXConfigParser() - Constructor for class org.webharvest.definition.SAXConfigParser
- SchemaComponentFactory - Class in org.webharvest.definition.validation
-
Factory class providing instances of
SchemaResolver
andSchemaFactory
which actually are the same instance ofSchemaFactoryImpl
. - SchemaComponentFactory() - Constructor for class org.webharvest.definition.validation.SchemaComponentFactory
- SchemaFactory - Interface in org.webharvest.definition.validation
-
Factory creating an instance of
Schema
which is a base of XML validation process. - SchemaFactoryImpl - Class in org.webharvest.definition.validation
-
Implementation of
SchemaFactory
andSchemaResolver
interfaces which realizes Abstract Factory and Singleton design patterns. - SchemaResolver - Interface in org.webharvest.definition.validation
-
Interface to be implemented by the web harvest XML schema sources resolvers.
- SchemaResolverPostProcessor - Interface in org.webharvest.definition.validation
-
Allows for custom modification of the web harvest XML schema sources resolvers.
- SchemaResourcesPostProcessor<T> - Class in org.webharvest.definition.validation
-
SchemaResolverPostProcessor
implementation capable of transforming specified XML schema resources which could be e.g. a paths to these resources or some resource object. - SchemaResourcesPostProcessor(Transformer<T, SchemaSource>, T...) - Constructor for class org.webharvest.definition.validation.SchemaResourcesPostProcessor
-
Default class constructor accepting array of XML schema resources and reference to
Transformer
instance which allows to convert these resources toSchemaSource
objects. - SchemaSource - Class in org.webharvest.definition.validation
-
An POJO object which contains XML schema's
Source
. - SchemaSource(InputStream, String) - Constructor for class org.webharvest.definition.validation.SchemaSource
-
Default class constructors accepting not
null
InputSteam
and system identifier of the XML schema source. - scope(Key<T>, Provider<T>) - Method in class org.webharvest.ioc.AttributeHolderScope
- ScopeAttributeHolder - Class in org.webharvest.ioc
-
Represents an object that implements
AttributeHolder
and is intended to serve purpose as container for Guice scope's beans. - ScopeAttributeHolder() - Constructor for class org.webharvest.ioc.ScopeAttributeHolder
- Scraper - Class in org.webharvest.runtime
-
Basic runtime class.
- Scraper() - Constructor for class org.webharvest.runtime.Scraper
-
Default constructor for non-Guice usage.
- Scraper(EventBus) - Constructor for class org.webharvest.runtime.Scraper
-
Constructor for custom EventBus.
- ScraperContext - Class in org.webharvest.runtime
-
Context of scraper execution.
- ScraperContext() - Constructor for class org.webharvest.runtime.ScraperContext
- ScraperContext10 - Class in org.webharvest.deprecated.runtime
-
Deprecated.
- ScraperContext10() - Constructor for class org.webharvest.deprecated.runtime.ScraperContext10
-
Deprecated.
- ScraperExecutionContinuedEvent - Class in org.webharvest.events
- ScraperExecutionContinuedEvent(Harvester) - Constructor for class org.webharvest.events.ScraperExecutionContinuedEvent
- ScraperExecutionEndEvent - Class in org.webharvest.events
-
Event informing that the execution of
WebScraper
has been successfully completed. - ScraperExecutionEndEvent(WebScraper, long) - Constructor for class org.webharvest.events.ScraperExecutionEndEvent
-
Constructs
ScraperExecutionEndEvent
accepting reference toWebScraper
which execution has been completed - ScraperExecutionErrorEvent - Class in org.webharvest.events
-
Event informing that during the execution of Scraper some exception has occurred.
- ScraperExecutionErrorEvent(Exception) - Constructor for class org.webharvest.events.ScraperExecutionErrorEvent
-
Event constructor accepting
Exception
which occurred during the execution. - ScraperExecutionExitEvent - Class in org.webharvest.events
-
Event informing that the execution of configuration has exited.
- ScraperExecutionExitEvent(String) - Constructor for class org.webharvest.events.ScraperExecutionExitEvent
-
Constructs
ScraperExecutionExitEvent
accepting cause of of execution's exit. - ScraperExecutionPausedEvent - Class in org.webharvest.events
- ScraperExecutionPausedEvent(Harvester) - Constructor for class org.webharvest.events.ScraperExecutionPausedEvent
- ScraperExecutionStartEvent - Class in org.webharvest.events
- ScraperExecutionStartEvent(WebScraper) - Constructor for class org.webharvest.events.ScraperExecutionStartEvent
- ScraperExecutionStoppedEvent - Class in org.webharvest.events
-
Event informing that the execution of
Harvester
has been stopped. - ScraperExecutionStoppedEvent(Harvester) - Constructor for class org.webharvest.events.ScraperExecutionStoppedEvent
-
Constructs
ScraperExecutionStoppedEvent
accepting reference toHarvester
which execution has been stopped. - ScraperModule - Class in org.webharvest.ioc
-
Guice module for Web-Harvest configuration.
- ScraperModule(String) - Constructor for class org.webharvest.ioc.ScraperModule
-
Creates a ScraperModule with default configuration.
- ScraperScope - Class in org.webharvest.ioc
-
Scraping scope container.
- ScraperScope() - Constructor for class org.webharvest.ioc.ScraperScope
- ScraperState - Enum in org.webharvest.runtime
-
An enum containing all available Scraper's states.
- ScraperXPathException - Exception in org.webharvest.exception
-
XPath exception - occured during executing xpath processor.
- ScraperXPathException() - Constructor for exception org.webharvest.exception.ScraperXPathException
- ScraperXPathException(String) - Constructor for exception org.webharvest.exception.ScraperXPathException
- ScraperXPathException(String, Throwable) - Constructor for exception org.webharvest.exception.ScraperXPathException
- ScraperXPathException(Throwable) - Constructor for exception org.webharvest.exception.ScraperXPathException
- ScraperXQueryException - Exception in org.webharvest.exception
-
XQuery exception - occured during executing xquery processor.
- ScraperXQueryException() - Constructor for exception org.webharvest.exception.ScraperXQueryException
- ScraperXQueryException(String) - Constructor for exception org.webharvest.exception.ScraperXQueryException
- ScraperXQueryException(String, Throwable) - Constructor for exception org.webharvest.exception.ScraperXQueryException
- ScraperXQueryException(Throwable) - Constructor for exception org.webharvest.exception.ScraperXQueryException
- Scraping - Annotation Type in org.webharvest.ioc
-
Guice helper annotation used to indicate methods expected to be invoked in exclusive scraping scope.
- ScrapingAware - Interface in org.webharvest
-
Interface to be implemented by any object that wishes to be notified of scraping scope possibly it runs in.
- ScrapingAwareHelper() - Constructor for class org.webharvest.ioc.ScrapingInterceptor.ScrapingAwareHelper
- ScrapingAwareTypeListener - Class in org.webharvest.ioc
-
Implementation of
TypeListener
that is responsible for registration of detectedScrapingAware
. - ScrapingAwareTypeListener() - Constructor for class org.webharvest.ioc.ScrapingAwareTypeListener
- ScrapingHarvester - Class in org.webharvest.runtime
-
Default implementation of
Harvester
interface aimed to perform data extraction from remote websites. - ScrapingHarvester(Provider<WebScraper>, ContextFactory, Config) - Constructor for class org.webharvest.runtime.ScrapingHarvester
-
Class constructor expecting Guice
WebScraper
provider,DynamicScopeContext
factory andConfig
as already loaded configuration. - ScrapingInterceptor - Class in org.webharvest.ioc
-
Guice AOP interceptor responsible for taking action for method annotated with
Scraping
annotation. - ScrapingInterceptor() - Constructor for class org.webharvest.ioc.ScrapingInterceptor
- ScrapingInterceptor.ScrapingAwareHelper - Class in org.webharvest.ioc
-
Guice aware helper class that maintains collection of registered
ScrapingAware
listeners. - ScrapingScope - Annotation Type in org.webharvest.ioc
-
Guice helper annotation used to indicate types to be instantiated and kept in scraping scope.
- ScriptDef - Class in org.webharvest.definition
-
Definition of script processor.
- ScriptDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.ScriptDef
- ScriptEngine - Interface in org.webharvest.runtime.scripting
-
Interface providing scripting functionality.
- ScriptEngineException - Exception in org.webharvest.exception
-
Script engine exception - thrown when there is a problem with a script engine itself, not a script source.
- ScriptEngineException(Throwable) - Constructor for exception org.webharvest.exception.ScriptEngineException
- scriptEngineFactory - Static variable in class org.webharvest.runtime.templaters.BaseTemplater
- ScriptEngineFactory - Interface in org.webharvest.runtime.scripting
-
Factory creating
ScriptEngine
instances capable of executing providedScriptSource
. - ScriptException - Exception in org.webharvest.exception
-
Script execution exception - occurred during script compilation or evaluation.
- ScriptException(String, Throwable) - Constructor for exception org.webharvest.exception.ScriptException
- ScriptException(Throwable) - Constructor for exception org.webharvest.exception.ScriptException
- ScriptingLanguage - Enum in org.webharvest.runtime.scripting
-
Created by IntelliJ IDEA.
- ScriptingVariable - Class in org.webharvest.runtime.variables
-
This variables are unwrapped when passing into script engines and preserve the mutable collections in the original state when passing over Scraper
- ScriptingVariable(Object) - Constructor for class org.webharvest.runtime.variables.ScriptingVariable
- ScriptProcessor - Class in org.webharvest.runtime.processors
-
Script processor - executes script defined in the body.
- ScriptProcessor() - Constructor for class org.webharvest.runtime.processors.ScriptProcessor
- ScriptSource - Class in org.webharvest.runtime.scripting
-
Created by IntelliJ IDEA.
- ScriptSource(String, ScriptingLanguage) - Constructor for class org.webharvest.runtime.scripting.ScriptSource
- serializeItem(Item, Properties) - Static method in class org.webharvest.utils.CommonUtil
-
Serializes item after XPath or XQuery processor execution using Saxon.
- setCharset(String) - Method in class org.webharvest.definition.HttpDef
- setCharset(String) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
-
Deprecated.Sets default configuration's charset.
- setCharset(String) - Method in interface org.webharvest.runtime.DynamicScopeContext
-
Sets default configuration's charset.
- setCharset(String) - Method in class org.webharvest.runtime.ScraperContext
-
Sets default configuration's charset.
- setConfig(Config) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
-
Deprecated.
- setConfig(Config) - Method in interface org.webharvest.runtime.DynamicScopeContext
-
Deprecated.
- setConfig(Config) - Method in class org.webharvest.runtime.ScraperContext
-
Deprecated.
- setContentType(String) - Method in class org.webharvest.definition.HttpDef
- setCookiePolicy(String) - Method in class org.webharvest.definition.HttpDef
- setCookiePolicy(String) - Method in class org.webharvest.runtime.web.HttpClientManager
- setElementDef(TDef) - Method in class org.webharvest.runtime.processors.AbstractProcessor
-
Sets appropriate element definition to the processor.
- setElementDef(TDef) - Method in class org.webharvest.runtime.processors.AbstractProcessorDecorator
-
Sets appropriate element definition to the processor.
- setElementDef(TDef) - Method in interface org.webharvest.runtime.processors.Processor
-
Deprecated.Find better solution to create processor in valid state
- setFollowRedirects(String) - Method in class org.webharvest.definition.HttpDef
- setFunctionResult(Variable) - Method in class org.webharvest.runtime.processors.CallProcessor
- setHostnameVerification(boolean) - Method in class org.apache.commons.httpclient.contrib.ssl.StrictSSLProtocolSocketFactory
-
Set the host name verification flag.
- setIgnoreResponseBody(String) - Method in class org.webharvest.definition.HttpDef
- setLocalVar(String, Object) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
-
Deprecated.
- setLocalVar(String, Object) - Method in interface org.webharvest.runtime.DynamicScopeContext
- setLocalVar(String, Object) - Method in class org.webharvest.runtime.ScraperContext
- setLocalVar(String, Variable) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
-
Deprecated.
- setLocalVar(String, Variable) - Method in interface org.webharvest.runtime.DynamicScopeContext
- setLocalVar(String, Variable) - Method in class org.webharvest.runtime.ScraperContext
- setLocalVar(Map<String, Object>) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
-
Deprecated.
- setLocalVar(Map<String, Object>) - Method in interface org.webharvest.runtime.DynamicScopeContext
- setLocalVar(Map<String, Object>) - Method in class org.webharvest.runtime.ScraperContext
- setLocation(int, int) - Method in class org.webharvest.definition.XmlNode
- setMethod(String) - Method in class org.webharvest.definition.HttpDef
- setParentProcessor(Processor) - Method in class org.webharvest.runtime.processors.AbstractProcessor
-
Sets reference to parent
Processor
. - setParentProcessor(Processor) - Method in class org.webharvest.runtime.processors.AbstractProcessorDecorator
-
Sets reference to parent
Processor
. - setParentProcessor(Processor) - Method in class org.webharvest.runtime.processors.BodyProcessor.Builder
-
Sets reference to the parent
Processor
. - setParentProcessor(Processor) - Method in interface org.webharvest.runtime.processors.Processor
-
Sets reference to parent
Processor
. - setPassword(String) - Method in class org.webharvest.definition.HttpDef
- setProperty(String, Object) - Method in class org.webharvest.runtime.processors.AbstractProcessor
-
Defines processor runtime property with specified name and value.
- setProxyCredentialsNTDomain(String) - Method in class org.webharvest.runtime.web.HttpClientManager.ProxySettings.Builder
- setProxyCredentialsNTHost(String) - Method in class org.webharvest.runtime.web.HttpClientManager.ProxySettings.Builder
- setProxyCredentialsPassword(String) - Method in class org.webharvest.runtime.web.HttpClientManager.ProxySettings.Builder
- setProxyCredentialsUsername(String) - Method in class org.webharvest.runtime.web.HttpClientManager.ProxySettings.Builder
- setProxyPort(int) - Method in class org.webharvest.runtime.web.HttpClientManager.ProxySettings.Builder
- setResponse(HttpResponseWrapper) - Method in class org.webharvest.runtime.web.HttpInfo
- setRetryAttempts(String) - Method in class org.webharvest.definition.HttpDef
- setRetryDelay(String) - Method in class org.webharvest.definition.HttpDef
- setRetryDelayFactor(String) - Method in class org.webharvest.definition.HttpDef
- setScriptingLanguage(ScriptingLanguage) - Method in class org.webharvest.deprecated.runtime.ScraperContext10
-
Deprecated.
- setScriptingLanguage(ScriptingLanguage) - Method in interface org.webharvest.runtime.DynamicScopeContext
-
Deprecated.
- setScriptingLanguage(ScriptingLanguage) - Method in class org.webharvest.runtime.ScraperContext
-
Deprecated.
- setUrl(String) - Method in class org.webharvest.definition.HttpDef
- setUsername(String) - Method in class org.webharvest.definition.HttpDef
- SetVarPlugin - Class in org.webharvest.runtime.processors.plugins.variable
-
Support for database operations.
- SetVarPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.variable.SetVarPlugin
- shouldNeverHappen(Throwable) - Static method in class org.webharvest.utils.Assert
- size() - Method in class org.webharvest.utils.Stack
- SleepPlugin - Class in org.webharvest.runtime.processors.plugins
- SleepPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.SleepPlugin
- space - Static variable in class org.webharvest.utils.SystemUtilities
- SPLITTER_WIDTH - Static variable in class org.webharvest.WHConstants
- Stack<T> - Class in org.webharvest.utils
-
Simple Stack (LIFO queue).
- Stack() - Constructor for class org.webharvest.utils.Stack
- StandaloneConnectionPool - Class in org.webharvest.runtime.database
-
Default, standalone
ConnectionFactory
interface implementation. - StandaloneConnectionPool() - Constructor for class org.webharvest.runtime.database.StandaloneConnectionPool
- statusCode - Variable in class org.webharvest.runtime.web.HttpInfo
- StatusHolder - Interface in org.webharvest.runtime
-
Component responsible for providing information about current status of being processed configuration.
- statusText - Variable in class org.webharvest.runtime.web.HttpInfo
- stop(ScraperExecutionStoppedEvent) - Method in class org.webharvest.runtime.EventBasedStatusHolder
-
Changes status to
ScraperState.STOPPED
. - STOPPED - org.webharvest.runtime.ScraperState
-
Scraper has been stopped by the user.
- StoppedOrExitedProcessor<TDef extends IElementDef> - Class in org.webharvest.runtime.processors
-
AbstractProcessorDecorator
implementation which decoratesProcessor#run(Scraper, DynamicScopeContext)
method in the way that if status of the processing is 'stopped' or 'exit' then returnsEmptyVariable.INSTANCE
otherwise it delegates invocation to decoratedProcessor
. - StoppedOrExitedProcessor(Processor<TDef>) - Constructor for class org.webharvest.runtime.processors.StoppedOrExitedProcessor
-
Default class constructor which accepts
Processor
instance which is going to be decorated by this class. - StrictSSLProtocolSocketFactory - Class in org.apache.commons.httpclient.contrib.ssl
-
A
SecureProtocolSocketFactory
that uses JSSE to create SSL sockets. - StrictSSLProtocolSocketFactory() - Constructor for class org.apache.commons.httpclient.contrib.ssl.StrictSSLProtocolSocketFactory
-
Constructor for StrictSSLProtocolSocketFactory.
- StrictSSLProtocolSocketFactory(boolean) - Constructor for class org.apache.commons.httpclient.contrib.ssl.StrictSSLProtocolSocketFactory
-
Constructor for StrictSSLProtocolSocketFactory.
- StylesheetProcessor - Class in org.webharvest.runtime.processors
- StylesheetProcessor() - Constructor for class org.webharvest.runtime.processors.StylesheetProcessor
- subclassesOf(Class<?>) - Static method in class org.webharvest.ioc.TypeMatchers
-
Matcher matches all classes that extends, implements or is the same as baseClass
- SystemUtilities - Class in org.webharvest.utils
-
Collection of useful constants and functions that are available in each scraper context.
- SystemUtilities(DynamicScopeContext) - Constructor for class org.webharvest.utils.SystemUtilities
T
- tab - Static variable in class org.webharvest.utils.SystemUtilities
- TargetNamespace - Annotation Type in org.webharvest.runtime.processors.plugins
-
This annotation may be used on the web harvest plugin class to indicate one or more target XML namespaces for the plugin.
- TemplateDef - Class in org.webharvest.definition
-
Definition of template task.
- TemplateDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.TemplateDef
- TemplateException - Exception in org.webharvest.exception
-
Template exception - occured during executing templete processor.
- TemplateException() - Constructor for exception org.webharvest.exception.TemplateException
- TemplateException(String) - Constructor for exception org.webharvest.exception.TemplateException
- TemplateException(String, Throwable) - Constructor for exception org.webharvest.exception.TemplateException
- TemplateException(Throwable) - Constructor for exception org.webharvest.exception.TemplateException
- TemplateProcessor - Class in org.webharvest.runtime.processors
-
Template processor.
- TemplateProcessor() - Constructor for class org.webharvest.runtime.processors.TemplateProcessor
- TemplaterException - Exception in org.webharvest.exception
-
Templater exception.
- TemplaterException() - Constructor for exception org.webharvest.exception.TemplaterException
- TemplaterException(String) - Constructor for exception org.webharvest.exception.TemplaterException
- TemplaterException(String, Throwable) - Constructor for exception org.webharvest.exception.TemplaterException
- TemplaterException(Throwable) - Constructor for exception org.webharvest.exception.TemplaterException
- TextDef - Class in org.webharvest.definition
-
Definition of text processor.
- TextDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.TextDef
- TextProcessor - Class in org.webharvest.runtime.processors
-
Text processor.
- TextProcessor() - Constructor for class org.webharvest.runtime.processors.TextProcessor
- ThreadLocalCache<K,V> - Class in org.webharvest
-
Cache
implementation based onThreadLocal
, that is, allowing each thread to have separate cache bindings. - ThreadLocalCache() - Constructor for class org.webharvest.ThreadLocalCache
- time() - Method in class org.webharvest.utils.SystemUtilities
- TIME_TO_LAUNCH_WEB_SERVER - Static variable in class org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserPlugin
- toArray() - Method in class org.webharvest.runtime.variables.Variable
-
Safely converts this variable to array of objects.
- toBinary() - Method in class org.webharvest.runtime.variables.EmptyVariable
- toBinary() - Method in class org.webharvest.runtime.variables.ListVariable
- toBinary() - Method in class org.webharvest.runtime.variables.NodeVariable
- toBinary() - Method in class org.webharvest.runtime.variables.Variable
- toBinary(String) - Method in class org.webharvest.runtime.variables.EmptyVariable
- toBinary(String) - Method in class org.webharvest.runtime.variables.ListVariable
- toBinary(String) - Method in class org.webharvest.runtime.variables.NodeVariable
- toBinary(String) - Method in class org.webharvest.runtime.variables.Variable
- toBoolean() - Method in class org.webharvest.runtime.variables.Variable
-
Safely converts this variable to boolean value.
- toDouble() - Method in class org.webharvest.runtime.variables.Variable
-
Safely converts this variable to double value.
- toInt() - Method in class org.webharvest.runtime.variables.Variable
-
Safely converts this variable to integer value.
- toIterator() - Method in class org.webharvest.runtime.variables.EmptyVariable
- toIterator() - Method in class org.webharvest.runtime.variables.ListVariable
- toIterator() - Method in class org.webharvest.runtime.variables.NodeVariable
- toIterator() - Method in class org.webharvest.runtime.variables.Variable
- tokenize(String, String) - Static method in class org.webharvest.utils.CommonUtil
-
Tokenize given string for specified delimiter(s).
- tokenize(String, String, boolean, boolean) - Static method in class org.webharvest.utils.CommonUtil
- TokenizePlugin - Class in org.webharvest.runtime.processors.plugins
-
Support for database operations.
- TokenizePlugin() - Constructor for class org.webharvest.runtime.processors.plugins.TokenizePlugin
- toList() - Method in class org.webharvest.runtime.processors.plugins.db.DbRowVariable
- toList() - Method in class org.webharvest.runtime.variables.EmptyVariable
- toList() - Method in class org.webharvest.runtime.variables.ListVariable
- toList() - Method in class org.webharvest.runtime.variables.NodeVariable
- toList() - Method in class org.webharvest.runtime.variables.Variable
- toLong() - Method in class org.webharvest.runtime.variables.Variable
-
Safely converts this variable to long value.
- toString() - Method in class org.webharvest.definition.ElementName
- toString() - Method in class org.webharvest.definition.validation.SchemaSource
-
Returns system identifier of the XML schema's
Source
. - toString() - Method in class org.webharvest.runtime.processors.plugins.db.DbRowVariable
- toString() - Method in class org.webharvest.runtime.variables.EmptyVariable
- toString() - Method in class org.webharvest.runtime.variables.ListVariable
- toString() - Method in class org.webharvest.runtime.variables.NodeVariable
- toString() - Method in class org.webharvest.runtime.variables.Variable
- toString() - Method in class org.webharvest.utils.CommonUtil.IntPair
- toString() - Method in class org.webharvest.utils.XmlNodeWrapper
- toString(String) - Method in class org.webharvest.runtime.processors.plugins.db.DbRowVariable
- toString(String) - Method in class org.webharvest.runtime.variables.EmptyVariable
- toString(String) - Method in class org.webharvest.runtime.variables.ListVariable
- toString(String) - Method in class org.webharvest.runtime.variables.NodeVariable
- toString(String) - Method in class org.webharvest.runtime.variables.Variable
- toString(String, String) - Method in class org.webharvest.runtime.variables.ListVariable
- totalLength - Variable in class org.webharvest.runtime.web.HttpInfo
- totalResponses - Variable in class org.webharvest.runtime.web.HttpInfo
- transform(I) - Method in class org.webharvest.definition.validation.TransformerPair
-
Transforms input object into output object.
- transform(I) - Method in interface org.webharvest.Transformer
-
Transforms input object into output object.
- transform(String) - Method in class org.webharvest.definition.validation.ResourcePathToURITransformer
-
Transforms input object into output object.
- transform(URI) - Method in class org.webharvest.definition.validation.URIToSchemaSourceTransformer
-
Transforms input object into output object.
- TransformationException - Exception in org.webharvest
-
Checked exception thrown if transformation process has failed.
- TransformationException(Throwable) - Constructor for exception org.webharvest.TransformationException
-
TransformationException
constructor accepting cause of the exception. - Transformer<I,O> - Interface in org.webharvest
-
A component which is capable of transforming object from one type to another type.
- TransformerPair<I,T,O> - Class in org.webharvest.definition.validation
-
Implementation of
Transformer
interface connecting two otherTransformer
s where the output type of the first one is the same as input of the second one. - TransformerPair(Transformer<I, T>, Transformer<T, O>) - Constructor for class org.webharvest.definition.validation.TransformerPair
-
Default class constructor which accepts reference to two
Transformer
s. - TryDef - Class in org.webharvest.definition
-
Definition of try-catche element.
- TryDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.TryDef
- TryProcessor - Class in org.webharvest.runtime.processors
-
OnError processor - sets .
- TryProcessor() - Constructor for class org.webharvest.runtime.processors.TryProcessor
- TYPE_BINARY - Static variable in class org.webharvest.runtime.variables.Types
- TYPE_DEFAULT - Static variable in class org.webharvest.runtime.variables.Types
- TYPE_EMPTY - Static variable in class org.webharvest.runtime.variables.Types
- TYPE_LIST - Static variable in class org.webharvest.runtime.variables.Types
- TYPE_TEXT - Static variable in class org.webharvest.runtime.variables.Types
- TYPE_XML - Static variable in class org.webharvest.runtime.variables.Types
- TypeMatchers - Class in org.webharvest.ioc
- TypeMatchers() - Constructor for class org.webharvest.ioc.TypeMatchers
- Types - Class in org.webharvest.runtime.variables
-
Variable types.
- Types() - Constructor for class org.webharvest.runtime.variables.Types
U
- unbind(K) - Method in class org.webharvest.AbstractRegistry
-
Unbinds name and the associated value from the registry.
- unbind(K) - Method in class org.webharvest.LockedRegistry
-
Unbinds name and the associated value from the registry.
- unbind(K) - Method in interface org.webharvest.Registry
-
Unbinds name and the associated value from the registry.
- UNDEFINED_LOCATION - Static variable in interface org.webharvest.definition.ConfigSource
-
Configuration source undefined location.
- UNKNOWN - org.webharvest.runtime.ScraperState
-
Deprecated.
- unregisterPlugin(String, String) - Method in interface org.webharvest.definition.ConfigurableResolver
-
Deprecated.
- unregisterPlugin(String, String) - Method in class org.webharvest.definition.DefinitionResolver
- URIToSchemaSourceTransformer - Class in org.webharvest.definition.validation
-
An implementation of
Transformer
interface which supports transformation from given resourceURI
to appropriate instance ofSchemaSource
. - URIToSchemaSourceTransformer() - Constructor for class org.webharvest.definition.validation.URIToSchemaSourceTransformer
- URLConfigSource - Class in org.webharvest.definition
-
Implementation of
ConfigSource
that uses a HTTP protocol as source of XML configurations. - URLConfigSource(URL) - Constructor for class org.webharvest.definition.URLConfigSource
-
Class constructor expecting
URL
as configuration source. - UserException - Exception in org.webharvest.exception
-
Exception explicitly thrown by a user.
- UserException(String) - Constructor for exception org.webharvest.exception.UserException
V
- v1 - org.webharvest.definition.Config.Version
- v2 - org.webharvest.definition.Config.Version
- validAttributes() - Method in annotation type org.webharvest.annotation.Definition
-
This method should return all possible attribute names for the plugin processor.
- validSubprocessors() - Method in annotation type org.webharvest.annotation.Definition
-
This method should return all names of all allowed processors inside the body of this processor plugin.
- value() - Method in annotation type org.webharvest.annotation.Definition
-
Defines name of the processor.
- value() - Method in annotation type org.webharvest.runtime.processors.plugins.TargetNamespace
-
Indicates target XML namespaces for the plugin.
- VALUE_PROPERTY_NAME - Static variable in class org.webharvest.WHConstants
- valueOf(String) - Static method in enum org.webharvest.definition.Config.Version
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.webharvest.runtime.database.DefaultDriverManager
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.webharvest.runtime.ScraperState
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.webharvest.runtime.scripting.ScriptingLanguage
-
Returns the enum constant of this type with the specified name.
- ValueOfPlugin - Class in org.webharvest.runtime.processors.plugins
- ValueOfPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.ValueOfPlugin
- values() - Static method in enum org.webharvest.definition.Config.Version
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum org.webharvest.runtime.database.DefaultDriverManager
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum org.webharvest.runtime.ScraperState
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum org.webharvest.runtime.scripting.ScriptingLanguage
-
Returns an array containing the constants of this enum type, in the order they are declared.
- VAR_END - Static variable in class org.webharvest.runtime.templaters.BaseTemplater
- VAR_START - Static variable in class org.webharvest.runtime.templaters.BaseTemplater
- VarDef - Class in org.webharvest.definition
-
Definition of variable call.
- VarDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.VarDef
- VarDefDef - Class in org.webharvest.definition
-
Definition of variable.
- VarDefDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.VarDefDef
- VarDefProcessor - Class in org.webharvest.deprecated.runtime.processors
-
Deprecated.Use either DefProcessor or SetProcessor
- VarDefProcessor() - Constructor for class org.webharvest.deprecated.runtime.processors.VarDefProcessor
-
Deprecated.
- Variable - Class in org.webharvest.runtime.variables
-
Variables Interface.
- Variable() - Constructor for class org.webharvest.runtime.variables.Variable
- VariableException - Exception in org.webharvest.exception
-
Variable processor exception.
- VariableException() - Constructor for exception org.webharvest.exception.VariableException
- VariableException(String) - Constructor for exception org.webharvest.exception.VariableException
- VariableException(String, Throwable) - Constructor for exception org.webharvest.exception.VariableException
- VariableException(Throwable) - Constructor for exception org.webharvest.exception.VariableException
- VariableName - Class in org.webharvest.runtime.variables
-
Represents name of the
Variable
that is set onDynamicScopeContext
. - VariableName(String) - Constructor for class org.webharvest.runtime.variables.VariableName
-
Creates new
Variable
name. - VarProcessor - Class in org.webharvest.deprecated.runtime.processors
-
Deprecated.Use
or instead - VarProcessor() - Constructor for class org.webharvest.deprecated.runtime.processors.VarProcessor
-
Deprecated.
- visit(ConfigLocationVisitor) - Method in class org.webharvest.definition.AbstractConfigSource
-
Depending on owned
ConfigSource.Location
allows the ancestor class to accept interceptedConfigLocationVisitor
. - visit(ConfigLocationVisitor) - Method in class org.webharvest.definition.BufferConfigSource
-
Depending on owned
ConfigSource.Location
allows the ancestor class to accept interceptedConfigLocationVisitor
. - visit(ConfigLocationVisitor) - Method in class org.webharvest.definition.FileConfigSource
-
Depending on owned
ConfigSource.Location
allows the ancestor class to accept interceptedConfigLocationVisitor
. - visit(ConfigLocationVisitor) - Method in class org.webharvest.definition.URLConfigSource
-
Depending on owned
ConfigSource.Location
allows the ancestor class to accept interceptedConfigLocationVisitor
. - visit(FileConfigSource.FileLocation) - Method in interface org.webharvest.definition.ConfigLocationVisitor
-
Takes certain action on
FileConfigSource.FileLocation
. - visit(FileConfigSource.FileLocation) - Method in class org.webharvest.definition.IncludeVisitor
-
Takes certain action on
FileConfigSource.FileLocation
. - visit(URLConfigSource.URLLocation) - Method in interface org.webharvest.definition.ConfigLocationVisitor
-
Takes certain action on
URLConfigSource.URLLocation
. - visit(URLConfigSource.URLLocation) - Method in class org.webharvest.definition.IncludeVisitor
-
Takes certain action on
URLConfigSource.URLLocation
.
W
- WebBrowserJavascriptPlugin - Class in org.webharvest.runtime.processors.plugins.webbrowser
-
Evaluates javascript on the page inside headless web browser.
- WebBrowserJavascriptPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserJavascriptPlugin
- WebBrowserLoadPlugin - Class in org.webharvest.runtime.processors.plugins.webbrowser
-
Load page inside headless web browser.
- WebBrowserLoadPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserLoadPlugin
- WebBrowserlPluginException - Exception in org.webharvest.runtime.processors.plugins.webbrowser
-
Runtime exception for MailPlugin
- WebBrowserlPluginException(String) - Constructor for exception org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserlPluginException
- WebBrowserlPluginException(String, Throwable) - Constructor for exception org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserlPluginException
- WebBrowserlPluginException(Throwable) - Constructor for exception org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserlPluginException
- WebBrowserPlugin - Class in org.webharvest.runtime.processors.plugins.webbrowser
-
Support headless web browser supported by PhantomJS open source project.
- WebBrowserPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserPlugin
- WebBrowserRenderPlugin - Class in org.webharvest.runtime.processors.plugins.webbrowser
-
Evaluates javascript on the page inside headless web browser.
- WebBrowserRenderPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.webbrowser.WebBrowserRenderPlugin
- WebHarvestPlugin - Class in org.webharvest.runtime.processors
-
Base for all user-defined plugins.
- WebHarvestPlugin() - Constructor for class org.webharvest.runtime.processors.WebHarvestPlugin
- WebHarvestPluginDef - Class in org.webharvest.definition
-
Definition of all plugin processors.
- WebHarvestPluginDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.WebHarvestPluginDef
- WebScraper - Interface in org.webharvest.runtime
- WHConstants - Class in org.webharvest
-
Created by IntelliJ IDEA.
- WHConstants() - Constructor for class org.webharvest.WHConstants
- WhileDef - Class in org.webharvest.definition
-
Definition of while loop processor (while-empty and while-not-empty).
- WhileDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.WhileDef
- WhileProcessor - Class in org.webharvest.runtime.processors
-
Conditional processor.
- WhileProcessor() - Constructor for class org.webharvest.runtime.processors.WhileProcessor
- withPackageName(String) - Method in class org.webharvest.definition.AnnotatedPluginsPostProcessorBuilder
-
Sets the package name to scan for plugins.
- withPluginClass(Class<? extends Processor>) - Method in class org.webharvest.definition.PluginDefinitionBuilder
-
Sets the plugin class for the definition.
- withRegistrationStrategy(PluginRegistrationStrategy) - Method in class org.webharvest.definition.AnnotatedPluginsPostProcessorBuilder
-
Sets the registration strategy to use.
- withXmlNode(XmlNode) - Method in class org.webharvest.definition.PluginDefinitionBuilder
-
Sets the XML node for the plugin definition.
- WorkingDir - Annotation Type in org.webharvest.ioc
-
Guice binder helper annotation for scraper's working directory path (indicates where temporary files are kept).
X
- x - Variable in class org.webharvest.utils.CommonUtil.IntPair
- XmlAttribute - Class in org.webharvest.definition
-
Information about single xml attribute
- XmlAttribute(String, String, String) - Constructor for class org.webharvest.definition.XmlAttribute
- XMLConfig - Class in org.webharvest.definition
-
Implementation of
ConfigSource
capable to work with XML based configurations. - XMLConfig(ConfigSource, ConfigParser) - Constructor for class org.webharvest.definition.XMLConfig
-
Constructor for both Guice and non-Guice usage.
- xmlNode - Variable in class org.webharvest.definition.AbstractElementDef
- XmlNode - Class in org.webharvest.definition
- XmlNode(String, String, String, XmlNode) - Constructor for class org.webharvest.definition.XmlNode
-
Constructor that defines name and connects to specified parent element.
- XmlNodeWrapper - Class in org.webharvest.utils
- XmlNodeWrapper(Item, Properties) - Constructor for class org.webharvest.utils.XmlNodeWrapper
- XMLNS_CORE - Static variable in class org.webharvest.WHConstants
- XMLNS_CORE_10 - Static variable in class org.webharvest.WHConstants
- XMLNS_CORE_10_ALIASES - Static variable in class org.webharvest.WHConstants
- XMLNS_PARAM - Static variable in class org.webharvest.WHConstants
- XMLNS_VAR - Static variable in class org.webharvest.WHConstants
- XMLProcessor - Class in org.webharvest.runtime.processors
- XMLProcessor() - Constructor for class org.webharvest.runtime.processors.XMLProcessor
- XmlToJsonPlugin - Class in org.webharvest.runtime.processors.plugins
-
Converter from XML to JSON
- XmlToJsonPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.XmlToJsonPlugin
- XmlUtil - Class in org.webharvest.utils
-
XML utils - contains common logic for XML handling
- XmlUtil() - Constructor for class org.webharvest.utils.XmlUtil
- XmlValidator - Class in org.webharvest.utils
- XmlValidator() - Constructor for class org.webharvest.utils.XmlValidator
- XPathDef - Class in org.webharvest.definition
-
Definition of XPath processor.
- XPathDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.XPathDef
- XPathProcessor - Class in org.webharvest.runtime.processors
-
XQuery processor.
- XPathProcessor() - Constructor for class org.webharvest.runtime.processors.XPathProcessor
- XQExpression - Class in org.webharvest.runtime.processors
- XQExpression() - Constructor for class org.webharvest.runtime.processors.XQExpression
- XQParamProcessor - Class in org.webharvest.runtime.processors
- XQParamProcessor() - Constructor for class org.webharvest.runtime.processors.XQParamProcessor
- XQueryDef - Class in org.webharvest.definition
-
Definition of XQuery processor.
- XQueryDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.XQueryDef
- XQueryExpressionPool - Class in org.webharvest.runtime.processors
-
Class represnts simple pool for XQuery expressions.
- XQueryExpressionPool(StaticQueryContext) - Constructor for class org.webharvest.runtime.processors.XQueryExpressionPool
- XQueryExternalParamDef - Class in org.webharvest.definition
-
Definition of XQuery external parameter.
- XQueryExternalParamDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.XQueryExternalParamDef
- XQueryProcessor - Class in org.webharvest.runtime.processors
-
XQuery processor.
- XQueryProcessor() - Constructor for class org.webharvest.runtime.processors.XQueryProcessor
- XsltDef - Class in org.webharvest.definition
-
Definition of user-defined function.
- XsltDef(XmlNode, Class<? extends Processor>) - Constructor for class org.webharvest.definition.XsltDef
- XsltException - Exception in org.webharvest.exception
-
Template exception - occured during executing XSLT processor.
- XsltException() - Constructor for exception org.webharvest.exception.XsltException
- XsltException(String) - Constructor for exception org.webharvest.exception.XsltException
- XsltException(String, Throwable) - Constructor for exception org.webharvest.exception.XsltException
- XsltException(Throwable) - Constructor for exception org.webharvest.exception.XsltException
- XsltProcessor - Class in org.webharvest.runtime.processors
-
XSLT processor.
- XsltProcessor() - Constructor for class org.webharvest.runtime.processors.XsltProcessor
Y
- y - Variable in class org.webharvest.utils.CommonUtil.IntPair
Z
- ZipEntryPlugin - Class in org.webharvest.runtime.processors.plugins.zip
-
Zip entry plugin - can be used only inside zip plugin.
- ZipEntryPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.zip.ZipEntryPlugin
- ZipPlugin - Class in org.webharvest.runtime.processors.plugins.zip
-
ZIP processor
- ZipPlugin() - Constructor for class org.webharvest.runtime.processors.plugins.zip.ZipPlugin
- ZipPluginException - Exception in org.webharvest.runtime.processors.plugins.zip
-
Runtime exception for ZipPlugin
- ZipPluginException(String) - Constructor for exception org.webharvest.runtime.processors.plugins.zip.ZipPluginException
- ZipPluginException(String, Throwable) - Constructor for exception org.webharvest.runtime.processors.plugins.zip.ZipPluginException
- ZipPluginException(Throwable) - Constructor for exception org.webharvest.runtime.processors.plugins.zip.ZipPluginException
All Classes All Packages