| Modifier and Type | Method and Description |
|---|---|
Crawler |
build()
Builds the crawler with the options specified.
|
Crawler.Builder |
fetcher(@NotNull Fetcher fetcher)
Sets the Fetcher to be used, if not set, default will be chosen.
|
Crawler.Builder |
maxConnections(int maxConnections)
The number of concurrent connections allowed out of the client.
|
Crawler.Builder |
maxTries(int maxTries)
Sets number of times to retry for a request.
|
Crawler.Builder |
name(@NotNull String name)
Sets the name for crawler thread.
|
Crawler.Builder |
parallism(int parallelism)
Sets the parallelism level.
|
Crawler.Builder |
propRetainProxy(double propRetainProxy)
Sets the proportion of max tries where a specified proxy, if specified will be used.
|
Crawler.Builder |
router(@NotNull HandlerRouter router)
Sets HandlerRouter to be used.
|
Crawler.Builder |
scheduler(@NotNull AbstractQueueScheduler scheduler)
Sets the Scheduler to be used, if not set, default will be chosen.
|
Crawler.Builder |
session(@NotNull Session session)
Sets the Session to be used, if not set, defaults to none.
|
Crawler.Builder |
sleepScheduler(@NotNull SleepScheduler sleepScheduler)
Sets the SleepScheduler to be used, if not set, default will be chosen.
|
Crawler.Builder |
workerManager(@NotNull WorkerManager workerManager)
Sets the WorkerManager to be used, if not set, default will be chosen.
|
public Crawler.Builder name(@NotNull @NotNull String name)
name - name for crawler threadpublic Crawler.Builder fetcher(@NotNull @NotNull Fetcher fetcher)
fetcher - fetcher to be used.public Crawler.Builder parallism(int parallelism)
parallelism - the parallelism level.public Crawler.Builder workerManager(@NotNull @NotNull WorkerManager workerManager)
workerManager - result workerManager to be used.public Crawler.Builder scheduler(@NotNull @NotNull AbstractQueueScheduler scheduler)
scheduler - scheduler to be used.public Crawler.Builder router(@NotNull @NotNull HandlerRouter router)
router - handler router to be used.public Crawler.Builder maxConnections(int maxConnections)
maxConnections - maximum number of concurrent connections.public Crawler.Builder maxTries(int maxTries)
maxTries - max retry times.public Crawler.Builder propRetainProxy(double propRetainProxy)
This only comes into effect when a specific proxy is set for the request. This proxy set will be overridden beyond this threshold.
propRetainProxy - threshold percentage.public Crawler.Builder sleepScheduler(@NotNull @NotNull SleepScheduler sleepScheduler)
sleepScheduler - sleepAndGetTime scheduler to be used.public Crawler.Builder session(@NotNull @NotNull Session session)
session - Sessions where variables are definedpublic Crawler build()
Copyright © 2018. All rights reserved.