Class CertificateCrawler
- java.lang.Object
  - edu.uci.ics.crawler4j.crawler.WebCrawler
    - de.trustable.ca3s.core.service.dir.CertificateCrawler
- All Implemented Interfaces:
Runnable
public class CertificateCrawler extends edu.uci.ics.crawler4j.crawler.WebCrawler

This class crawls the web for certificates and inserts them into the database.
Constructor Summary
Constructors
CertificateCrawler(List<String> crawlDomains, String regEx, CertificateUtil certUtil, AuditService auditService, ImportInfo importInfo)
Method Summary
Modifier and Type, Method, Description
boolean shouldVisit(edu.uci.ics.crawler4j.crawler.Page referringPage, edu.uci.ics.crawler4j.url.WebURL url)
    Decides whether the given URL may hold anything of interest.
void visit(edu.uci.ics.crawler4j.crawler.Page page)
    Retrieves certificates from the visited page.
Methods inherited from class edu.uci.ics.crawler4j.crawler.WebCrawler
getMyController, getMyId, getMyLocalData, getThread, handlePageStatusCode, handleUrlBeforeProcess, init, isNotWaitingForNewURLs, onBeforeExit, onContentFetchError, onContentFetchError, onPageBiggerThanMaxSize, onParseError, onRedirectedStatusCode, onStart, onUnexpectedStatusCode, onUnhandledException, run, setThread, shouldFollowLinksIn
Constructor Detail
CertificateCrawler
public CertificateCrawler(List<String> crawlDomains, String regEx, CertificateUtil certUtil, AuditService auditService, ImportInfo importInfo)
Method Detail
shouldVisit
public boolean shouldVisit(edu.uci.ics.crawler4j.crawler.Page referringPage, edu.uci.ics.crawler4j.url.WebURL url)
Decides whether the given URL may hold anything of interest.
- Overrides:
shouldVisit in class edu.uci.ics.crawler4j.crawler.WebCrawler
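This page does not say how shouldVisit reaches its decision. As a rough illustration only, the sketch below assumes that the constructor arguments crawlDomains and regEx act as a URL prefix allow-list and a selecting pattern. The class UrlFilterSketch, the example URLs, and the pattern are all hypothetical, and the real method receives crawler4j Page and WebURL objects rather than a plain string.

```java
import java.util.List;
import java.util.Locale;
import java.util.regex.Pattern;

/** Hypothetical stand-in for a shouldVisit-style decision; not the ca3s implementation. */
public class UrlFilterSketch {

    private final List<String> crawlDomains;
    private final Pattern pattern;

    public UrlFilterSketch(List<String> crawlDomains, String regEx) {
        this.crawlDomains = crawlDomains;
        this.pattern = Pattern.compile(regEx);
    }

    /** Returns true if the URL starts with an allowed prefix and fully matches the pattern. */
    public boolean shouldVisit(String url) {
        String href = url.toLowerCase(Locale.ROOT);
        boolean inDomain = false;
        for (String domain : crawlDomains) {
            if (href.startsWith(domain.toLowerCase(Locale.ROOT))) {
                inDomain = true;
                break;
            }
        }
        // Assumption: the regular expression selects URLs rather than excluding them.
        return inDomain && pattern.matcher(href).matches();
    }

    public static void main(String[] args) {
        UrlFilterSketch filter = new UrlFilterSketch(
                List.of("https://pki.example.org/"), ".*\\.(crt|cer|der|pem)$");
        System.out.println(filter.shouldVisit("https://pki.example.org/ca/root.crt")); // true
        System.out.println(filter.shouldVisit("https://other.example.org/root.crt"));  // false
        System.out.println(filter.shouldVisit("https://pki.example.org/logo.png"));    // false
    }
}
```

A filter this strict would also reject the HTML pages that link to certificate files, so a real crawler would likely need a looser rule for pages it only wants to follow links from.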
visit
public void visit(edu.uci.ics.crawler4j.crawler.Page page)
Retrieves certificates from the visited page.
- Overrides:
visit in class edu.uci.ics.crawler4j.crawler.WebCrawler
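How visit extracts a certificate is not documented here. One plausible approach, shown below as a sketch, is to hand the raw bytes of a fetched page (in crawler4j, available through Page.getContentData()) to the JDK's CertificateFactory and ignore content that does not parse. The class CertificateParseSketch and the method tryParse are hypothetical names. The database insertion through CertificateUtil is not shown, because its API is not described on this page.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.security.cert.CertificateException;
import java.security.cert.CertificateFactory;
import java.security.cert.X509Certificate;
import java.util.Optional;

/** Hypothetical helper: tries to read one X.509 certificate from fetched bytes. */
public class CertificateParseSketch {

    /** Returns the parsed certificate, or empty if the bytes are not a DER or PEM certificate. */
    public static Optional<X509Certificate> tryParse(byte[] content) {
        if (content == null || content.length == 0) {
            return Optional.empty();
        }
        try {
            CertificateFactory factory = CertificateFactory.getInstance("X.509");
            return Optional.ofNullable((X509Certificate)
                    factory.generateCertificate(new ByteArrayInputStream(content)));
        } catch (CertificateException e) {
            // Not a certificate: in this sketch the page is simply skipped.
            return Optional.empty();
        }
    }

    public static void main(String[] args) {
        byte[] html = "<html>no certificate here</html>".getBytes(StandardCharsets.UTF_8);
        System.out.println(tryParse(html).isPresent());
    }
}
```

Returning an Optional keeps the caller free of exception handling for the common case where a crawled page simply is not a certificate.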