An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

repo1.maven.org "crawler" keyword

View the packages on the repo1.maven.org package registry that are tagged with the "crawler" keyword.

Top 1.5% on repo1.maven.org
us.codecraft:webmagic-core 1.0.3
A crawler framework. It covers the whole lifecycle of crawler: downloading, url management, conte...
33 versions - Latest release: 7 months ago - 39 dependent packages - 1,670 dependent repositories - 11,022 stars on GitHub
org.apache.stormcrawler:stormcrawler-archetype 3.4.0 💰
A collection of resources for building low-latency, scalable web crawlers on Apache Storm.
5 versions - Latest release: 2 months ago - 930 stars on GitHub
org.apache.stormcrawler:stormcrawler-opensearch-archetype 3.4.0 💰
A collection of resources for building low-latency, scalable web crawlers on Apache Storm.
5 versions - Latest release: 2 months ago - 930 stars on GitHub
org.apache.stormcrawler:stormcrawler-solr-archetype 3.4.0 💰
A collection of resources for building low-latency, scalable web crawlers on Apache Storm.
3 versions - Latest release: 2 months ago - 930 stars on GitHub
it.skrape:core 0.4.1 💰
skrape{it} is a Kotlin-based HTML testing and web scraping library that can be used seamlessly in...
9 versions - Latest release: over 6 years ago - 1 dependent repositories - 840 stars on GitHub
com.digitalpebble:storm-crawler-aws 0.7 💰
AWS resources for StormCrawler
2 versions - Latest release: almost 10 years ago - 1 dependent repositories - 923 stars on GitHub
Top 9.3% on repo1.maven.org
org.codelibs.fess:fess 15.2.0
Fess is Full tExt Search System.
162 versions - Latest release: about 22 hours ago - 23 dependent packages - 21 dependent repositories - 1,055 stars on GitHub
org.codelibs.fess:fess-crawler-parent 15.2.0
Fess Crawler is Crawler Framework.
137 versions - Latest release: 1 day ago - 29 stars on GitHub
org.codelibs.fess:fess-crawler-lasta 15.2.0
This is LastaFlute support.
138 versions - Latest release: 1 day ago - 4 dependent packages - 6 dependent repositories - 29 stars on GitHub
org.codelibs.fess:fess-crawler 15.2.0
Fess Crawler is a crawler framework.
138 versions - Latest release: 1 day ago - 6 dependent packages - 3 dependent repositories - 29 stars on GitHub
org.codelibs.fess:fess-crawler-opensearch 15.2.0
Fess Crawler is Crawler Framework.
3 versions - Latest release: 1 day ago - 29 stars on GitHub
lt.tokenmill.crawling:elasticsearch 0.2.0
Framework to simplify news crawling
1 version - Latest release: almost 8 years ago - 3 dependent packages - 2 dependent repositories - 21 stars on GitHub
de.hs-heilbronn.mi:crawler4j-examples-base 5.1.3
Open Source Web Crawler for Java
26 versions - Latest release: about 1 month ago - 17 stars on GitHub
Top 1.7% on repo1.maven.org
us.codecraft:webmagic-extension 1.0.3
A crawler framework. It covers the whole lifecycle of crawler: downloading, url management, conte...
33 versions - Latest release: 7 months ago - 27 dependent packages - 1,671 dependent repositories - 11,021 stars on GitHub
Top 9.2% on repo1.maven.org
com.digitalpebble.stormcrawler:storm-crawler-core 1.12.1 💰
Storm-Crawler core Java API.
37 versions - Latest release: almost 7 years ago - 12 dependent packages - 14 dependent repositories - 925 stars on GitHub
it.skrape:skrape-it 1.0.0-alpha7 💰
A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data...
1 version - Latest release: almost 5 years ago - 1 dependent repositories - 840 stars on GitHub
it.skrape:skrapeit-core 0.6.0 💰
A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data...
13 versions - Latest release: over 6 years ago - 2 dependent packages - 6 dependent repositories - 840 stars on GitHub
it.skrape:skrapeit-async-fetcher 1.2.2 💰
A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data...
15 versions - Latest release: about 3 years ago - 1 dependent package - 2 dependent repositories - 840 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-parent 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
5 versions - Latest release: over 3 years ago - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-elasticsearch-client-v6 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
4 versions - Latest release: over 3 years ago - 2 dependent packages - 5 dependent repositories - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-elasticsearch-client-v7 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
3 versions - Latest release: over 3 years ago - 2 dependent packages - 5 dependent repositories - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-es7 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
3 versions - Latest release: over 3 years ago - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-crawler 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
5 versions - Latest release: over 3 years ago - 1 dependent package - 6 dependent repositories - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler 2.0.0
FS Crawler offers a simple way to index binary files into elasticsearch.
6 versions - Latest release: almost 10 years ago - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-it 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
4 versions - Latest release: over 3 years ago - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-crawler-ssh 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
5 versions - Latest release: over 3 years ago - 2 dependent packages - 6 dependent repositories - 1,407 stars on GitHub
Top 9.5% on repo1.maven.org
fr.pilato.elasticsearch.crawler:fscrawler-test-framework 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
5 versions - Latest release: over 3 years ago - 11 dependent packages - 6 dependent repositories - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-elasticsearch-client 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
5 versions - Latest release: over 3 years ago - 3 dependent packages - 6 dependent repositories - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-3rdparty 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
3 versions - Latest release: over 3 years ago - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-es6 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
4 versions - Latest release: over 3 years ago - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-settings 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
5 versions - Latest release: over 3 years ago - 7 dependent packages - 6 dependent repositories - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-workplacesearch-client 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
3 versions - Latest release: over 3 years ago - 3 dependent packages - 3 dependent repositories - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-es5 2.6
FS Crawler offers a simple way to index binary files into elasticsearch.
1 version - Latest release: over 6 years ago - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-cli 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
5 versions - Latest release: over 3 years ago - 3 dependent packages - 6 dependent repositories - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-crawler-ftp 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
3 versions - Latest release: over 3 years ago - 2 dependent packages - 3 dependent repositories - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-elasticsearch-client-v5 2.6
FS Crawler offers a simple way to index binary files into elasticsearch.
1 version - Latest release: over 6 years ago - 2 dependent packages - 3 dependent repositories - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-beans 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
5 versions - Latest release: over 3 years ago - 4 dependent packages - 6 dependent repositories - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-it-common 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
4 versions - Latest release: over 3 years ago - 1 dependent package - 6 dependent repositories - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-core 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
5 versions - Latest release: over 3 years ago - 4 dependent packages - 6 dependent repositories - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-docs 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
3 versions - Latest release: over 3 years ago - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-test-documents 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
5 versions - Latest release: over 3 years ago - 6 dependent packages - 6 dependent repositories - 1,407 stars on GitHub
Top 9.8% on repo1.maven.org
fr.pilato.elasticsearch.crawler:fscrawler-framework 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
5 versions - Latest release: over 3 years ago - 9 dependent packages - 6 dependent repositories - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-elasticsearch-client-base 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
4 versions - Latest release: over 3 years ago - 6 dependent packages - 6 dependent repositories - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-tika 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
5 versions - Latest release: over 3 years ago - 3 dependent packages - 6 dependent repositories - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-crawler-fs 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
5 versions - Latest release: over 3 years ago - 2 dependent packages - 6 dependent repositories - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-rest 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
5 versions - Latest release: over 3 years ago - 3 dependent packages - 6 dependent repositories - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-crawler-abstract 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
5 versions - Latest release: over 3 years ago - 4 dependent packages - 6 dependent repositories - 1,407 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-distribution 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
4 versions - Latest release: over 3 years ago - 1,404 stars on GitHub
com.digitalpebble.stormcrawler:storm-crawler-tika 1.12.1 💰
Tika-based parser bolt for StormCrawler
37 versions - Latest release: almost 7 years ago - 4 dependent repositories - 923 stars on GitHub
com.github.greengerong:prerender-java 1.6.4
Sonatype helps open source projects to set up Maven repositories on https://oss.sonatype.org/
11 versions - Latest release: almost 10 years ago - 11 dependent repositories - 122 stars on GitHub
fr.pilato.elasticsearch.river:fsriver 1.3.1
FS River Plugin offers a simple way to index local files into elasticsearch.
11 versions - Latest release: about 11 years ago - 3 dependent repositories - 1,403 stars on GitHub
com.github.qlone:retrofit-crawler 1.1.3
help you get a html like json,thanks retrofit and Jsoup
4 versions - Latest release: over 4 years ago - 4 stars on GitHub
lt.tokenmill.crawling:data-model 0.2.0
Framework to simplify news crawling
1 version - Latest release: almost 8 years ago - 5 dependent packages - 2 dependent repositories - 21 stars on GitHub
com.digitalpebble.stormcrawler:storm-crawler-urlfrontier 2.11 💰
URL Frontier resources for StormCrawler
12 versions - Latest release: over 1 year ago - 1 dependent repositories - 925 stars on GitHub
org.codelibs.fess:fess-crawler-db 1.0.12
Fess Crawler is Crawler Framework.
13 versions - Latest release: almost 9 years ago - 29 stars on GitHub
it.skrape:skrapeit-http-fetcher 1.2.2 💰
A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data...
15 versions - Latest release: about 3 years ago - 1 dependent package - 3 dependent repositories - 840 stars on GitHub
org.codelibs.fess:fess-crawler-db-mysql 1.0.12
Fess Crawler is Crawler Framework.
13 versions - Latest release: almost 9 years ago - 25 stars on GitHub
de.hs-heilbronn.mi:crawler4j-core 5.1.3
Open Source Web Crawler for Java
26 versions - Latest release: about 1 month ago - 3 dependent packages - 1 dependent repositories - 17 stars on GitHub
com.crawljax:vips_selenium 5.2.3 💰
Crawling web applications through even-driven, dynamic analysis, and reconstruction of the UI sta...
9 versions - Latest release: over 2 years ago - 1 dependent package - 526 stars on GitHub
com.crawljax:crawljax-test-utils 3.5.1 💰
This artifact offers Crawljax plugin developers a convenient way to test their plugins by offerin...
9 versions - Latest release: over 11 years ago - 2 dependent packages - 13 dependent repositories - 526 stars on GitHub
com.crawljax:crawljax-examples 5.2.3 💰
Crawljax usage example
18 versions - Latest release: over 2 years ago - 1 dependent repositories - 526 stars on GitHub
com.crawljax:crawljax-cli 5.2.3 💰
The Crawljax command line interface
18 versions - Latest release: over 2 years ago - 2 dependent repositories - 526 stars on GitHub
com.crawljax:crawljax 2.2 💰
Crawling Ajax applications through dynamic analysis and reconstruction of the UI state changes. C...
4 versions - Latest release: over 12 years ago - 3 dependent packages - 14 dependent repositories - 526 stars on GitHub
com.crawljax:crawljax-core 5.2.3 💰
Crawling web applications through even-driven, dynamic analysis, and reconstruction of the UI sta...
18 versions - Latest release: over 2 years ago - 7 dependent packages - 29 dependent repositories - 526 stars on GitHub
com.crawljax.plugins.archetypes:crawljax-plugins-archetype 3.5.1 💰
Generates a Crawljax project template.
11 versions - Latest release: over 11 years ago - 526 stars on GitHub
com.crawljax:crawljax-parent-pom 5.2.3 💰
Crawling web applications through even-driven, dynamic analysis, and reconstruction of the UI sta...
19 versions - Latest release: over 2 years ago - 526 stars on GitHub
com.crawljax:crawljax-web 3.5.1 💰
Crawling Ajax applications through dynamic analysis and reconstruction of the UI state changes. C...
3 versions - Latest release: over 11 years ago - 526 stars on GitHub
com.digitalpebble.stormcrawler:storm-crawler-warc 1.12.1 💰
WARC resources for StormCrawler
31 versions - Latest release: almost 7 years ago - 1 dependent repositories - 923 stars on GitHub
com.digitalpebble.stormcrawler:storm-crawler-external 1.12.1 💰
A collection of resources for building low-latency, scalable web crawlers on Apache Storm.
29 versions - Latest release: almost 7 years ago - 922 stars on GitHub
com.digitalpebble:storm-crawler-solr 0.7 💰
Solr resources for StormCrawler
2 versions - Latest release: almost 10 years ago - 923 stars on GitHub
it.skrape:skrapeit-mock-mvc-extension 1.2.2 💰
A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data...
15 versions - Latest release: about 3 years ago - 840 stars on GitHub
it.skrape:skrapeit-dsl 1.2.2 💰
A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data...
15 versions - Latest release: about 3 years ago - 4 dependent packages - 840 stars on GitHub
it.unimi.di.law:bubing 0.9.15
BUbiNG is an open-source Java fully distributed crawler
8 versions - Latest release: over 6 years ago - 1 dependent package - 2 dependent repositories - 74 stars on GitHub
lt.tokenmill.crawling:analysis-ui 0.2.0
Framework to simplify news crawling
1 version - Latest release: almost 8 years ago - 21 stars on GitHub
it.skrape:skrapeit-base-fetcher 1.2.2 💰
A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data...
15 versions - Latest release: about 3 years ago - 6 dependent packages - 1 dependent repositories - 840 stars on GitHub
it.skrape:skrapeit-ktor-extension 1.2.2 💰
A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data...
15 versions - Latest release: about 3 years ago - 840 stars on GitHub
org.codelibs.fess:fess-crawler-es 14.19.1
Fess Crawler is Crawler Framework.
135 versions - Latest release: 4 months ago - 1 dependent package - 5 dependent repositories - 29 stars on GitHub
lt.tokenmill.crawling:crawling-framework 0.2.0
Framework to simplify news crawling
1 version - Latest release: almost 8 years ago - 21 stars on GitHub
it.skrape:skrapeit-assertions 1.2.2 💰
A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data...
15 versions - Latest release: about 3 years ago - 2 dependent packages - 840 stars on GitHub
it.skrape:skrapeit-html-parser 1.2.2 💰
A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data...
15 versions - Latest release: about 3 years ago - 4 dependent packages - 2 dependent repositories - 840 stars on GitHub
com.github.luohaha:jlitespider 0.4.3
jlitespider is a lite distributed Java spider framework
3 versions - Latest release: over 8 years ago - 145 stars on GitHub
com.norconex.collectors:norconex-collector-filesystem 2.9.1
Norconex Filesystem Collector walks through directories and files and extracts their content for ...
15 versions - Latest release: almost 4 years ago - 2 dependent repositories - 22 stars on GitHub
com.exasol:error-code-crawler-maven-plugin 2.0.5
Crawler for Exasol error codes.
28 versions - Latest release: 8 days ago - 1 stars on GitHub
com.digitalpebble:storm-crawler-core 0.7 💰
Storm-Crawler core Java API.
4 versions - Latest release: almost 10 years ago - 6 dependent packages - 4 dependent repositories - 923 stars on GitHub
de.hs-heilbronn.mi:crawler4j-examples-postgres 5.1.3
Open Source Web Crawler for Java
26 versions - Latest release: about 1 month ago - 17 stars on GitHub
org.codelibs.fess:fess-crawler-webdriver 3.16.0
This is a library for Web Driver.
109 versions - Latest release: over 3 years ago - 1 dependent repositories - 29 stars on GitHub
com.digitalpebble.stormcrawler:storm-crawler-aws 1.12.1 💰
AWS resources for StormCrawler
37 versions - Latest release: almost 7 years ago - 923 stars on GitHub
de.hs-heilbronn.mi:crawler4j-frontier-urlfrontier 5.1.3
Open Source Web Crawler for Java
15 versions - Latest release: about 1 month ago - 1 dependent package - 1 dependent repositories - 17 stars on GitHub
com.github.peterbencze:serritor 2.1.1
An open source web crawler framework built upon Selenium and written in Java
12 versions - Latest release: about 5 years ago - 1 dependent repositories - 32 stars on GitHub
de.hs-heilbronn.mi:crawler4j-parent 5.1.3
Open Source Web Crawler for Java
25 versions - Latest release: about 1 month ago - 17 stars on GitHub
lt.tokenmill.crawling:crawler 0.2.0
Framework to simplify news crawling
1 version - Latest release: almost 8 years ago - 1 dependent repositories - 21 stars on GitHub
com.norconex.collectors:norconex-committer-neo4j 2.0.0
Neo4j implementation of Norconex Committer.
5 versions - Latest release: over 3 years ago - 2 stars on GitHub
org.webjars.bower:ngMeta 1.0.0
WebJar for ngMeta
1 version - Latest release: almost 9 years ago - 153 stars on GitHub
de.hs-heilbronn.mi:crawler4j-boms 5.1.3
Open Source Web Crawler for Java
26 versions - Latest release: about 1 month ago - 17 stars on GitHub
com.github.btheu.estivate:estivate 0.4.3
Estivate fills pojo from HTML with CSS Query Syntax and annotations
15 versions - Latest release: almost 2 years ago - 2 dependent packages - 1 dependent repositories - 2 stars on GitHub
de.hs-heilbronn.mi:crawler4j-with-urlfrontier 5.1.3
Open Source Web Crawler for Java
15 versions - Latest release: about 1 month ago - 1 dependent package - 2 dependent repositories - 17 stars on GitHub
de.hs-heilbronn.mi:crawler4j-commons 5.1.3
Open Source Web Crawler for Java
26 versions - Latest release: about 1 month ago - 4 dependent packages - 1 dependent repositories - 17 stars on GitHub
de.hs-heilbronn.mi:crawler4j-frontier-hsqldb 5.1.3
Open Source Web Crawler for Java
26 versions - Latest release: about 1 month ago - 1 dependent package - 1 dependent repositories - 17 stars on GitHub
org.webjars.npm:ng-meta 0.3.9
WebJar for ng-meta
1 version - Latest release: about 9 years ago - 153 stars on GitHub
de.hs-heilbronn.mi:crawler4j-examples 5.1.3
Open Source Web Crawler for Java
26 versions - Latest release: about 1 month ago - 17 stars on GitHub