Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

rubygems.org "scraper" keyword

ronin-web-spider 0.1.0 💰
ronin-web-spider is a collection of common web spidering routines using the spidr gem.
3 versions - Latest release: over 1 year ago - 1 dependent package - 146 dependent repositories - 5.04 thousand downloads total - 7 stars on GitHub - 1 maintainer
Top 2.8% on rubygems.org
huginn_agent 0.6.1 💰
Helpers for making new Huginn Agents
8 versions - Latest release: over 6 years ago - 77 dependent packages - 107 dependent repositories - 67 thousand downloads total - 41,699 stars on GitHub - 2 maintainers
radio5 0.2.3
Adapter for Radiooooo private API.
7 versions - Latest release: 3 months ago - 1.34 thousand downloads total - 0 stars on GitHub - 1 maintainer
network_profile 0.3.2
Extract profile metadata from various social-media-profiles, such as Twitter, XING, Github, Stack...
5 versions - Latest release: 6 months ago - 1 dependent repositories - 7.95 thousand downloads total - 0 stars on GitHub - 1 maintainer
mechanizer 2.1
Mechanize & NokoGiri Wrapper for Automated WebScraping and WebPage Parsing. Light, easy to use wr...
5 versions - Latest release: almost 6 years ago - 8 dependent packages - 9 dependent repositories - 9.72 thousand downloads total - 1 stars on GitHub - 1 maintainer
marvel_101 0.4.0
A CLI that scrapes marvel.com for info on popular Marvel characters and teams.
10 versions - Latest release: over 6 years ago - 20.6 thousand downloads total - 1 stars on GitHub - 1 maintainer
regis-lector 0.9.0
Super simple Moodle site scraper for Frank Matranga's high school
1 version - Latest release: almost 7 years ago - 2.26 thousand downloads total - 1 stars on GitHub - 1 maintainer
scraper_rb 0.1.2
Ruby wrapper for Prompt API's Scraper Checker API
4 versions - Latest release: over 3 years ago - 2 dependent repositories - 5.78 thousand downloads total - 0 stars on GitHub - 1 maintainer
Top 2.9% on rubygems.org
wombat 3.0.0 💰
Generic Web crawler with a DSL that parses structured data from web pages
34 versions - Latest release: over 1 year ago - 5 dependent packages - 55 dependent repositories - 205 thousand downloads total - 1,303 stars on GitHub - 1 maintainer
event-crawler 0.1.0 💰
Generic Web crawler with a DSL that parses event-related data from web pages
1 version - Latest release: over 12 years ago - 4.5 thousand downloads total - 1,303 stars on GitHub - 1 maintainer
s3x 0.1.0
Scrape public AWS S3 buckets with ease.
1 version - Latest release: 20 days ago - 140 downloads total - 0 stars on GitHub - 1 maintainer
spidr_epg 1.0.0 💰
Spidr is a versatile Ruby web spidering library that can spider a site, multiple domains, certain...
1 version - Latest release: about 11 years ago - 3.63 thousand downloads total - 792 stars on GitHub - 1 maintainer
Top 2.8% on rubygems.org
spidr 0.7.1 💰
Spidr is a versatile Ruby web spidering library that can spider a site, multiple domains, certain...
28 versions - Latest release: 4 months ago - 15 dependent packages - 89 dependent repositories - 290 thousand downloads total - 792 stars on GitHub - 1 maintainer
simple-scraper 1.0.0
Library was built on top of nokogiri, parallel and httparty gems that do most of the work
1 version - Latest release: about 5 years ago - 1 dependent repositories - 2.6 thousand downloads total - 10 stars on GitHub - 1 maintainer
Top 3.6% on rubygems.org
kimurai 1.4.0
Modern web scraping framework written in Ruby and based on Capybara/Nokogiri
8 versions - Latest release: over 5 years ago - 3 dependent packages - 80 dependent repositories - 155 thousand downloads total - 999 stars on GitHub - 1 maintainer
pets_seeking_people 0.3.0
An awesome scraper gem that helps users locate cats and dogs available for adoption in their area.
5 versions - Latest release: over 6 years ago - 1 dependent repositories - 8 thousand downloads total - 1 stars on GitHub - 1 maintainer
chanCrawlerGem 0.2.1
This gem scowers 4chan (or any chan copy theoretically) searching for threads that contains key...
3 versions - Latest release: about 3 years ago - 5.56 thousand downloads total - 4 stars on GitHub - 1 maintainer
instagram-crawler 0.3.0
Crawl instagram photos, posts and videos for download.
4 versions - Latest release: about 5 years ago - 2 dependent repositories - 7.33 thousand downloads total - 196 stars on GitHub - 1 maintainer
scraypa 0.1.1
Web scraper with support for proxy, Tor and javascript.
1 version - Latest release: over 6 years ago - 1 dependent repositories - 2.25 thousand downloads total - 7 stars on GitHub - 1 maintainer
pagemunch 1.0.0
A client for the PageMunch web crawler API
2 versions - Latest release: over 7 years ago - 1 dependent repositories - 5.37 thousand downloads total - 3 stars on GitHub - 1 maintainer
serp_scraper 1.0.4
SERP Scraper is a ruby library that extracts keyword rankings from Google.
6 versions - Latest release: almost 7 years ago - 10.4 thousand downloads total - 2 stars on GitHub - 1 maintainer
sjc_bus_schedule 0.0.2
Makes life easy for who wants to get the bus schedule from SJC website
2 versions - Latest release: over 4 years ago - 1 dependent repositories - 3.99 thousand downloads total - 0 stars on GitHub - 1 maintainer
nhkore 0.3.14
Scrapes NHK News Web (Easy) for the word frequency (core list) for Japanese language learners. In...
17 versions - Latest release: almost 2 years ago - 1 dependent repositories - 22.9 thousand downloads total - 13 stars on GitHub - 1 maintainer
webinspector 0.5.0 💰
Ruby gem to inspect completely a web page. It scrapes a given URL, and returns you its meta, link...
10 versions - Latest release: almost 9 years ago - 2 dependent repositories - 24.1 thousand downloads total - 290 stars on GitHub - 1 maintainer
instagrammer 0.3.2
Instagrammer lets you fetch Instagram user info and posts
9 versions - Latest release: over 4 years ago - 1 dependent repositories - 16.9 thousand downloads total - 5 stars on GitHub - 1 maintainer
movie_helper 0.2.0
This application scrapes data off of the website, https://agoodmovietowatch.com/, and uses it to ...
2 versions - Latest release: almost 4 years ago - 1 dependent repositories - 3.84 thousand downloads total - 0 stars on GitHub - 1 maintainer
duckduckgo 0.1.8
An unofficial DuckDuckGo search API.
9 versions - Latest release: 3 months ago - 2 dependent packages - 17.4 thousand downloads total - 7 stars on GitHub - 1 maintainer
scrapula 0.6.3
Scrapula is a library for scraping web pages that simplifies some of the common actions t...
1 version - Latest release: over 8 years ago - 1 dependent repositories - 2.93 thousand downloads total - 1 stars on GitHub - 1 maintainer
marmiton_crawler 1.0.3
A web scrawler to get a Marmiton's recipe
2 versions - Latest release: over 7 years ago - 4.64 thousand downloads total - 6 stars on GitHub - 1 maintainer
recipe_scraper 2.2.4
A web scraper to get recipe data just by its web url
5 versions - Latest release: over 5 years ago - 1 dependent package - 4 dependent repositories - 8.71 thousand downloads total - 6 stars on GitHub - 1 maintainer
html2rss 0.9.0 💰
Give the URL to scrape and some CSS selectors. Get a RSS::Rss instance in return.
20 versions - Latest release: almost 4 years ago - 4 dependent repositories - 34.9 thousand downloads total - 110 stars on GitHub - 1 maintainer
cricos_scrape 2.2
Scrape Institutions, Courses, Contacts from CRICOS
3 versions - Latest release: almost 4 years ago - 1 dependent repositories - 6.8 thousand downloads total - 5 stars on GitHub - 1 maintainer
history_scraper 1.0.6
Scraps events, births and deaths that occured during a specific day of history.
1 version - Latest release: almost 7 years ago - 1 dependent repositories - 2.32 thousand downloads total - 1 stars on GitHub - 1 maintainer
tanakai 1.7.3
Maintained fork of Kimurai, a modern web scraping framework written in Ruby and based on Capybara...
7 versions - Latest release: 5 months ago - 1 dependent repositories - 12.6 thousand downloads total - 260 stars on GitHub - 1 maintainer
rayyan-scrapers 0.2.0 removed
Rayyan scrapers that fetch external references like PubMed
7 versions - Latest release: almost 2 years ago - 1 dependent package - 3 dependent repositories - 14.4 thousand downloads total - 2 stars on GitHub - 1 maintainer