nuget.org "extraction" keyword
View the packages on the nuget.org package registry that are tagged with the "extraction" keyword.
refinerynet 1.0.1
Refinery is a tool to extract and transform semi-structured data from Excel spreadsheets of ...1 version - Latest release: about 2 years ago - 243 downloads total - 1 stars on GitHub - 1 maintainer
dotnet-extract 1.1.0
A .NET global tool to extract embedded resource files from a .NET assembly.11 versions - Latest release: almost 2 years ago - 6.38 thousand downloads total - 0 stars on GitHub - 1 maintainer
palworlddataextractor 0.1.3
Extract data from Palworld .pak file1 version - Latest release: over 125 years ago - 6 stars on GitHub
Top 9.7% on nuget.org
6 versions - Latest release: about 5 years ago - 1 dependent package - 3 dependent repositories - 12.6 thousand downloads total - 12 stars on GitHub - 1 maintainer
clipboard.openxml 1.2.1
A c# library that provides the ability to extract text from various document file formats, e.g. p...6 versions - Latest release: about 5 years ago - 1 dependent package - 3 dependent repositories - 12.6 thousand downloads total - 12 stars on GitHub - 1 maintainer
documentatom 1.0.21
Core classes for DocumentAtom. Download the DocumentAtom package specific to the content types y...17 versions - Latest release: 5 months ago - 2.94 thousand downloads total - 38 stars on GitHub - 1 maintainer
caseload.textextractor.pdf 1.5.0
A c# library that provides the ability to extract text from various document file formats, e.g. p...4 versions - Latest release: 5 months ago - 692 downloads total - 12 stars on GitHub - 1 maintainer
aspose.words 25.7.0
Aspose.Words for .NET is a powerful, high-performance document processing library for creating, e...166 versions - Latest release: 21 days ago - 51 dependent packages - 31.4 million downloads total - 2 maintainers
Top 3.2% on nuget.org
13 versions - Latest release: over 6 years ago - 4 dependent packages - 9 dependent repositories - 123 thousand downloads total - 224 stars on GitHub - 1 maintainer
openscraping 1.4.2
Turn unstructured HTML pages into structured data. The OpenScraping library can extract informati...13 versions - Latest release: over 6 years ago - 4 dependent packages - 9 dependent repositories - 123 thousand downloads total - 224 stars on GitHub - 1 maintainer
dokuextractorcore.model 1.1.0
DokuExtractorCore.Model is the data model for DokuExtractorCore1 version - Latest release: about 6 years ago - 1 dependent package - 1 dependent repositories - 1.22 thousand downloads total - 10 stars on GitHub - 1 maintainer
clipboard.pdf 1.2.1
A c# library that provides the ability to extract text from various document file formats, e.g. p...6 versions - Latest release: about 5 years ago - 1 dependent package - 1 dependent repositories - 12.7 thousand downloads total - 12 stars on GitHub - 1 maintainer
documentatom.powerpoint 1.0.26
DocumentAtom provides a light, fast library for breaking input PowerPoint (pptx) documents into c...11 versions - Latest release: 5 months ago - 1.67 thousand downloads total - 38 stars on GitHub - 1 maintainer
anglesharp.contentextraction 2.0.0
Content extraction via text density5 versions - Latest release: 6 months ago - 2.17 thousand downloads total - 0 stars on GitHub - 1 maintainer
stringssharp 1.0.3
Extract strings from files4 versions - Latest release: almost 7 years ago - 1 dependent repositories - 4.51 thousand downloads total - 2 stars on GitHub - 1 maintainer
documentatom.word 1.0.26
DocumentAtom provides a light, fast library for breaking input Word (docx) documents into constit...11 versions - Latest release: 5 months ago - 1.71 thousand downloads total - 38 stars on GitHub - 1 maintainer
soenneker.compression.sevenzip 3.0.479 💰
A utility library for 7zip compression related operations474 versions - Latest release: 15 days ago - 64.3 thousand downloads total - 2 stars on GitHub - 1 maintainer
faceonnx.addons 4.0.4.2
Face recognition and analytics library based on deep neural networks and ONNX runtime.31 versions - Latest release: 7 months ago - 38.6 thousand downloads total - 239 stars on GitHub - 2 maintainers
faceonnx.gpu 4.0.0.2
Face recognition and analytics library based on deep neural networks and ONNX runtime. Gpu implem...29 versions - Latest release: about 1 year ago - 1 dependent package - 45.8 thousand downloads total - 239 stars on GitHub - 2 maintainers
faceonnx.addons.gpu 4.0.0.2
Face recognition and analytics library based on deep neural networks and ONNX runtime. Gpu implem...27 versions - Latest release: about 1 year ago - 27.4 thousand downloads total - 239 stars on GitHub - 2 maintainers
faceonnx 4.0.4.2
Face recognition and analytics library based on deep neural networks and ONNX runtime.43 versions - Latest release: 7 months ago - 3 dependent packages - 143 thousand downloads total - 239 stars on GitHub - 2 maintainers
boilerpipe.net 1.2.0
The boilerpipe library provides algorithms to detect and remove the surplus "clutter" (boilerplat...1 version - Latest release: over 125 years ago - 7 stars on GitHub
documentparser 1.0.2 💰
Simple C# library for extracting text and metadata from .docx, .pptx, and .xlsx files3 versions - Latest release: over 125 years ago - 1.26 thousand downloads total - 5 stars on GitHub - 1 maintainer
documentatom.text 1.0.26
DocumentAtom provides a light, fast library for breaking input text documents into constituent pa...9 versions - Latest release: 5 months ago - 1.27 thousand downloads total - 38 stars on GitHub - 1 maintainer
documentatom.excel 1.0.28
DocumentAtom provides a light, fast library for breaking input Excel (xlsx) documents into consti...14 versions - Latest release: 5 months ago - 2.2 thousand downloads total - 38 stars on GitHub - 1 maintainer
documentatom.markdown 1.0.26
DocumentAtom provides a light, fast library for breaking input markdown documents into constituen...9 versions - Latest release: 5 months ago - 1.3 thousand downloads total - 38 stars on GitHub - 1 maintainer
oldschool.i18n.lib 1.4.0
Finds localizable messages in *.fs and *.cs files by looking for calls such as I18n.Translate("me...2 versions - Latest release: over 6 years ago - 1.53 thousand downloads total - 4 stars on GitHub - 1 maintainer
dotnet-oldschool-i18n 1.4.0
Finds localizable messages in *.fs and *.cs files by looking for calls such as I18n.Translate("me...2 versions - Latest release: over 6 years ago - 1.6 thousand downloads total - 4 stars on GitHub - 1 maintainer
oldschool.i18n 1.3.0
Finds localizable messages in *.fs and *.cs files by looking for calls such as I18n.Translate("me...4 versions - Latest release: about 7 years ago - 5.8 thousand downloads total - 4 stars on GitHub - 1 maintainer
artesian.sdk 7.6.0
Artesian SDK library104 versions - Latest release: about 1 month ago - 69.1 thousand downloads total - 0 stars on GitHub - 1 maintainer
Top 2.8% on nuget.org
49 versions - Latest release: almost 8 years ago - 3 dependent packages - 30 dependent repositories - 374 thousand downloads total - 1 maintainer
accord.audio 3.8.0
Process, transforms, filters and handle audio signals for machine learning and statistical applic...49 versions - Latest release: almost 8 years ago - 3 dependent packages - 30 dependent repositories - 374 thousand downloads total - 1 maintainer
accusoft.formfix.net 6.0.581
The FormFixâ„¢ component contains all the necessary objects, methods, and properties it takes for y...1 version - Latest release: over 3 years ago - 1 dependent package - 1.33 thousand downloads total - 1 maintainer
Top 6.9% on nuget.org
2 versions - Latest release: over 125 years ago - 1 dependent package - 6 dependent repositories
lessmsi 1.0.8
LessMSI is a utility with a graphical user interface and a command line interface that can be use...2 versions - Latest release: over 125 years ago - 1 dependent package - 6 dependent repositories
pdfix.sdk 8.7.2
~ PDF Tagging and Accessibility ~ Logical Content Extraction ~ PDF to HTML conversion ~ PDF Edit...70 versions - Latest release: 9 days ago - 1 dependent package - 40.4 thousand downloads total - 1 maintainer
accusoft.formsapi.net 6.0.465
Forms API is a standard forms processing solution built using the various FormSuiteâ„¢ components i...1 version - Latest release: over 3 years ago - 1 dependent package - 890 downloads total - 1 maintainer
accusoft.formdirector.net 6.0.467
The FormDirectorâ„¢ component contains all the necessary objects, methods, and properties it takes ...1 version - Latest release: over 3 years ago - 1 dependent package - 1.09 thousand downloads total - 1 maintainer
text2data.com.sentimentanalysis.api 3.6.1
Text Analytics and Sentiment Analysis API - allows to perform following: Sentiment analysis, Docu...7 versions - Latest release: almost 6 years ago - 4.98 thousand downloads total - 1 maintainer
twileloop.spider 2.0.0
A simplified wrapper over Selenium web driver to easily do web scrapping5 versions - Latest release: over 1 year ago - 776 downloads total - 1 stars on GitHub - 1 maintainer
idylsdk 1.0.7
NET SDK for Mountain Fog's Idyl service. Idyl is a natural language processing webservice that pr...7 versions - Latest release: over 125 years ago
cliel.dotnet.library.package 1.0.0
The extension methods to extraction from a string value.1 version - Latest release: about 3 years ago
oakrey.excel 1.0.0
Package contains: Framework for reading and interacting with Excel spreadsheets, including ExcelC...1 version - Latest release: 3 months ago - 192 downloads total - 1 maintainer
iconutilities 1.0.2
A collection of methods for injecting or extracting icons from files on Windows operating systems.3 versions - Latest release: about 4 years ago - 2 dependent repositories - 7.39 thousand downloads total - 5 stars on GitHub - 1 maintainer
mountainfog.idylami.sdk 1.0.0
The Idyl AMI SDK facilitates integration with the Idyl AMI entity extraction engines.1 version - Latest release: over 125 years ago
apiverve.api.keywordextractor 1.0.8
Keyword Extractor is a simple tool for extracting keywords from a text. It returns the keywords a...5 versions - Latest release: about 1 year ago - 640 downloads total - 0 stars on GitHub - 1 maintainer
accusoft.formsuite.net 6.0.512
FormSuiteâ„¢ eliminates slow, costly manual data entry and data extraction for standard printed for...1 version - Latest release: over 3 years ago - 839 downloads total - 1 maintainer
apiverve.api.metadataextractor 1.1.9
Metadata Extractor is a simple tool for extracting metadata from web pages. It returns the meta t...12 versions - Latest release: 4 months ago - 1.65 thousand downloads total - 0 stars on GitHub - 1 maintainer
prosol.html.tagsprovider 2.0.0
TagsProvider is a tool for extracting HTML tags from a string, in event-driven way. Helps t...14 versions - Latest release: over 1 year ago - 1 dependent package - 2.67 thousand downloads total - 1 maintainer
Top 9.5% on nuget.org
4 versions - Latest release: almost 8 years ago - 7 dependent repositories - 158 thousand downloads total - 1 maintainer
accord.audition 3.8.0
Process, transforms, filters and handle audio signals for machine learning and statistical applic...4 versions - Latest release: almost 8 years ago - 7 dependent repositories - 158 thousand downloads total - 1 maintainer
activepdf.toolkit.ultimate 10.3.0
Toolkit With More Powerful Functionalities For PDF Rasterization, PDF Redaction, PDF Data Extract...3 versions - Latest release: almost 5 years ago - 5.05 thousand downloads total - 1 maintainer
prosol.webscrap 2.0.2
A HTML parser, for extracting the text from a web pages, with CSS selectors.4 versions - Latest release: over 1 year ago - 2.27 thousand downloads total - 1 maintainer
webscrap.json 1.0.0-rc.3
A HTML parser, for extracting the text from a web pages, with CSS selectors.4 versions - Latest release: over 125 years ago
devovercome.webscrap 1.0.0-rc.5
A HTML parser, for extracting the text from a web pages, with CSS selectors.2 versions - Latest release: over 125 years ago
impower.documentunderstanding.activities 1.168.1
Package Description7 versions - Latest release: over 125 years ago
documenttextextractor 1.0.10 💰
Simple C# library for extracting text and metadata from .docx, .pptx, and .xlsx files8 versions - Latest release: 10 months ago - 2.7 thousand downloads total - 5 stars on GitHub - 1 maintainer
prosol.web.html.tagsprovider 1.0.1
A tool for extracting tags from HTML, via push-notifications.1 version - Latest release: over 125 years ago
documentatom.pdf 1.0.26
DocumentAtom provides a light, fast library for breaking input PDF documents into constituent par...14 versions - Latest release: 5 months ago - 2.02 thousand downloads total - 38 stars on GitHub - 1 maintainer
txtraktor 0.0.2
Information extraction library2 versions - Latest release: over 4 years ago - 1 dependent package - 1.29 thousand downloads total - 1 maintainer
squid-box.sevenzipsharp.lite 1.6.2.24
Wraps 7z.dll or any compatible one and makes use of LZMA SDK. Excludes creation of self-extractin...6 versions - Latest release: over 1 year ago - 1 dependent package - 47.8 thousand downloads total - 282 stars on GitHub - 1 maintainer
squid-box.sevenzipsharp 1.3.318
Wraps 7z.dll or any compatible one and makes use of LZMA SDK, includes self-extraction functional...30 versions - Latest release: over 4 years ago - 4 dependent packages - 798 thousand downloads total - 282 stars on GitHub - 1 maintainer
tabula.json 0.1.5 💰
Extract tables from PDF files (port of tabula-java using PdfPig). Json writer.9 versions - Latest release: 4 months ago - 25.5 thousand downloads total - 187 stars on GitHub - 1 maintainer
tabula.csv 0.1.5 💰
Extract tables from PDF files (port of tabula-java using PdfPig). Csv and Tsv writers.9 versions - Latest release: 4 months ago - 27.3 thousand downloads total - 187 stars on GitHub - 1 maintainer
tabula 0.1.5 💰
Extract tables from PDF files (port of tabula-java using PdfPig).9 versions - Latest release: 4 months ago - 2 dependent packages - 422 thousand downloads total - 186 stars on GitHub - 1 maintainer
ghsoftware.worddoctextextractor 1.0.1
Extract text from legacy Microsoft Word .doc files (Word 97, Word 6.0).2 versions - Latest release: 10 days ago - 0 downloads total - 0 stars on GitHub - 1 maintainer
palworlddataextractor.abstractions 0.5.0
Abstractions for the PalworldDataExtractor.Lib project7 versions - Latest release: over 1 year ago - 1 dependent package - 1.36 thousand downloads total - 6 stars on GitHub - 1 maintainer
documentatom.typedetection 1.0.29
DocumentAtom provides a light, fast library for breaking input documents into constituent parts (...12 versions - Latest release: 4 months ago - 1.63 thousand downloads total - 38 stars on GitHub - 1 maintainer
refinery 1.0.6
Refinery is a tool to extract and transform semi-structured data from Excel spreadsheets of ...6 versions - Latest release: almost 2 years ago - 1.23 thousand downloads total - 1 stars on GitHub - 2 maintainers
dokuextractorcore 1.3.1
Easily extract data from PDF documents4 versions - Latest release: about 4 years ago - 3.27 thousand downloads total - 10 stars on GitHub - 1 maintainer
parseasy 1.0.0
A modern and efficient alternative to Regex for text pattern matching and extraction1 version - Latest release: 5 months ago - 231 downloads total - 17 stars on GitHub - 1 maintainer
data.dump.sql 1.0.0
A Sql Server implmentation of the Data.Dump engine for easy creation of data extractions based on...4 versions - Latest release: over 5 years ago - 3.95 thousand downloads total - 6 stars on GitHub - 1 maintainer
documentatom.ocr 1.0.23
DocumentAtom provides a light, fast library for breaking input image documents into constituent p...7 versions - Latest release: 5 months ago - 1.44 thousand downloads total - 38 stars on GitHub - 1 maintainer
documentatom.image 1.0.25
DocumentAtom provides a light, fast library for breaking input images into constituent text parts...16 versions - Latest release: 5 months ago - 2.75 thousand downloads total - 38 stars on GitHub - 1 maintainer
netpalette 0.0.2 💰
a library to extract color palettes from images. based on skiasharp so it works on all modern ...2 versions - Latest release: about 1 year ago - 758 downloads total - 5 stars on GitHub - 1 maintainer
documentatom.texttools 1.0.22
DocumentAtom provides a light, fast library for breaking input documents into constituent parts (...1 version - Latest release: 5 months ago - 249 downloads total - 38 stars on GitHub - 1 maintainer
soenneker.compression.tar.xz 3.0.11 💰
A utility library dealing with Tar and XZ (tar.xz) extraction/archiving and (de)compression11 versions - Latest release: 19 days ago - 636 downloads total - 0 stars on GitHub - 1 maintainer
textextractor 1.0.0
Library for text extraction. Supports doc, docx, xlsx, odt, pdf, rtf, html, rar, zip,1 version - Latest release: almost 8 years ago - 1 dependent repositories - 8.76 thousand downloads total - 3 stars on GitHub - 1 maintainer
data.dump.postgres 1.0.0
A PostgreSql implementation of the Data.Dump engine for easy creation of data extractions based o...1 version - Latest release: over 5 years ago - 661 downloads total - 6 stars on GitHub - 1 maintainer
data.dump.engine 1.0.0
A C# data dump engine for easy creation of data extractions based on any dataset or poco, with a ...5 versions - Latest release: over 5 years ago - 2 dependent packages - 1 dependent repositories - 6.09 thousand downloads total - 6 stars on GitHub - 1 maintainer
camelot 0.0.1-alpha002
A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig).2 versions - Latest release: over 4 years ago - 1 dependent package - 31 stars on GitHub
camelot.imageprocessing.opencvsharp4 0.0.1-alpha002
A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig). Con...2 versions - Latest release: over 4 years ago - 31 stars on GitHub
file-drill 1.1.3 💰
file-drill is a powerful command-line tool for reading, classifying, and extracting structured da...6 versions - Latest release: 25 days ago - 739 downloads total - 0 stars on GitHub - 1 maintainer
hattrick.model.mssql 4.0.0
Microsoft SQL Server model extraction utility.15 versions - Latest release: over 2 years ago - 11.7 thousand downloads total - 0 stars on GitHub - 1 maintainer
taucode.data.text 2.0.1
TauCode library for text data representation8 versions - Latest release: 9 months ago - 1 dependent package - 5.04 thousand downloads total - 0 stars on GitHub - 1 maintainer
webspark.slurper 3.3.0
A flexible data extraction and transformation library for XML, JSON, CSV, and HTML.6 versions - Latest release: 27 days ago - 493 downloads total - 0 stars on GitHub - 1 maintainer
Top 6.7% on nuget.org
6 versions - Latest release: 5 months ago - 6 dependent repositories - 97.2 thousand downloads total - 387 stars on GitHub - 1 maintainer
toxy 2.5.0 💰
.NET text extraction framework6 versions - Latest release: 5 months ago - 6 dependent repositories - 97.2 thousand downloads total - 387 stars on GitHub - 1 maintainer
caseload.textextractor.openxml 1.5.0
A c# library that provides the ability to extract text from various document file formats, e.g. p...4 versions - Latest release: 5 months ago - 655 downloads total - 12 stars on GitHub - 1 maintainer
caseload.textextractor.text 1.4.0
A c# library that provides the ability to extract text from various document file formats, e.g. p...3 versions - Latest release: 5 months ago - 502 downloads total - 12 stars on GitHub - 1 maintainer
pdftextextractor 1.0.1 💰
A simple C# shell wrapper for the wonderful pdfplumber library in Python to extract text from .PD...2 versions - Latest release: over 1 year ago - 2.68 thousand downloads total - 4 stars on GitHub - 1 maintainer
palworlddataextractor.lib 0.5.0
Extract data from Palworld .pak file16 versions - Latest release: over 1 year ago - 3.44 thousand downloads total - 6 stars on GitHub - 1 maintainer
Top 8.2% on nuget.org
1 version - Latest release: over 125 years ago - 1 dependent package - 9 stars on GitHub
clipboard.ifilter 1.0.0
A c# library that provides the ability to extract text from various document file formats, e.g. p...1 version - Latest release: over 125 years ago - 1 dependent package - 9 stars on GitHub
clipboard.text 1.2.1
A c# library that provides the ability to extract text from various document file formats, e.g. p...6 versions - Latest release: about 5 years ago - 1 dependent package - 1 dependent repositories - 12.4 thousand downloads total - 12 stars on GitHub - 1 maintainer
stringfighter.stringextractors 1.0.1
Simple but functional string functions that I think might be useful.2 versions - Latest release: almost 5 years ago - 1.08 thousand downloads total - 1 stars on GitHub - 1 maintainer
misturtee 1.0.0-alpha2
MVC6 middleware that lets you extract claims from the current request, using RegEx, JsonPath, Key...3 versions - Latest release: over 7 years ago - 0 stars on GitHub
rosette_api 1.33.0
.Net (C#) Binding for Babel Street Analytics API43 versions - Latest release: 4 months ago - 1 dependent repositories - 151 thousand downloads total - 1 maintainer
hattrick.model.sql 4.0.0
sql model extraction utility.2 versions - Latest release: over 125 years ago - 1 dependent package - 0 stars on GitHub
Related Keywords
text
30
pdf
29
extract
24
word
21
excel
21
powerpoint
20
parsing
18
processing
17
xlsx
16
parser
16
parse
16
data
15
docx
15
csharp
14
dotnet
14
semantic
13
doc
13
pptx
13
cell
12
ai
12
llm
12
ppt
11
rag
11
png
11
keynote
11
chunk
11
pages
11
numbers
11
txt
11
chunking
11
etl
11
extraction-transformation-and-loading
11
extractor
11
html
9
tool
8
classification
8
export
7
automation
7
documents
7
document
6
invoice
6
recognition
6
json
6
detection
6
sql
5
table-extraction
5
pdfs
5
pdfparser
5
pdf-table-extraction
5
pdf-table-extract
5
netstandard
5
extraction-engine
5
extracting-tables
5
extract-table
5
pdfpig
5
table
5
embeddings
5
forms
5
1040
4
ACORD
4
faceonnx
4
face-recognition
4
omr
4
face-onnx
4
face-detection
4
face-analytics-library
4
software
4
mark
4
capture
4
deep-neural-networks
4
cpu
4
antispoofing
4
neural-networks
4
onnx
4
estimation
4
beauty
4
landmarks
4
read
4
gender
4
age
4
reader
4
form
4
face
4
ocr
4
system
4
auto
4
automated
4
automatic
4
sdk
4
structured
4
FormSuite
4
optical
4
field
4
Accusoft
4
template
4
align
4
translation
4
identify
4
insurance
4
certificate
4