Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

Top 8.7% on nuget.org
Top 8.2% downloads on nuget.org
Top 7.0% dependent repos on nuget.org

nuget.org : html2xhtml

Html2Xhtml is a .NET 4.0 library for converting HTML to XHTML licensed under GPLv2 or above. I tested Html2Xhtml in the local reconstruction of a large online database of the European Union. Tidy/Tidy.NET would not even produce valid output most of the time, Chilkat's HTML-to-XML was a bit slow and produced strange results (misplaced, missing, unexplainable elements). In attempt to find a free, fast and reliable conversion tool I created this library. It converts 2 - 4x faster than all other libraries I tested. Html2Xhtml, combined with the power of LINQ to XML, is an excellent tool for all large-scale data extraction and web crawling scenarios.

Registry - Homepage - JSON
purl: pkg:nuget/html2xhtml
Keywords: Tidy, xhtml
License:
Latest release: almost 13 years ago
First release: almost 13 years ago
Dependent packages: 1
Dependent repositories: 1
Downloads: 36,711 total
Last synced: 28 days ago

    Loading...
    Readme
    Loading...