Catmandu::HTML - Modules for handling HTML data within the Catmandu framework
Available Modules
Catmandu::Exporter::HTML
Serialize HTML data
Catmandu::Importer::HTML
Parse HTML data
Catmandu::Fix::html_filter_tag
Filter HTML tags by name and optionally group them
Catmandu::Fix::html_filter_type
Filter HTML by type
Catmandu::Fix::html_text
Select only the literal text values from the HTML
Copyright And License
The Perl software is copyright (c) 2018 by Patrick Hochstenbach. This is free software; you can
redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.
perl v5.36.0 2023-02-14 Catmandu::HTML(3pm)
Name
Catmandu::HTML - Modules for handling HTML data within the Catmandu framework
See Also
This module is based on Catmandu.
Synopsis
Command line client "catmandu":

    # Convert between HTML and JSON
    catmandu convert HTML to JSON < ex/test.html > data/test.json
    catmandu convert JSON to HTML < data/test.json > ex/test.html

    # Keep only the text
    curl -s http://librecat.org |
        catmandu convert HTML to Text --fix "html_text()"

    # Keep only the anchors and join their text with newlines
    curl -s http://librecat.org |
        catmandu convert HTML to Text --fix "html_filter_tag(a); html_text(join:\"\n\")"

    # Extract the meta tags, grouped by name, as YAML
    curl -s http://librecat.org |
        catmandu convert HTML to YAML --fix "html_filter_tag(meta, group_by:name)"

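
The inline fixes above need escaped quotes once a fix takes a string argument. A minimal sketch of an alternative, assuming the fixes are stored in a fix file and its path is passed to --fix: the file name anchors.fix is hypothetical, and the conversion is skipped when the catmandu client is not installed.

```shell
# Write the fixes to a file, one fix per line (anchors.fix is a hypothetical name).
# The quoted heredoc delimiter keeps the shell from touching "\n".
cat > anchors.fix <<'EOF'
html_filter_tag(a)
html_text(join:"\n")
EOF

# Run the conversion only if the catmandu client is available on this system.
if command -v catmandu >/dev/null 2>&1; then
    curl -s http://librecat.org |
        catmandu convert HTML to Text --fix anchors.fix
fi
```

Keeping the fixes in a file avoids the nested shell quoting and makes the same pipeline reusable across several conversions.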
See documentation of modules for more examples.
