logo
Free, unlimited AI code reviews that run on commit
git-lrc git-lrc GitHub Install Now We'd appreciate a star git-lrc - Free, unlimited AI code reviews that run on commit | Product Hunt git-lrc - Free, unlimited AI code reviews that run on commit | Product Hunt

Catmandu::HTML - Modules for handling HTML data within the Catmandu framework

Authors

       Patrick Hochstenbach, "patrick.hochstenbach at ugent.be"

Available Modules

       Catmandu::Exporter::HTML
           Serialize HTML data

       Catmandu::Importer::HTML
           Parse HTML data

       Catmandu::Fix::html_filter_tag
           Filter HTML by name and optional group them

       Catmandu::Fix::html_filter_type
           Filter HTML by type

       Catmandu::Fix::html_text
           Select only the literal text values from the HTML

Name

       Catmandu::HTML - Modules for handling HTML data within the Catmandu framework

See Also

       This module is based on Catmandu

Synopsis

       Command line client "catmandu":

         catmandu convert HTML to JSON < ex/test.html > data/test.json

         catmandu convert JSON to HTML < data/test.json > ex/test.html

         # Keep the text
         curl -s http://librecat.org |
           catmandu convert HTML to Text --fix "html_text()"

         # Filter out the anchors
         curl -s http://librecat.org |
           catmandu convert HTML to Text --fix "html_filter_tag(a); html_text(join:\"\n\")"

         # Filter out the meta tags as YAML
         curl -s http://librecat.org |
           catmandu convert HTML to YAML --fix "html_filter_tag(meta, group_by:name)"

       See documentation of modules for more examples.

See Also