odidx - indexer of document files
Contents
Description
This manual page documents briefly the odidx commands.
The command odidx is a utility for indexing files on the local file system. This command is useful for a
full-text search system of a Web site. Supported file format are plain text and HTML. Supported character
encoding are US-ASCII and ISO-8859-1. The URI of each document is specified with the path of its file.
Attributes named as title and date are given to each documents. When a document is already existing in
the database, it is registered if its last modification time is newer, or it is ignored. Modification
times are stored in the sub database `_mtime' in the main database directory. Score information are
stored in the sub database `_score' in the main database directory. This command is used in the above
format. name specifies a database name. dir specifies a directory name.
Examples
For example, to register files which are under `/home/mikio' and whose suffix are `.txt', `.c', or `.h',
perform the following command.
odidxregister-tsuf".txt,.c,.h"-hsuf""casket/home/mikio
Thus, to retrieve documents which include `unix' and `posix' and show the top 8 terms, perform the
following command.
odmgrsearch-max8-hcasket"unixposix"Name
odidx - indexer of document files
Options
A summary of options is included below. For a complete description, see the file:///usr/share/doc/qdbm-
doc/spex.html#odeumcli.
-lfile
read a file and get list of paths of files to register. If `-' is specified, the standard input is
read.
-wmaxnum
specify the max number of words to be stored in the document database.
-tsufsufs
specify suffixes of plain text files in comma separated format. The default is `-tsuf .txt,.text'.
-hsufsufs
specify suffixes of HTML files in comma separated format. The default is `-hsuf .html,.htm'.
See Also
qdbm(3), odmgr(1), odeum(3), odopen(3).
Synopsis
odidxregister [-l file] [-wmax num] [-tsuf sufs] [-hsuf sufs] name[dir]
odidxrelatenameodidxpurgename