Phind

Phind is a browsing classifier that builds a hierarchy of the phrases found in the text. The hierarchy can be browsed via an applet.

The following table lists all of the configuration options available for Phind.

Option	Description	Value
Phind Options
text	The text used to build the phrase hierarchy.	Default: section:Title,section:text
title	The metadata field used to describe each document.	Default: Title
buttonname	The label for the classifier screen and button in the navigation bar. The default is the metadata element specified with -metadata.	Default: Phrase
language	The language or languages to use when building the hierarchy. Languages are identified by two-letter language codes such as en (English), es (Spanish), and fr (French). The value is a regular expression, so 'en|fr' (English or French) and '..' (match any language) are valid.	Default: en
savephrases	If set, the phrase information will be stored as text in the given file. It is probably a good idea to use an absolute path.
suffixmode	The smode parameter passed to the phrase extraction program. A value of 0 means that stopwords are ignored; a value of 1 means that stopwords are used.	Default: 1 Range: 0,1
min_occurs	The minimum number of times a phrase must appear in the text to be included in the phrase hierarchy.	Default: 2 Range: 1,
thesaurus	Name of a thesaurus stored in Phind format in the collection's etc directory.
untidy	Don't remove working files.
Options Inherited from BaseClassifier
buttonname	The label for the classifier screen and button in the navigation bar. The default is the metadata element specified with -metadata.
no_metadata_formatting	Don't do any automatic metadata formatting (for sorting).
builddir	Where to put the built indexes.
outhandle	The file handle to write output to.	Default: STDERR
verbosity	Controls the quantity of output. 0=none, 3=lots.	Default: 2
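
Because the language option is a regular expression rather than a fixed list of codes, it can help to see how a few patterns select documents. The following is a minimal sketch in Python, not Phind's own code: the helper name language_selected is hypothetical, and it assumes the pattern is applied as an unanchored search against each document's two-letter language code. Phind's actual matching rules (anchoring, case sensitivity) may differ.

```python
import re


def language_selected(pattern: str, doc_language: str) -> bool:
    """Return True if doc_language matches the -language pattern.

    Hypothetical helper: assumes an unanchored regular-expression
    search, which may not be exactly how Phind applies the option.
    """
    return re.search(pattern, doc_language) is not None


# Try the patterns mentioned in the table against three language codes.
for pattern in ("en", "en|fr", ".."):
    selected = [code for code in ("en", "es", "fr")
                if language_selected(pattern, code)]
    print(f"{pattern!r} selects {selected}")
```

Under these assumptions, 'en' selects only English documents, 'en|fr' selects English and French documents, and '..' selects documents with any two-letter language code.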