Plugin name (old name) | Description | Default fields | Available Fields |
BibTexPlugin (BibTexPlug) | Plugin that imports BibTex files. Inherits from SplitTextFile. | Title, Creator, Abstract, Author, Booktitle, Chapter, Copyright, Date, Edition, Editor, EntryType Journal, Keywords, Month, Note, Number, Pages, Publisher, PublisherAddress, Volume, Year | |
BookPlugin (BookPlug) | Plugin that imports Humanity Library collection files. A simplification of HBPlugin. Inherits from AutoExtractMetadata. | | |
CONTENTdmPlugin (CONTENTdmPlug) | Plugin that imports RDF files in exported CONTENTdm collections. Inherits from ConvertBinaryFile, ReadXMLFile. | | |
ConvertToRogPlugin (ConvertToRogPlug) | ?? Inherits from RogPlugin. | | |
CSVPlugin (CSVPlug) | Plugin that imports files in comma-separated value format. A new document will be created for each line of the file. Inherits from SplitTextFile. | | |
DatabasePlugin (DBPlug) | Plugin that extracts records from databases (requires additional Perl setup). Inherits from AutoExtractMetadata. | | (arbitrary metadata field names based on Database configuration file) |
DSpacePlugin (DSpacePlug) | Plugin that imports DSpace archive format. Inherits from BasePlugin. | | |
EmailPlugin (EMAILPlug) | Plugin that imports saved email files (not MS OutLook format though). Inherits from SplitTextFile. | Date, DateText, From, FromAddr, FromName, Headers, Subject, Title (based on subject, from, and date), To | |
ExcelPlugin (ExcelPlug) | Plugin that imports Microsoft Excel files. Inherits from ConvertBinaryFile. | | (all fields as in HTMLPlug) |
FavouritesPlugin (FavouritesPlug) | Plugin that imports Internet Explorer Favourites files. Inherits from ReadTextFile. | | |
FOXPlugin (FOXPlug) | Plugin that imports FOX database files. Inherits from BasePlugin. | | |
HBPlugin (HBPlug) | Plugin that imports an HTML book directory. Used by Humanity Library collection. Inherits from BasePlugin. | | |
HTMLPlugin (HTMLPlug) | Plugin that imports HTML files. Inherits from ReadTextFile, HBPlugin. | Title, URL | Author, Creator, Email (others as found in the -metadata_fields option) |
HTMLImagePlugin (W3ImgPlug) | Plugin that imports HTML files, creating a Greenstone document for each image in the web page. Inherits from HTMLPlugin. | | |
ImagePlugin (ImagePlug) | Plugin that imports JPEG, GIF etc see http://www.imagemagick.org/www/formats.html. Inherits from BasePlugin, ImageConverter. | Image, ImageHeight, ImageSize, ImageType, ImageWidth, ScreenHeight, screenicon, ScreenSize, ScreenType, ScreenWidth, Source, srclink, srcicon, Thumb, ThumbHeight, ThumbType, ThumbWidth | |
IndexPlugin (IndexPlug) | Plugin that processes an index.txt file, which lists all files to be included in the collection, plus additional metadata for those documents. Inherits from BasePlugin. | as in the index.txt file | (use metadata.xml files instead of using this plugin) |
ISISPlugin (ISISPlug) | Plugin that imports CDS/ISIS database files. Inherits from SplitTextFile. | | |
LaTeXPlugin (LaTeXPlug) | Plugin that imports LaTeX files. Inherits from ReadTextFile. | | |
LOMPlugin (LOMPlug) | Plugin that imports LOM (Learning Object Metadata) files. Inherits from ReadTextFile. | | |
MARCPlugin (MARCPlug) | Plugin that imports MARC metadata. Inherits from SplitTextFile. | Creator, Description, MarcIdentifier, MarcSource, URL, Publisher, Relation, Rights, Subject, Title, Type | (Metadata fields as in the marctodc.txt file) |
MARCXMLPlugin (MARCXMLPlug) | Plugin that imports MARC metadata in XML format. Inherits from ReadXMLFile, ReadTextFile. | | |
MediaInfoOGVPlugin | Plugin for importing OGV movie files. Requires Mediainfo (mediainfo.sourceforge.net) to be installed to extract metadata. | | |
MediaWikiPlugin (MediaWikiPlug) | Plugin that imports MediaWiki web pages. Inherits from HTMLPlugin. | | |
MetadataCSVPlugin (MetadataCSVPlug) | Plugin that imports metadata in CSV (comma separated value) format. The Filename field in the CSV file is used to determine which document the metadata belongs to. Inherits from BasePlugin. | | |
MP3Plugin (MP3Plug) | Plugin that imports MP3 audio files. Inherits from BasePlugin. | | |
NulPlugin (NULPlug) | Plugin that imports dummy files (.nul). These may generated when bibliographic databases are 'exploded'. Inherits from BasePlugin. | | |
OAIPlugin (OAIPlug) | Plugin that imports Open Archives Initiatives (OAI) data. Inherits from ReadXMLFile, ReadTextFile. | URL, (all metadata in .oai markup file) | |
OggVorbisPlugin(OggVorbisPlug) | Plugin that imports Ogg Vorbis Files. Inherits from BasePlugin. | | |
OpenDocumentPlugin (OpenDocumentPlug) | Plugin that imports OASIS OpenDocument format documents (used by OpenOffice 2.0). Inherits from ReadXMLFile. | | |
PagedImagePlugin (PagedImgPlug) | Plugin that imports sequences of image files (formats as for ImagePlug), with optional associated plain text. Each document requires an item file listing the image/text files that make up the document. Inherits from ReadXMLFile, ReadTextFile, ImageConverter. | Image, ImageHeight, ImageSize, ImageType, ImageWidth, ScreenHeight, screenicon, ScreenSize, ScreenType, ScreenWidth, Source, srclink, srcicon, Thumb, ThumbHeight, ThumbType, ThumbWidth | |
PDFPlugin (PDFPlug) | Plugin that imports PDF files. Inherits from ConvertBinaryFile. | | (all fields in HTMLPlug) |
PostScriptPlugin (PSPlug) | Plugin that imports Postscript files. Inherits from ConvertBinaryFile. | Title | Date, Pages, (all fields in TextPlug) |
PowerPointPlugin (PPTPlug) | Plugin that imports Microsoft Powerpoint files. Inherits from ConvertBinaryFile. | | (all fields in HTMLPlug) |
ProCitePlugin (ProCitePlug) | Plugin that imports ProCite files. Inherits from SplitTextFile. | | |
RealMediaPlugin (RealMediaPlug) | Plugin that imports RealMedia files. Inherits from BasePlugin. | | |
ReferPlugin (ReferPlug) | Plugin that imports Refer files. Inherits from SplitTExtFile. | Abstract, BookConfOnly, Booktitle, Copyright, Creator, Date, Editor, Keywords, Journal, JournalsOnly, Number, Pages, Publisher, Publisheraddr, Report, Title, Volume | |
RogPlugin (RogPlug) | Plugin that imports .rog or .mdb files. Inherits from BasePlugin. | | |
RTFPlugin (RTFPlug) | Plugin that imports RTF files. Inherits from ConvertBinaryFile. | | (all fields in HTMLPlug) |
SourceCodePlugin (SRCPlug) | Plugin that imports source code (C/C++, Perl, Shell). Inherits from ReadTextFile. | Title, filename, includes, class, classdecl | |
StructuredHTMLPlugin (StructuredHTMLPlug) | Plugin that imports structured HTML documents, splitting them into sections based on style information. Inherits from HTMLPlugin. | | |
TextPlugin (TEXTPlug) | Plugin that imports plain text files. Inherits from ReadTextFile. | Title | |
UnknownConverterPlugin | Plugin that imports files with a user-specified file extension. You must provide the command to an installed command line tool that will do the processing on the file to convert the file to text or html. Used to import and index the full text content of files that Greenstone can't otherwise handle. Inherits from UnknownPlugin. | (-exec_cmd and convert_to ) | |
UnknownPlugin | (UnknownPlug) Plugin that imports files with a user-specified file extension. No processing is done on the file. Instead a fictional document is created and the file is attached to that document. Used to import files that Greenstone can't otherwise handle. Inherits from BasePlugin. | (as given in the -assoc_field plugin argument) | |
WordPlugin (WordPlug) | Plugin that imports Microsoft Word documents. Inherits from ConvertBinaryFile. | | (all fields in HTMLPlug) |
ZIPPlugin (ZIPPlug) | Plugin that unpacks compressed or archive file formats and sends content down plugin pipeline. Handled formats include gzip (.gz, .z, .tgz, .taz), bzip (.bz), bzip2 (.bz2), zip (.zip, .jar) and tar (.tar). Relies on the appropriate utility being present: gunzip, bunzip, bunzip2, unzip, tar. Inherits from BasePlugin. | | |