Greenstone 3.05 tutorial exercises (July 2012)

  • For the latest versions of the tutorials, for Greenstone 2 and 3, see the main tutorials page.
  • For other old versions, including Spanish, French and Russian translations of the 2.70 version, see the Old Tutorials page.

Building a small collection of HTML files

  • Running the Greenstone Librarian Interface
  • Starting a new collection
  • Adding documents to the collection
  • Building the collection
  • Viewing the extracted metadata
  • Setting up a shortcut in the Librarian interface

A simple image collection

  • Adding Title and Description metadata
  • Change Format Features to display new metadata
  • Changing the size of image thumbnails
  • Adding a browsing classifier based on Description metadata
  • Creating a searchable index based on Description metadata

A collection of Word and PDF files

  • Viewing the extracted metadata
  • Manually adding metadata to documents in a collection
  • Document Plugins
  • Search indexes
  • Browsing classifiers

Formatting the Word and PDF collection

  • Tidying up the default format statement
  • Linking to Greenstone version or original version of documents
  • Making bookshelves show how many items they contain
  • Displaying multi-valued metadata
  • Advanced multi-valued metadata

Processing newer versions of PDF with PDFBox

Enhanced PDF handling

  • Modes in the Librarian Interface
  • Splitting PDFs into sections
  • Using image format
  • Using process_exp to control document processing (advanced)
  • Opening PDF files with query terms highlighted

Enhanced Word document handling

  • Using Windows native scripting
  • Modes in the Librarian Interface
  • Defining styles
  • Removing pre-defined table of contents
  • Extracting document properties as metadata

Associated files: combining different versions of the same document together

A large collection of HTML files—Tudor

  • Extracting more metadata from the HTML
  • Looking at different views of the files in the Gather and Enrich panels

Enhanced collection of HTML files—Tudor

  • Adding hierarchically-structured metadata and a Hierarchy classifier
  • Adding a hierarchical phrase browser (PHIND)
  • Partitioning the full-text index based on metadata values
  • Controlling the building process

Formatting the HTML collection—Tudor

Section tagging for HTML documents

Downloading files from the web

Pointing to documents on the web

Bibliographic collection

  • Using fielded searching
  • Exploding the database
  • Reformatting the collection to use the exploded metadata

CDS/ISIS collection

Looking at a multimedia collection

Building a multimedia collection

  • Manually correcting metadata
  • Browsing by media type
  • Suppressing dummy text
  • Using AZCompactList rather than List
  • Making bookshelves show how many items they contain
  • Adding a Phind phrase browser
  • Branding the collection with an image
  • Using UnknownPlugin
  • Cleaning up a title browser using regular expressions
  • Using non-standard macro files
  • Using different icons for different media types

  • Building a full-size version of the collection

Scanned image collection

  • Grouping documents by series title and displaying dates within each group
  • Browsing documents by Date.
  • Searching at page level

Advanced scanned image collection

  • Adding another newspaper to the collection
  • XML based item file

Open Archives Initiative (OAI) collection

  • Tweaking the presentation with format statements

Setting up your Greenstone OAI Server

  • Validating the Greenstone OAI server

Downloading over OAI

  • Downloading using the Librarian Interface
  • Downloading using the command line
  • Building the downloaded documents in GLI

Use METS as Greenstone's Internal Representation

Moving a collection from DSpace to Greenstone

  • Adding indexing and browsing capabilities to match DSpace's

Moving a collection from Greenstone to DSpace

  • Using Greenstone from the command line

Editing metadata sets

  • Running GEMS
  • Creating a new metadata set
  • Adding a new element to a metadata set

Building and searching with different indexers

  • Build with Lucene
  • Search with Lucene
  • Build with MGPP
  • Search with MGPP
  • Use search mode hotkeys with query term
  • A quick reference of the search mode hotkeys in MGPP