Greenstone 3.05 tutorial exercises (July 2012)
- These work with Greenstone 3.05. Print version
- For the latest versions of the tutorials, for Greenstone 2 and 3, see the main tutorials page.
- For other old versions, including Spanish, French and Russian translations of the 2.70 version, see the Old Tutorials page.
Building a small collection of HTML files
- Running the Greenstone Librarian Interface
- Starting a new collection
- Adding documents to the collection
- Building the collection
- Viewing the extracted metadata
- Setting up a shortcut in the Librarian interface
- Adding Title and Description metadata
- Change Format Features to display new metadata
- Changing the size of image thumbnails
- Adding a browsing classifier based on Description metadata
- Creating a searchable index based on Description metadata
A collection of Word and PDF files
- Viewing the extracted metadata
- Manually adding metadata to documents in a collection
- Document Plugins
- Search indexes
- Browsing classifiers
Formatting the Word and PDF collection
- Tidying up the default format statement
- Linking to Greenstone version or original version of documents
- Making bookshelves show how many items they contain
- Displaying multi-valued metadata
- Advanced multi-valued metadata
Processing newer versions of PDF with PDFBox
- Modes in the Librarian Interface
- Splitting PDFs into sections
- Using image format
- Using process_exp to control document processing (advanced)
- Opening PDF files with query terms highlighted
Enhanced Word document handling
- Using Windows native scripting
- Modes in the Librarian Interface
- Defining styles
- Removing pre-defined table of contents
- Extracting document properties as metadata
Associated files: combining different versions of the same document together
A large collection of HTML files—Tudor
- Extracting more metadata from the HTML
- Looking at different views of the files in the Gather and Enrich panels
Enhanced collection of HTML files—Tudor
- Adding hierarchically-structured metadata and a Hierarchy classifier
- Adding a hierarchical phrase browser (PHIND)
- Partitioning the full-text index based on metadata values
- Controlling the building process
Formatting the HTML collection—Tudor
Section tagging for HTML documents
Downloading files from the web
Pointing to documents on the web
- Using fielded searching
- Exploding the database
- Reformatting the collection to use the exploded metadata
Looking at a multimedia collection
Building a multimedia collection
- Manually correcting metadata
- Browsing by media type
- Suppressing dummy text
- Using AZCompactList rather than List
- Making bookshelves show how many items they contain
- Adding a Phind phrase browser
- Branding the collection with an image
- Using UnknownPlugin
- Cleaning up a title browser using regular expressions
- Using non-standard macro files
- Using different icons for different media types
- Building a full-size version of the collection
- Grouping documents by series title and displaying dates within each group
- Browsing documents by Date.
- Searching at page level
Advanced scanned image collection
- Adding another newspaper to the collection
- XML based item file
Open Archives Initiative (OAI) collection
- Tweaking the presentation with format statements
Setting up your Greenstone OAI Server
- Validating the Greenstone OAI server
- Downloading using the Librarian Interface
- Downloading using the command line
- Building the downloaded documents in GLI
Use METS as Greenstone's Internal Representation
Moving a collection from DSpace to Greenstone
- Adding indexing and browsing capabilities to match DSpace's
Moving a collection from Greenstone to DSpace
- Using Greenstone from the command line
- Running GEMS
- Creating a new metadata set
- Adding a new element to a metadata set
Building and searching with different indexers
- Build with Lucene
- Search with Lucene
- Build with MGPP
- Search with MGPP
- Use search mode hotkeys with query term
- A quick reference of the search mode hotkeys in MGPP