en:tutorials
Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| en:tutorials [2017/10/05 02:42] – anupama | en:tutorials [2025/06/12 09:03] (current) – [Greenstone3] anupama | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| - | <TABAREA tabs="Greenstone3, | + | <tabbox |
| - | < | + | **Greenstone tutorial exercises (Dec 2024, Jun 2025)** |
| - | * These work with Greenstone 3.08. [[http:// | + | * These work with the tested |
| - | * Each tutorial that requires sample files has a link to a zip download of the files. Alternatively, | + | |
| + | * Each tutorial that requires sample files has a link to a zip download of the files. Alternatively, | ||
| * Tutorials for older versions of Greenstone can be found at the [[legacy: | * Tutorials for older versions of Greenstone can be found at the [[legacy: | ||
| + | |||
| + | |||
| **[[http:// | **[[http:// | ||
| Line 28: | Line 31: | ||
| **[[http:// | **[[http:// | ||
| + | |||
| * Extracting embedded metadata | * Extracting embedded metadata | ||
| * Adding in a map view to browsing | * Adding in a map view to browsing | ||
| Line 45: | Line 49: | ||
| * Tidying up the default format statement | * Tidying up the default format statement | ||
| - | * Linking to Greenstone version or original version of documents | + | * Linking to the Greenstone version or original version of documents |
| * Making bookshelves show how many items they contain | * Making bookshelves show how many items they contain | ||
| * Displaying multi-valued metadata | * Displaying multi-valued metadata | ||
| * Advanced multi-valued metadata | * Advanced multi-valued metadata | ||
| - | |||
| - | |||
| - | **[[http:// | ||
| - | |||
| **[[http:// | **[[http:// | ||
| - | * Modes in the Librarian Interface | ||
| - | * Splitting PDFs into sections | ||
| * Using image format | * Using image format | ||
| * Using process_exp to control document processing (advanced) | * Using process_exp to control document processing (advanced) | ||
| + | * Customising the table of contents section heading display | ||
| * Opening PDF files with query terms highlighted | * Opening PDF files with query terms highlighted | ||
| Line 71: | Line 70: | ||
| * Removing pre-defined table of contents | * Removing pre-defined table of contents | ||
| * Extracting document properties as metadata | * Extracting document properties as metadata | ||
| + | * Processing docx files | ||
| **[[http:// | **[[http:// | ||
| - | //This tutorial demonstrates how to link different versions of the same document together in Greenstone.// | + | '' |
| * Associating one document with another | * Associating one document with another | ||
| * Linking to associated documents | * Linking to associated documents | ||
| Line 138: | Line 138: | ||
| * Using different icons for different media types | * Using different icons for different media types | ||
| * Building a full-size version of the collection | * Building a full-size version of the collection | ||
| + | * Adding an image collage browser | ||
| Line 168: | Line 169: | ||
| * Downloading using the command line | * Downloading using the command line | ||
| * Building the downloaded documents in GLI | * Building the downloaded documents in GLI | ||
| + | |||
| + | |||
| + | **[[http:// | ||
| + | |||
| + | * Working with DjVu documents in Greenstone | ||
| + | * Extracting the text from DjVu documents with DjVuLibre' | ||
| + | * Processing DjVu documents with the UnknownConverterPlugin | ||
| + | * Associating an icon with DjVu documents in Greenstone | ||
| Line 199: | Line 208: | ||
| * Use search mode hotkeys with query term | * Use search mode hotkeys with query term | ||
| * A quick reference of the search mode hotkeys in MGPP | * A quick reference of the search mode hotkeys in MGPP | ||
| + | |||
| **[[http:// | **[[http:// | ||
| Line 205: | Line 215: | ||
| * Incrementally deleting some documents from a collection | * Incrementally deleting some documents from a collection | ||
| * Editing a document' | * Editing a document' | ||
| - | * Incrementally | + | * Automatic incremental |
| - | ===== Customization ===== | ||
| **[[http:// | **[[http:// | ||
| Line 259: | Line 268: | ||
| * Adding functionality to the quick search box | * Adding functionality to the quick search box | ||
| * Adding the library name and login links | * Adding the library name and login links | ||
| - | * Interface language files | ||
| - | </TAB> | + | **[[http://wiki.greenstone.org/ |
| - | <!-- ############################################################################################## | + | |
| - | ################################################################################################# | + | |
| - | ################################################## | + | |
| - | < | + | |
| - | ====== Greenstone 2.87 tutorial | + | |
| + | * Creating a user account | ||
| + | * Accessing WebSwing GLI: a Greenstone Librarian Interface (GLI) application accessible over your browser | ||
| + | * Setting up and working with the tutorial sample files through Webswing GLI | ||
| + | |||
| + | <tabbox Greenstone2> | ||
| + | ** Greenstone 2.87 tutorial exercises (September 2017) ** | ||
| * These work with Greenstone 2.87. [[http:// | * These work with Greenstone 2.87. [[http:// | ||
| * For installation and setup instructions, | * For installation and setup instructions, | ||
| Line 439: | Line 448: | ||
| * Use the Depositor to do incremental addition | * Use the Depositor to do incremental addition | ||
| * Batch addition with the Depositor | * Batch addition with the Depositor | ||
| - | </ | ||
| - | </ | ||
| - | <!-- | ||
| - | USING THE ICECITE TOOL TO CONVERT FROM PDF TO TXT | ||
| - | 1. Need Java 8 for compiling and probably also for running Icecite | + | </tabbox> |
| - | <code> | + | |
| - | export JAVA_HOME=/opt/ | + | |
| - | export PATH=$JAVA_HOME/ | + | |
| - | </code> | + | |
| - | 2. Get and compile icecite, following the instructions at https:// | ||
| - | < | ||
| - | git clone https:// | ||
| - | cd icecite | ||
| - | git pull --recurse-submodules | ||
| - | cd pdf-parent/ | ||
| - | mvn install | ||
| - | </ | ||
| - | |||
| - | 3. Run icecite, general instructions at https:// | ||
| - | < | ||
| - | cd ../../ | ||
| - | cd icecite/ | ||
| - | java -jar target/ | ||
| - | </ | ||
| - | Examples: | ||
| - | greenstone@bedrock: | ||
| - | |||
| - | greenstone@bedrock: | ||
| - | |||
| - | greenstone@bedrock: | ||
| - | |||
| - | (Also tried with input file pdf01.pdf from the Reports collection) | ||
| - | |||
| - | |||
| - | 4. If you see the exception | ||
| - | --- | ||
| - | Exception in thread " | ||
| - | at org.apache.pdfbox.pdmodel.encryption.PDEncryption.< | ||
| - | at org.apache.pdfbox.pdfparser.PDFParser.prepareDecryption(PDFParser.java: | ||
| - | at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java: | ||
| - | at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java: | ||
| - | at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java: | ||
| - | at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java: | ||
| - | at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java: | ||
| - | at parser.pdfbox.core.PdfStreamEngine.processFile(PdfStreamEngine.java: | ||
| - | at parser.pdfbox.PdfBoxParser.parse(PdfBoxParser.java: | ||
| - | at cli.PdfParserCommandLine.parse(PdfParserCommandLine.java: | ||
| - | at cli.PdfParserCommandLine.processFile(PdfParserCommandLine.java: | ||
| - | at cli.PdfParserCommandLine.process(PdfParserCommandLine.java: | ||
| - | at cli.PdfParserCommandLine.main(PdfParserCommandLine.java: | ||
| - | Caused by: java.lang.ClassNotFoundException: | ||
| - | at java.net.URLClassLoader$1.run(URLClassLoader.java: | ||
| - | at java.net.URLClassLoader$1.run(URLClassLoader.java: | ||
| - | at java.security.AccessController.doPrivileged(Native Method) | ||
| - | at java.net.URLClassLoader.findClass(URLClassLoader.java: | ||
| - | at java.lang.ClassLoader.loadClass(ClassLoader.java: | ||
| - | at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java: | ||
| - | at java.lang.ClassLoader.loadClass(ClassLoader.java: | ||
| - | ... 13 more | ||
| - | |||
| - | --- | ||
| - | |||
| - | Then: | ||
| - | a. Obtain bouncycastle (encryption? | ||
| - | |||
| - | Download both jar files listed under the " | ||
| - | |||
| - | b. Then see https:// | ||
| - | for how to run a java programme when you have multiple jar files on classpath, as you can't run java with both -cp and -jar. | ||
| - | |||
| - | greenstone@bedrock: | ||
| - | --> | ||
en/tutorials.1507171325.txt.gz · Last modified: 2017/10/05 02:42 by anupama
