User Tools

Site Tools


en:plugin:unknownconverterplugin

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Next revisionBoth sides next revision
en:plugin:unknownconverterplugin [2020/08/07 07:43] anupamaen:plugin:unknownconverterplugin [2021/08/19 03:33] anupama
Line 12: Line 12:
  
 **The UnknownConverterPlugin has been officially available since Greenstone 3.09, so that 3.09 users can also start using Tika with the plugin, by** **The UnknownConverterPlugin has been officially available since Greenstone 3.09, so that 3.09 users can also start using Tika with the plugin, by**
 +
 +
 +0. following the quick steps [[#download_jre_8_and_install_locally_into_your_gs3|here]] to get a JRE 8 (32 bit only for Windows) and have it locally installed in your Greenstone 3, as the bundled version is JRE 7 and not compatible with tika-app-1.24.1.jar
  
 1. creating a subfolder called "tika" inside their GS3-install-dir/gs2build/ext, 1. creating a subfolder called "tika" inside their GS3-install-dir/gs2build/ext,
Line 17: Line 20:
 2. downloading the Apache-Tika binary jar file from https://www.apache.org/dyn/closer.cgi/tika/tika-app-1.24.1.jar (or by visiting http://trac.greenstone.org/browser/main/trunk/greenstone2/ext/tika/tika-app-1.24.1.jar and clicking the link labelled "downloading" there), then dropping the downloaded jar file into GS3/gs2build/ext/tika 2. downloading the Apache-Tika binary jar file from https://www.apache.org/dyn/closer.cgi/tika/tika-app-1.24.1.jar (or by visiting http://trac.greenstone.org/browser/main/trunk/greenstone2/ext/tika/tika-app-1.24.1.jar and clicking the link labelled "downloading" there), then dropping the downloaded jar file into GS3/gs2build/ext/tika
  
-3. and then configuring an UnknownConverterPlugin instance for any collection that needs docx processing as follows:+3. and then configuring an UnknownConverterPlugin instance for any collection that needs docx processing as follows. Note that **windows users** need to type ''%GSDLHOME%'' in place of ''$GSDLHOME'' in the following and type the full name of the tika-app jar file''tika-app-1.24.1.jar''.
  
 {{ :en:plugin:unknownconverterpluginwithtika.png?direct&850 |}} {{ :en:plugin:unknownconverterpluginwithtika.png?direct&850 |}}
Line 28: Line 31:
  
 For every doctype to be processed by UnknownConverterPlugin, the plugin requires you to have a command line tool installed that can convert that doctype to text or html. Apache-Tika supplies that, being the actual command line tool that can convert from a textual doctype to text or html. Next time you have a collection containing doctypes for which Greenstone does not provide existing plugins, experiment with the combination of the UnknownConverterPlugin with Tika. For every doctype to be processed by UnknownConverterPlugin, the plugin requires you to have a command line tool installed that can convert that doctype to text or html. Apache-Tika supplies that, being the actual command line tool that can convert from a textual doctype to text or html. Next time you have a collection containing doctypes for which Greenstone does not provide existing plugins, experiment with the combination of the UnknownConverterPlugin with Tika.
 +
 +
 +===== Download JRE 8 and install locally into your GS3 =====
 +GS3 comes bundled with JRE 7, but tika-app-1.24.1.jar needs JRE 8+.\\ The following steps will have you quickly set up with a JRE 8 local to your Greenstone 3 installation.
 +
 +1. **For Windows:**
 +
 +a. Use a File Explorer to do the following on the file system:
 +- Rename <your-GS3>\packages\jre to <your-GS3>\packages\jre.orig
 +- If you're on Windows: create folder <your-GS3>\packages\jre
 +
 +b. Visit: https://www.java.com/en/download/manual.jsp
 +
 +c. Click the "Windows Offline" link (which is the Java 8 update 301 for 32 bit win).
 +
 +//It has to be the 32 bit, don't get the 64 bit as then MG/MGPP indexers and GDBM will not work without manually recompiling Greenstone 3.//
 +
 +d. Then run the JRE windows installer and at the start of the installer, **ensure you tick "Change destination folder"** tickbox at the bottom.
 +- Set the destination folder to <your-GS3>\packages\jre
 +- Run through the installer
 +
 +The above steps will have put a compatible JRE8 into <your-GS3>/packages/jre
 +
 +
 +2. **For Linux users:**
 +
 +a. Rename <your-GS3>\packages\jre to <your-GS3>\packages\jre.orig
 +
 +b. Visit: https://www.java.com/en/download/manual.jsp
 +
 +c. Click the "Linux x64" link, which is the Java 8 update 301 for Linux x64.
 +
 +d. Put the downloaded tar.gz into the ''<your-GS3>/packages/'' folder.
 +Decompress.
 +
 +You may now have ended up with a decompressed folder like ''jre-...'' possibly containing yet another subfolder ''jre...''. Ultimately there will be ''bin'' and other subfolders in there somewhere.
 +
 +Move any ''jre...'' folder that immediately contains a ''bin'' and the other subfolders into ''<your-GS3>/packages/'', so you don't have more than one level of any folder called ''jre...''.
 +
 +Then rename the ''jre...'' folder to just ''jre''.
 +
 +You want to end up with this structure: ''<your-GS3>/packages/jre/bin''
 +
  
en/plugin/unknownconverterplugin.txt · Last modified: 2023/03/13 01:46 by 127.0.0.1