User Tools

Site Tools


en:user_advanced:ice_cite

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revisionBoth sides next revision
en:user_advanced:ice_cite [2019/02/21 06:18] anupamaen:user_advanced:ice_cite [2019/02/21 06:20] – [Using the UnknownConverterPlugin to launch Icecite from GLI to do the PDF to text conversion] anupama
Line 24: Line 24:
   - Create a new collection called Icecite. In the Gather pane, drop in the sample PDF file into your collection.   - Create a new collection called Icecite. In the Gather pane, drop in the sample PDF file into your collection.
   - In the **Design** pane and select **Document Plugins** from the list on the left. Add the **UnknownConverterPlugin**. Having tried out the Icecite conversion command manually in the previous part of this tutorial, we're now ready to use it when configuring the **UnknownConverterPlugin**. Click **<Configure Plugin...>** and set up the plugin with the following settings:   - In the **Design** pane and select **Document Plugins** from the list on the left. Add the **UnknownConverterPlugin**. Having tried out the Icecite conversion command manually in the previous part of this tutorial, we're now ready to use it when configuring the **UnknownConverterPlugin**. Click **<Configure Plugin...>** and set up the plugin with the following settings:
-     * set convert_to to the text option, this is the output format upon conversion +     * set ''convert_to'' to the ''text'' option, this is the output format upon conversion 
-     * set mime_type to application/pdf +     * set ''mime_type'' to ''application/pdf'' 
-     * set srcicon to the iconpdf, since Greenstone already knows about this macro and already has an icon for PDFs and knows to associate the two +     * set ''srcicon'' to the ''iconpdf'', since Greenstone already knows about this macro and already has an icon for PDFs and knows to associate the two 
-     * set process_extension to pdf, this is the input format of the files that this instance of the UnknownConverterPlugin will process +     * set ''process_extension'' to ''pdf'', this is the input format of the files that this instance of the UnknownConverterPlugin will process 
-     * set the exec_cmd field as follows, depending on your operating system: +     * set the ''exec_cmd'' field as follows, depending on your operating system: 
-        * on Windows:\\ ''DRIVE:\PATH\TO\YOUR-JAVA-8-HOME\bin\java -classpath "%%GSDL3SRCHOME\ext\icecite\gs-installed-jars\*:%%GSDL3SRCHOME\ext\icecite\pdf-cli\target\pdf-cli-0.0.1-SNAPSHOT-jar-with-dependencies.jar" cli.PdfParserCommandLine --format txt --feature paragraphs %%INPUT_FILE %%OUTPUT''\\ +        * on Windows:\\ ''DRIVE:\PATH\TO\YOUR-JAVA-8-HOME\bin\java -classpath "%%GSDL3SRCHOME\ext\icecite\gs-installed-jars\*:%%GSDL3SRCHOME\ext\icecite\pdf-cli\target\pdf-cli-0.0.1-SNAPSHOT-jar-with-dependencies.jar" cli.PdfParserCommandLine --format txt --feature paragraphs %%INPUT_FILE %%OUTPUT'' 
-        * on Unix systems:\\ ''/PATH/TO/YOUR-JAVA-8-HOME/bin/java -classpath '%%GSDL3SRCHOME/ext/icecite/gs-installed-jars/*:%%GSDL3SRCHOME/ext/icecite/pdf-cli/target/pdf-cli-0.0.1-SNAPSHOT-jar-with-dependencies.jar' cli.PdfParserCommandLine --format txt --feature paragraphs %%INPUT_FILE %%OUTPUT''\\ \\ +        * on Unix systems:\\ ''/PATH/TO/YOUR-JAVA-8-HOME/bin/java -classpath '%%GSDL3SRCHOME/ext/icecite/gs-installed-jars/*:%%GSDL3SRCHOME/ext/icecite/pdf-cli/target/pdf-cli-0.0.1-SNAPSHOT-jar-with-dependencies.jar' cli.PdfParserCommandLine --format txt --feature paragraphs %%INPUT_FILE %%OUTPUT'' 
  
 Note: When filling in the ''exec_cmd'' field, leave the words with %% signs in front of them intact. They are placeholders for Greenstone to replace. Note: When filling in the ''exec_cmd'' field, leave the words with %% signs in front of them intact. They are placeholders for Greenstone to replace.
en/user_advanced/ice_cite.txt · Last modified: 2023/03/13 01:46 by 127.0.0.1