User Tools

Site Tools


en:release:3.09_release_notes

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Next revisionBoth sides next revision
en:release:3.09_release_notes [2018/10/02 06:31] – [PDF plugin restructuring and the NEW PDFv2Plugin] anupamaen:release:3.09_release_notes [2018/12/11 06:02] – [Important Changes and Bug Fixes] anupama
Line 92: Line 92:
 We're deprecating the old "PDFPlugin". And in its place there will be 2 plugins to handle PDFs: We're deprecating the old "PDFPlugin". And in its place there will be 2 plugins to handle PDFs:
   * "//PDFv1Plugin//" which is the same as the old PDFPlugin but minus the PDFBox_conversion option. It returns to using the old ''pdftohtml'' tool to do the conversions, and is limited to older versions of PDFs.   * "//PDFv1Plugin//" which is the same as the old PDFPlugin but minus the PDFBox_conversion option. It returns to using the old ''pdftohtml'' tool to do the conversions, and is limited to older versions of PDFs.
-  * the recommended "//PDFv2Plugin//", which will contain the new functionality and should handle a greater range of PDF versions, including the newer ones that the old ''pdftohtml'' (now used by PDFv1Plugin) can't handle. The "PDFBox conversion" facility has been moved to the new PDFv2Plugin, but is now invisible: it will be triggered automatically depending on the "convert_to" format that you select when you Configure the PDFv2Plugin. PDFv2Plugin also uses additional conversion tools in the background to support the additional output formats.+  * **the recommended "//PDFv2Plugin//"**, which will contain the new functionality and should handle a greater range of PDF versions, including the newer ones that the old ''pdftohtml'' (now used by PDFv1Plugin) can't handle. The "PDFBox conversion" facility has been moved to the new PDFv2Plugin, but is now invisible: it will be triggered automatically depending on the "convert_to" format that you select when you Configure the PDFv2Plugin. PDFv2Plugin also uses additional conversion tools in the background to support the additional output formats.
  
 For the eventual 3.09 release, the old PDFPlugin that you're familiar with, the one which has the ''pdfbox_conversion'' flag but also makes use of the old ''pdftohtml'' tool behind the scenes, will hang around with a deprecated warning, to allow people to port over their collections and keep rebuilding with the old settings or to rebuild their collection with one of the 2 new PDF plugins. However, **new collections will have the //PDFv2Plugin// in the Document Plugins pipeline by default, for GS3, and PDFv1Plugin by default for GS2, since GS2 doesn't come with the PDFbox extension out of the box.** So GS2 users will have to manually add in PDFv2Plugin in place of PDFv1Plugin for new collections, after setting up the pdfbox extension. But then it should work as usual. For the eventual 3.09 release, the old PDFPlugin that you're familiar with, the one which has the ''pdfbox_conversion'' flag but also makes use of the old ''pdftohtml'' tool behind the scenes, will hang around with a deprecated warning, to allow people to port over their collections and keep rebuilding with the old settings or to rebuild their collection with one of the 2 new PDF plugins. However, **new collections will have the //PDFv2Plugin// in the Document Plugins pipeline by default, for GS3, and PDFv1Plugin by default for GS2, since GS2 doesn't come with the PDFbox extension out of the box.** So GS2 users will have to manually add in PDFv2Plugin in place of PDFv1Plugin for new collections, after setting up the pdfbox extension. But then it should work as usual.
Line 271: Line 271:
  
 ===== Important Changes and Bug Fixes ===== ===== Important Changes and Bug Fixes =====
-  * HTTPS support: Greenstone will obtain a certificate from the Certification Authority Let's Encrypt to run your GS3 tomcat over https. However, on unix systems (macs and linux), you will need to have sudo permissions. And on Windows you will probably need admin rights.+  * **HTTPS support:** Greenstone will obtain a certificate from the Certification Authority Let's Encrypt to run your GS3 tomcat over https. However, on unix systems (macs and linux), you will need to have sudo permissions. And on Windows you will probably need admin rights. For instructions on usage, see [[#setting_up_your_greenstone_to_run_over_https|Setting up your Greenstone to run over https]] 
 +  * **GreenstoneSQLPlugin/-out:** used in place of GreenstoneXMLPlugin/-out to write metadata and/or fulltext into a MySQL database instead of Greenstone doc.xml files. You can then use SQL statements to mass-edit metadata/fulltext and rebuild your collection with the modified metadata/fulltext. See the wiki page on [[http://wiki.greenstone.org/doku.php?id=en:user_advanced:greenstonesqlplugs|Using the GreenstoneSQLPlugout with GreenstoneSQLPlugin]]. 
 +  * **The UnknownConverterPlugin:** if you have a command line tool installed that can convert from a document format to text or html (or png/jpg/gif images) and which you're able to successfully run from the command line to do such a conversion, then you can configure the new UnknownConverterPlugin to launch that command line tool and run the conversion automatically. This will allows document formats unrecognised by other Greenstone plugins to have their full text extracted and made searchable in Greenstone. There is a tutorial for Greenstone 3 that covers how to use the UnknownConverterPlugin. 
 +  * GS3 now supports user comments. Refer to [[http://wiki.greenstone.org/doku.php?id=en:user:user_comments|Enabling user comments]] 
 +  * OAI deletion policy 
 +  * Better way to run processes from GLI will avoid some occasional and unexpected errors when GLI runs perl scripts 
 +  * Bug fixes to file locking issues on Windows when using Lucene as indexer 
 +  * Patch to SOLR extension to circumvent SIGPIPE errors on large collections 
 +  * Patches to perl code upgrading perl syntax to work with newer versions of perl
 ===== IMPORTANT information ===== ===== IMPORTANT information =====
  
en/release/3.09_release_notes.txt · Last modified: 2023/03/13 01:46 by 127.0.0.1