This page is in the 'old' namespace, and was imported from our previous wiki. We recommend checking for more up-to-date information using the search box.
An updated version for Greenstone 2.82 and above is available here.
Z39.50 is an international client/server protocol for searching bibliographic data. It can use the Internet Protocol (TCP/IP), which makes the databases on a server available from almost anywhere around the globe. It is widely used, for example, in on-line library catalogues. It allows a user to search one or more databases and retrieve the results of the query.
Greenstone has support for z3950, both as a client and a server. This support is not enabled by default, and recompilation is needed to enable it. To do this, you need the source code. If you haven't already got the source, rerun the installer, select custom, and then choose only the source code.
z3950 support in Greenstone is based around the YAZ toolkit, written by IndexData. We are currently using YAZ version 2.1.4. This is included in the Greenstone distribution without modification. Greenstone links against the libyaz.a library. We have also written yaz_zclient.h/cpp, which is based on the sample client.h/cpp. This is found in greenstone/src/colservr.
Greenstone can download records from a specified Z39.50 server from GLI. Start the Greenstone Librarian Interface. On the left-hand side of the Librarian Interface's Download panel, select Z39.50, and then specify the arguments on the right-hand side of the panel. There are five arguments:
The YAZ client supports the type-1 RPN query model. Both simple and advanced queries are supported, like single term query, boolean search and fielded search, etc. This page describes in detail the query syntax of YAZ. For exmaple, the following query finds records that have the phrase car history in title field @attr 1=4 "car history"
You can view the downloaded marc files on the Gather panel. On the left-hand side of the panel, double click the Downloaded Files folder to expand its content. The subfolders are named by the Z39.50 server url. The marc files are named as the combination of database name, query, and max_records if max_records is specified. These marc files are physically stored in a temporary cache directory.
You can build a collection using these downloaded marc files by dragging them across to the Collection section on the right-hand side of the Gather panel. MARCPlug must be included in the collection plugin list. MARCPlug has three options:
SRW (Search/Retrieve Web service) is an alternative way that uses web service to search and retrieval Z39.50 repositories. It replaces the Z39.50 communications protocol with HTTP and SOAP, but still supports the Z39.50 query syntax. Search results from SRW are in XML format.
Greenstone also support download from a Z39.50 server through SRW. Go to the Download panel in Greenstone Librarian Interface and select SRW. On the right-hand side of the panel, there are five parameters, which are the same with Z39.50 download. But different host and port values should be used here. For example, to connect to the Library of Congress Z39.50 server through SRW, the following host and port should be specified:
host: http://z3950.loc.gov
port: 7090/voyager?
The downloaded records are in XML format. Here is a sample record. You can view the downloaded records on the Gather panel. They are in the same subfolder as downloading from Z39.50 above.
To build a collection using the downloaded XML files, MARCXMLPlug must be added to the collection plugin list. MARCXMLPlug has one option metadata_mapping, which is the same with MARCPlug and defaults to marctodc.txt as well.
./configure --enable-z3950 make make install
nmake /f win32.mak USE_Z3950=1
Once Greenstone has z3950 support compiled in, it can act as a client to multiple z3950 servers. The file greenstone/etc/packages/z3950.cfg specifies a list of servers to connect to. By default, no servers are set up, although the config file comes with two (commented out) example Z39.50 servers, both for servers of the United States' Library of Congress.
Each entry consists of:
The entries need only be separated by whitespace, but for the purposes of clarity the sample entry uses newlines and tabs.
There is a list at the Library Of Congress website containing some servers publicly available for testing.
z3950server tcp:server-name:port-num ** For example, 'z3950server tcp:localhost:8080' or 'z3950server tcp:kanuka.cs.waikato.ac.nz:7070'
You can test the z3950 server by connecting to it using the Greenstone z3950 client. In the greenstone/etc/packages/z3950.cfg file, add a server entry similer to the following:
Greenstone kanuka.cs.waikato.ac.nz:9999 demo "The demo collection via z3950" About "This collection contains a few records from the Humanity Development Library"