User Tools

Site Tools


old:advanced_tasks

This page is in the 'old' namespace, and was imported from our previous wiki. We recommend checking for more up-to-date information using the search box.

Advanced Tasks

How can I obtain a list of all subject headings in my database?

Diego Spano explains:

There is no direct way to do that, but there is a workaround. Run the following commands in your terminal:

1. To set up the GS environment, type:

. ./setup.bash

2. Then type:

perl -S buildcol.pl -store_metadata_coverage your_collection_name

This will create a building folder inside your collection and there you will find a folder named text. Inside you will have a file, "your_collection_name.gdb". This is a database file that will contain all the metadata assigned to each document.

Now, you have to export this file to a txt format with the command db2txt:

3. Move to the folder that has the gdb file, by typing:

cd /greenstone/collect/your collection/building/text"

4. Next, run:

db2txt your_collection_name.gdb > meta.txt

Now you have a file named meta.txt (just plain text) in the same folder of your gdb file. Open it and take a look. You will have a list of metadata for each document.

5. Now you can "Filter" it with grep command (you have it on Linux but in Windows you can use Cygwin):

grep "<dc.Subject>" ./meta.txt > onlysubject.txt

Now you have onlysubject.txt file with all the values (not unique values).

old/advanced_tasks.txt · Last modified: 2023/03/13 01:46 by 127.0.0.1