nzdl:projects

Differences

This shows you the differences between two versions of the page.

nzdl:projects [2017/12/05 01:12] – [Extracting data and metadata] kjdon
nzdl:projects [2023/03/13 01:46] (current) – external edit 127.0.0.1
Line 1: Line 1:
 +
 +
 +
 ====== NZDL projects and Demonstrations ======
  
Line 26: Line 29:
 [[http://www.nzdl.org/Kea/|Kea]] is a program for automatically extracting keywords and keyphrases from the full text of documents. Candidate keyphrases are identified using rudimentary lexical processing, features are computed for each candidate, and machine learning is used to determine which candidates should be assigned as keyphrases.
  
 +==== Maui ====
  
 +[[https://code.google.com/archive/p/maui-indexer/|Maui]] is an indexing tool that automatically identifies the main topics in text documents. Depending on the task, topics are tags, keywords, keyphrases, vocabulary terms, descriptors, index terms or titles of Wikipedia articles. Maui builds on the Kea algorithm but provides additional functionality: it can assign topics to documents based on terms from Wikipedia using Wikipedia Miner. Maui also has many new features that help identify topics more accurately.
 +
 +==== Wikipedia Miner ====
 +
 +[[http://nzdl.org/wikipediaminer | Wikipedia Miner]] is an open-source software system that allows researchers and developers to integrate Wikipediaʼs rich semantics into their own applications. The toolkit creates databases that contain summarized versions of Wikipediaʼs content and structure, and includes a Java API to provide access to them. 
 =====Browsing interfaces=====
  
Line 43: Line 52:
    
 It supports the PDF and DjVu document formats.
 +
 +==== MAT: Metadata Analysis Tool ====
 +
 +[[nzdl:mat|MAT]] is a tool for producing statistics and visualisations of repository metadata.
 +
  
 ==== Phind====
Line 78: Line 92:
 =====Others=====
  
-[[http://collections.nzdl.org/ELKB/|Electronic Lexical Knowledge Base (ELKB)]] is software for accessing and exploring Roget's Thesaurus. It also provides solutions for various natural language processing tasks. All scripts were originally developed as part of Mario Jarmasz's Master's thesis at the [[http://engineering.uottawa.ca/eecs/|University of Ottawa]], Canada.
+[[http://nzdl.org/ELKB/|Electronic Lexical Knowledge Base (ELKB)]] is software for accessing and exploring Roget's Thesaurus. It also provides solutions for various natural language processing tasks. All scripts were originally developed as part of Mario Jarmasz's Master's thesis at the [[http://engineering.uottawa.ca/eecs/|University of Ottawa]], Canada.
  
nzdl/projects.1512436358.txt.gz · Last modified: 2017/12/05 01:12 by kjdon