nzdl:projects
Differences
This shows you the differences between two versions of the page.
nzdl:projects [2017/09/25 01:34] – [Sequitur] kjdon | nzdl:projects [2017/11/05 22:26] – [Chinese Text Segmentation] kjdon | ||
---|---|---|---|
Line 1: | Line 1: | ||
====== NZDL projects and Demonstrations ====== | ====== NZDL projects and Demonstrations ====== | ||
- | New Zealand Digital Library Project members have developed a range of practical software packages in the course of their research. Much of this software is available for [[download]]. | + | New Zealand Digital Library Project members have developed a range of practical software packages in the course of their research. Much of this software is available for download. |
=====Digital libraries and indexing===== | =====Digital libraries and indexing===== | ||
Line 19: | Line 19: | ||
====Sequitur==== | ====Sequitur==== | ||
- | links dont work. http://sequence.rutgers.edu/sequitur/ | + | [[http://www.sequitur.info/ |Sequitur]] is a method for inferring compositional hierarchies from strings by detecting repetition and factoring it out of the string by forming rules in a grammar. Sequitur is useful for recognizing lexical structure in strings, and excels at very long sequences. The Sequitur WWW interface detects structure in text sequences. |
- | Sequitur is a method for inferring compositional hierarchies from strings by detecting repetition and factoring it out of the string by forming rules in a grammar. Sequitur is useful for recognizing lexical structure in strings, and excels at very long sequences. The Sequitur WWW interface detects structure in text sequences. | + | |
| | ||
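The Sequitur description above (detect repetition, factor it out as grammar rules) can be illustrated with a small sketch. Note that this is not the Sequitur algorithm itself, which works incrementally: it is a simpler offline greedy digram replacement in the style of Re-Pair, chosen only because it is short. The function names `infer_grammar` and `expand` and the rule names `R0`, `R1`, ... are assumptions made for illustration.

```python
from collections import Counter


def _replace(seq, pair, name):
    """Replace non-overlapping occurrences of `pair` (left to right) with `name`."""
    out, i, n = [], 0, 0
    while i < len(seq):
        if i + 1 < len(seq) and (seq[i], seq[i + 1]) == pair:
            out.append(name)
            i += 2
            n += 1
        else:
            out.append(seq[i])
            i += 1
    return out, n


def infer_grammar(text):
    """Greedy offline digram replacement (Re-Pair style), not incremental Sequitur.

    Returns (top_level_sequence, rules), where rules maps a rule name to the
    two symbols it stands for.
    """
    seq = list(text)
    rules = {}
    while True:
        counts = Counter(zip(seq, seq[1:]))
        replaced = False
        for pair, count in counts.most_common():
            if count < 2:
                break  # remaining pairs occur only once
            name = f"R{len(rules)}"
            new_seq, n = _replace(seq, pair, name)
            if n >= 2:  # skip pairs that only repeat by overlapping, e.g. "aaa"
                rules[name] = list(pair)
                seq = new_seq
                replaced = True
                break
        if not replaced:
            return seq, rules


def expand(symbols, rules):
    """Expand a symbol sequence back into the original string."""
    return "".join(
        expand(rules[s], rules) if s in rules else s for s in symbols
    )


top, rules = infer_grammar("abcabcabc")
print(top, rules)
print(expand(top, rules))
```

Each rule nests inside later rules, so the result is a hierarchy rather than a flat list of repeated substrings, and expanding the top-level sequence reproduces the input exactly.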
Line 29: | Line 28: | ||
=====Text Mining===== | =====Text Mining===== | ||
- | | + | See our Text Mining Webpage. ?? what link? http:// |
=====Browsing interfaces===== | =====Browsing interfaces===== | ||
Line 38: | Line 37: | ||
==== 3D Book Visualizer ==== | ==== 3D Book Visualizer ==== | ||
- | The [[http:// | + | The [[http:// |
* Spinning the book around | * Spinning the book around | ||
Line 50: | Line 49: | ||
==== Phind==== | ==== Phind==== | ||
| | ||
- | Phind is an interface for browsing the phrases that occur in a collection. The phrases form an approximation of the topics covered. They are extracted from the noun-phrases occuring in the text, so nonsense phrases and phrases with very little information content are excluded. Each phrase is part of a hierarchy, and the user can browse more specialised topics, or retrieve documents that contain the phrase, at any point. You can see Phind in action in the [[http:// | + | [[http:// |
==== Collage==== | ==== Collage==== | ||
Line 66: | Line 65: | ||
A collage using a directory of images can be found at [[http:// | A collage using a directory of images can be found at [[http:// | ||
- | =====Word segmentation===== | + | ===== Chinese Text Segmentation===== |
[[http:// | [[http:// | ||
- | [[http:// | + | |
+ | [[http:// | ||
Word segmentation finds word boundaries in languages such as Chinese and Japanese, which (unlike English) are written without spaces or other word delimiters apart from punctuation marks. It matters for applications that use the word as the basic unit, because machine-readable Chinese text is invariably stored in unsegmented form. | Word segmentation finds word boundaries in languages such as Chinese and Japanese, which (unlike English) are written without spaces or other word delimiters apart from punctuation marks. It matters for applications that use the word as the basic unit, because machine-readable Chinese text is invariably stored in unsegmented form. | ||
- | We have implemented a WWW interface for segmanting | + | We have implemented a WWW interface for segmenting |
- | If your web browsers | + | If your web browser |
+ | Currently at [[http:// | ||
+ | More information can be found in the paper: [[https:// | ||
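To make the segmentation task concrete, here is a minimal sketch of one simple baseline, dictionary-based forward maximum matching. The page does not describe the algorithm behind the NZDL segmenter, so this should not be read as its method. The function name `segment`, the `max_len` parameter, and the tiny lexicon are all hypothetical.

```python
def segment(text, lexicon, max_len=4):
    """Forward maximum matching: at each position take the longest lexicon word.

    Characters that start no known word are emitted as single-character words.
    """
    words = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking down to one character.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in lexicon:
                words.append(candidate)
                i += size
                break
    return words


# Hypothetical toy lexicon, for illustration only.
lexicon = {"我们", "喜欢", "数字", "图书馆"}
print(segment("我们喜欢数字图书馆", lexicon))
```

Greedy matching of this kind depends entirely on the lexicon and cannot resolve ambiguous boundaries, which is one reason statistical segmenters are generally preferred in practice.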
=====Others===== | =====Others===== | ||
[[http:// | [[http:// | ||
nzdl/projects.txt · Last modified: 2023/03/13 01:46 by 127.0.0.1