User Tools

Site Tools


nzdl:projects

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revisionBoth sides next revision
nzdl:projects [2017/11/05 22:43] – [Others] kjdonnzdl:projects [2017/11/06 00:27] – [Chinese Text Segmentation] kjdon
Line 66: Line 66:
  
 ===== Chinese Text Segmentation===== ===== Chinese Text Segmentation=====
- 
-[[http://www.nzdl.org/cgi-bin/congb]] 
- 
-[[http://www.nzdl.org/chinese-text-segmenter/demo1.htm]]  
  
 Word segmentation is designed to find word boundaries in languages like Chinese and Japanese, which are (unlike English) written without spaces or other word delimiters (except for punctuation marks). It plays a significant role in applications that use the word as the basic unit due to the fact that machine-readable Chinese text is invariably stored in unsegmented form. Word segmentation is designed to find word boundaries in languages like Chinese and Japanese, which are (unlike English) written without spaces or other word delimiters (except for punctuation marks). It plays a significant role in applications that use the word as the basic unit due to the fact that machine-readable Chinese text is invariably stored in unsegmented form.
  
-We have implemented a WWW interface for segmenting Chinese text.+We have implemented a WWW interface for segmenting Chinese text. A demo used to be available at www.nzdl.org/cgi-bin/congb but that is no longer running. You can see an illustration of the transform at [[http://www.nzdl.org/chinese-text-segmenter/demo1.htm]]. (Currently at [[http://community.nzdl.org/www/chinese-text-segmenter/demo1.htm]])
  
-If your web browser does not support Chinese text[[http://www.nzdl.org/chinese-text-segmenter/demo1.htm|illustrations of the transformation]] are available. +(Note, the code can be found on community, in the chinese-text-segmenter directory.)
-Currently at [[http://commdev.nzdl.org/www/chinese-text-segmenter/demo1.htm]]+
  
 More information can be found in the paper: [[https://www.cs.waikato.ac.nz/~ihw/papers/00WT-YW-RMN-IHW-Comprsbased.pdf| A Compression-based Algorithm for Chinese Word Segmentation]] More information can be found in the paper: [[https://www.cs.waikato.ac.nz/~ihw/papers/00WT-YW-RMN-IHW-Comprsbased.pdf| A Compression-based Algorithm for Chinese Word Segmentation]]
nzdl/projects.txt · Last modified: 2023/03/13 01:46 by 127.0.0.1