This article is linked from the [[Full-Text]] page. It gives some insight into the implementation of the full-text features for Japanese text corpora. The Japanese version is [http://files.basex.org/etc/ja-ft.pdf also available as PDF].
The lexical analysis of Japanese documents is performed by
''morphological analysis''.[http://igo.sourceforge.jp/ Igo - A morphological analyzer] 0. 4.3 is used as an external library for morphological analysis. Some of the advantages and reasonsfor using Igo are:* it is a prominent morphological analyzer that is compatible with "MeCab" results* it can be used to distribute the dictionary project MeCab
* the morphological analyzer is implemented in Java and is relatively fast
The example sentence "私は本を書きました。(I wrote a book.)" is analyzed as follows.