4 bytes removed ,  13:37, 2 July 2020
=Introduction=
The lexical analysis of Japanese documents is performed by [http://igo.osdn.jp/ Igo]. Igo is a ''morphological analyser'', and some of the advantages and reasons for using Igo are:

* It is compatible with the results of the prominent morphological analyzer "MeCab".
* It can use the dictionary distributed by the MeCab project.
* The morphological analyzer is implemented in Java and is relatively fast.

Japanese tokenization will be activated in BaseX if Igo is found in the classpath. [https://en.osdn.net/projects/igo/releases/ igo-0.4.3.jar] of Igo is currently included in all distributions of BaseX.

In addition to the library, one of the following dictionary files must either be unzipped into the current directory, or into the <code>etc</code> sub-directory of the project’s [[Configuration#Home Directory|Home Directory]]:
 
* IPA Dictionary: https://files.basex.org/etc/ipadic.zip
* NAIST Dictionary: https://files.basex.org/etc/naistdic.zip
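
The two preconditions described above (Igo on the classpath, and a dictionary in the current directory or in the <code>etc</code> sub-directory of the home directory) can be checked with a small helper. The following is only a minimal sketch, not BaseX's own detection logic. The class name <code>IgoCheck</code> and its methods are hypothetical, and the names passed in by the caller, such as <code>net.reduls.igo.Tagger</code> for Igo's tagger class or <code>ipadic</code> for the unzipped dictionary directory, are assumptions that should be verified against the actual JAR and archive contents.

```java
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper; not part of BaseX or Igo.
public class IgoCheck {

    /** Returns true if the named class can be found on the current classpath. */
    static boolean isOnClasspath(String className) {
        try {
            // 'false' avoids running static initializers; we only test for presence.
            Class.forName(className, false, IgoCheck.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    /**
     * Returns true if a directory called dictName exists either directly in
     * currentDir or in the "etc" sub-directory of homeDir.
     * The dictionary directory name is an assumption supplied by the caller.
     */
    static boolean hasDictionary(Path currentDir, Path homeDir, String dictName) {
        return Files.isDirectory(currentDir.resolve(dictName))
            || Files.isDirectory(homeDir.resolve("etc").resolve(dictName));
    }
}
```

A caller might evaluate <code>IgoCheck.isOnClasspath("net.reduls.igo.Tagger")</code> together with <code>IgoCheck.hasDictionary(Paths.get("."), home, "ipadic")</code>, where <code>home</code> is the BaseX home directory. If both return true, the prerequisites listed on this page appear to be met.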