Changes

Full-Text (edit)

Revision as of 22:26, 25 May 2016

55 bytes added , 22:26, 25 May 2016

</pre>

By default, unless the languages codes "<code>ja"</code>, "<code>ar"</code>, "<code>ko"</code>, "<code>th"</code>, or "<code>zh" </code> are specified, a tokenizer for Western texts will be used to tokenize texts:

* Whitespaces are interpreted as token delimiters.

CG

Bureaucrats, editor, reviewer, Administrators

13,550

edits

Changes

Full-Text (edit)

Revision as of 22:26, 25 May 2016

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools