zorba-coders team mailing list archive
-
zorba-coders team
-
Mailing list archive
-
Message #09565
[Merge] lp:~zorba-coders/zorba/feature-ft_module into lp:zorba
Paul J. Lucas has proposed merging lp:~zorba-coders/zorba/feature-ft_module into lp:zorba.
Requested reviews:
Paul J. Lucas (paul-lucas)
Related bugs:
Bug #944795 in Zorba: "XQDoc doesn't handle & in URLs"
https://bugs.launchpad.net/zorba/+bug/944795
For more details, see:
https://code.launchpad.net/~zorba-coders/zorba/feature-ft_module/+merge/105421
Documentation tweaks.
--
https://code.launchpad.net/~zorba-coders/zorba/feature-ft_module/+merge/105421
Your team Zorba Coders is subscribed to branch lp:zorba.
=== modified file 'doc/zorba/ft_tokenizer.dox'
--- doc/zorba/ft_tokenizer.dox 2012-05-03 12:31:51 +0000
+++ doc/zorba/ft_tokenizer.dox 2012-05-10 23:53:19 +0000
@@ -10,9 +10,12 @@
\section ft_tokenizer_tokization Tokenization
-Using the
-<a href="http://site.icu-project.org/">ICU library</a>,
-Zorba's implementation of tokenization
+By default,
+Zorba uses the
+<a href="http://site.icu-project.org/">ICU library</a>
+for tokenization.
+For Roman alphabets,
+Zorba (ICU)
considers only alpha-numeric sequences of characters to be part of a token;
whitespace and punctuation characters are not
and separate tokens.
@@ -117,7 +120,9 @@
<tr>
<td>\c lang</td>
<td>
- The language of the string.
+ The
+ <a href="http://www.w3.org/TR/xmlschema-2/#language">language</a>
+ of the string.
</td>
</tr>
<tr>
@@ -132,7 +137,7 @@
<tr>
<td>\c callback</td>
<td>
- The Callback to call once per token.
+ The \c Callback to call once per token.
</td>
</tr>
<tr>
@@ -232,7 +237,9 @@
<tr>
<td>\c languages</td>
<td>
- The list of languages supported by the tokenizer.
+ The list of
+ <a href="http://www.w3.org/TR/xmlschema-2/#language">languages</a>
+ supported by the tokenizer.
</td>
</tr>
<tr>
@@ -247,7 +254,9 @@
In addition to a \c Tokenizer,
you must also implement a \c TokenizerProvider
-that, given a language, provides a \c Tokenizer for that language:
+that,
+given a <a href="http://www.w3.org/TR/xmlschema-2/#language">language</a>,
+provides a \c Tokenizer for that language:
\code
class TokenizerProvider {
@@ -262,7 +271,11 @@
<table>
<tr>
<td>\c lang</td>
- <td>The language to tokenize.</td>
+ <td>
+ The
+ <a href="http://www.w3.org/TR/xmlschema-2/#language">language</a>
+ to tokenize.
+ </td>
</tr>
<tr>
<td>\c num</td>
Follow ups
-
[Merge] lp:~zorba-coders/zorba/feature-ft_module into lp:zorba
From: noreply, 2012-05-11
-
[Merge] lp:~zorba-coders/zorba/feature-ft_module into lp:zorba
From: Zorba Build Bot, 2012-05-11
-
[Merge] lp:~zorba-coders/zorba/feature-ft_module into lp:zorba
From: Zorba Build Bot, 2012-05-11
-
[Merge] lp:~zorba-coders/zorba/feature-ft_module into lp:zorba
From: Paul J. Lucas, 2012-05-11
-
[Merge] lp:~zorba-coders/zorba/feature-ft_module into lp:zorba
From: Zorba Build Bot, 2012-05-11
-
Re: [Merge] lp:~zorba-coders/zorba/feature-ft_module into lp:zorba
From: Zorba Build Bot, 2012-05-11
-
[Merge] lp:~zorba-coders/zorba/feature-ft_module into lp:zorba
From: Zorba Build Bot, 2012-05-11
-
[Merge] lp:~zorba-coders/zorba/feature-ft_module into lp:zorba
From: Matthias Brantner, 2012-05-11
-
Re: [Merge] lp:~zorba-coders/zorba/feature-ft_module into lp:zorba
From: Matthias Brantner, 2012-05-11
-
Re: [Merge] lp:~zorba-coders/zorba/feature-ft_module into lp:zorba
From: Paul J. Lucas, 2012-05-10