mahara-contributors team mailing list archive
Message #11907
[Bug 1072972] Re: Internal search ignores 'KATAKANA-HIRAGANA PROLONGED SOUND MARK'
** Changed in: mahara
Milestone: 1.7.0 => 1.8.0
** Also affects: mahara/1.7
Importance: Undecided
Status: New
** Changed in: mahara/1.7
Milestone: None => 1.7.1
--
You received this bug notification because you are a member of Mahara
Contributors, which is subscribed to Mahara.
Matching subscriptions: Subscription for all Mahara Contrib members
https://bugs.launchpad.net/bugs/1072972
Title:
Internal search ignores 'KATAKANA-HIRAGANA PROLONGED SOUND MARK'
Status in Mahara ePortfolio:
In Progress
Status in Mahara 1.7 series:
New
Bug description:
Mahara's internal search (seen in 1.5.6) cannot handle the Japanese
character 'KATAKANA-HIRAGANA PROLONGED SOUND MARK'. This character, 'ー',
is used frequently, for example in 'データ (data)', 'サーバー (server)' and
'ポートフォリオ (portfolio)'.
The cause of the problem is line 1102 of search/internal/lib.php:
1102: $text = preg_replace('/['. PREG_CLASS_SEARCH_EXCLUDE .
']+/u', ' ', $text);
On this line, Mahara replaces every run of characters listed in
PREG_CLASS_SEARCH_EXCLUDE with a single space (' '). Because 'KATAKANA-HIRAGANA
PROLONGED SOUND MARK' is included in PREG_CLASS_SEARCH_EXCLUDE, a word
such as 'データ' is split at that character.
The fix is simple: remove 'KATAKANA-HIRAGANA PROLONGED SOUND MARK'
(code 0x30fc) from PREG_CLASS_SEARCH_EXCLUDE. The definition is on lines
1198-1225.
1221:
'\x{3099}-\x{309e}\x{30a0}\x{30fb}-\x{30fe}\x{3190}-\x{319f}\x{31c0}-\x{31cf}'.
should be replaced with
1221:
'\x{3099}-\x{309e}\x{30a0}\x{30fb}\x{30fd}\x{30fe}\x{3190}-\x{319f}\x{31c0}-\x{31cf}'.
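The effect of the change can be checked in isolation with a minimal sketch like the one below. It assumes a UTF-8 source file and uses only the single fragment of the character class quoted above, not the full PREG_CLASS_SEARCH_EXCLUDE constant. The helper name strip_excluded() is hypothetical and only mirrors the preg_replace() call on line 1102.

```php
<?php
// Only the one fragment of the character class quoted in the bug report is
// reproduced here; the real PREG_CLASS_SEARCH_EXCLUDE constant is much longer.
$old_fragment = '\x{3099}-\x{309e}\x{30a0}\x{30fb}-\x{30fe}\x{3190}-\x{319f}\x{31c0}-\x{31cf}';
$new_fragment = '\x{3099}-\x{309e}\x{30a0}\x{30fb}\x{30fd}\x{30fe}\x{3190}-\x{319f}\x{31c0}-\x{31cf}';

// Hypothetical helper (not part of Mahara) that applies the same kind of
// replacement as line 1102: each run of excluded characters becomes one space.
function strip_excluded($class, $text) {
    return preg_replace('/[' . $class . ']+/u', ' ', $text);
}

$word = 'データ'; // contains U+30FC between the two katakana letters

// Old class: the range \x{30fb}-\x{30fe} covers U+30FC, so the word is split.
$old_result = strip_excluded($old_fragment, $word);

// New class: U+30FC is no longer listed, so the word is left intact.
$new_result = strip_excluded($new_fragment, $word);

var_dump($old_result); // string "デ タ"
var_dump($new_result); // string "データ"
```

With the old fragment the search term becomes two separate pieces, which is consistent with the behaviour described in this report. With the new fragment the term is passed through unchanged, while U+30FB, U+30FD and U+30FE are still excluded.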
P.S. The definition of PREG_CLASS_SEARCH_EXCLUDE originally comes from
Drupal, where this fix has already been applied.
http://api.drupal.org/api/drupal/modules!search!search.module/constant/PREG_CLASS_SEARCH_EXCLUDE/6
To manage notifications about this bug go to:
https://bugs.launchpad.net/mahara/+bug/1072972/+subscriptions