Search function and Unicode equivalence
Posted: Tue May 13, 2014 12:52 pm
Hello Community,
it seems reasonable to improve the search function wrt. Unicode equivalence, what is traditionally completely neglected.
PHP's intl extension offers grapheme_strpos() which could be used instead of the current strpos(), if available (otherwise we'd had to fall back to strpos() anyway).
We should furthermore consider to use a case-insensitive comparision, i.e. grapheme_stripos(), and otherwise fall back to the current algorithm that uses utf8_strtolower() and strpos(). Especially this step might provide a performance improvement and better results.
Christoph
it seems reasonable to improve the search function wrt. Unicode equivalence, what is traditionally completely neglected.
PHP's intl extension offers grapheme_strpos() which could be used instead of the current strpos(), if available (otherwise we'd had to fall back to strpos() anyway).
We should furthermore consider to use a case-insensitive comparision, i.e. grapheme_stripos(), and otherwise fall back to the current algorithm that uses utf8_strtolower() and strpos(). Especially this step might provide a performance improvement and better results.
Christoph