[pmwiki-users] Search for terms with ss and ß

Simon nzskiwi at gmail.com
Sat Feb 4 00:44:27 PST 2023


This works well for words with macrons āēīōū

thanks

SImon


On Sat, 4 Feb 2023 at 20:52, Hans Bracker <design at softflow.uk> wrote:

> Hello Petko,
>
> Friday, February 3, 2023, 3:22:00 PM, you wrote:
>
> >    https://www.pmwiki.org/wiki/Cookbook/UnaccentUTF8
>
> > Not sure if it will be enough for you as it also folds to lowercase. But
> you can copy this  function and adapt it. Maybe simply remove ":: Lower();"
> from the argument, or review the documentation for the Intl/Transliterator
> class at php.net.
>
> Thanks, I tried it out, as you put it, and as a customisation for
> TextExtract.
> I think one needs to be very careful, if one wants to use it.
> For German language, and used as it is, it will give many false positives
> in search results.
> Word pairs like Bär and Bar, Blüten and bluten, Fähre and fahre, möchte
> and mochte, are treated as the same, but have total different meanings. So
> I would not recommend this recipe for German language sites. I can imagine
> other languages using UTF8 could have similar problems.
>
> As to my TextExtract search for terms with ss and ß:
> I think it may be better if I offer a customisation, with a custom array
> of substitutes.
> That could then also offer substitutes for accented characters, like used
> in Roman languages, but not substitutes for ä, ö, ü, and others, which
> would lead to too many false positive results.
>
>
> cheers,
> Hans
>
>
> _______________________________________________
> pmwiki-users mailing list
> pmwiki-users at pmichaud.com
> http://www.pmichaud.com/mailman/listinfo/pmwiki-users
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.pmichaud.com/pipermail/pmwiki-users/attachments/20230204/7c39d549/attachment.html>


More information about the pmwiki-users mailing list