Search should not differ between UTF8 and ASCII

Any ideas about how to improve MediaMonkey? Let us know!

Moderator: Gurus

MitchMitchel23
Posts: 12
Joined: Sun Dec 09, 2007 4:24 pm
Contact:

Search should not differ between UTF8 and ASCII

Post by MitchMitchel23 »

Hi!

I'm tagging my files with MusicBrainz Picard. Musicbrainz alows UTF8 and ASCII characters.

The search funktion in MM differs between these characters. If I search for "It’s Raining Men" (with U+2019) MM should find both versions "It’s Raining Men" and "It's Raining Men" not only the ones with "’".

Or missed I an option?

Here are some examples:

Code: Select all

‘…’ (U+2018 'LEFT SINGLE QUOTATION MARK' / U+2019 'RIGHT SINGLE QUOTATION MARK')
“…” (U+201C 'LEFT DOUBLE QUOTATION MARK' / U+201D 'RIGHT DOUBLE QUOTATION MARK')
’ Apostrophe (U+2019 'RIGHT SINGLE QUOTATION MARK') ([4])
‘ left single quotation mark 
… 	Horizontal ellipsis 
- (U+0012 'HYPHEN-MINUS') is the multi-purpose character with its own key on the keyboard ([6]).
− (U+2212 'MINUS SIGN') is used in equations or for negative numbers ([8]).
‒ (U+2012 'FIGURE DASH') is the same width as a digit and thus mostly used within a number ([9]).
– (U+2013 'EN DASH') is used for ranges, relationships, and connections ([10]).
— (U+2014 'EM DASH') can be used instead of colons, parentheses or to show an aprupt change of thought ([11]).
· Mittelpunkt 	
German
„…“ (U+201E 'DOUBLE LOW-9 QUOTATION MARK' / U+201C 'LEFT DOUBLE QUOTATION MARK')
‚…‘ (U+201A 'SINGLE LOW-9 QUOTATION MARK' / U+2018 'LEFT SINGLE QUOTATION MARK')
»…« (U+00BB 'RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK' / U+00AB 'LEFT-POINTING DOUBLE ANGLE QUOTATION MARK')
›…‹ (U+203A 'SINGLE RIGHT-POINTING ANGLE QUOTATION MARK' / U+2039 'SINGLE LEFT-POINTING ANGLE QUOTATION MARK')
′ 	Minuten, Fuß
″ 	Zoll, Sekunden
Thanks!
deremder
Posts: 33
Joined: Thu Sep 06, 2012 1:28 pm
Contact:

Re: Search should not differ between UTF8 and ASCII

Post by deremder »

It's not the difference of the charset, it's a different char. That requires a char map or to ignore some none-word characters. In other cases, MM has this mapping. For example "motor" finds "Motörhead".
Post Reply