Accent insensitive search query in MySQL

You can change the collation at runtime in the sql query,

...where title like '%torun%' collate utf8_unicode_ci 

but beware that changing the collation on the fly at runtime forgoes the possibility of mysql using an index, so performance on large tables may be terrible.

Or, you can copy the column to another column, such as searchable_title, but change the collation on it. It’s actually common to do this type of stuff, where you copy data but have it in some slightly different form that’s optimized for some specific workload/purpose. You can use triggers as a nice way to keep the duplicated columns in sync. This method has the potential to perform well, if indexed.

Note – Make sure that your db really has those characters and not html entities.
Also, the character set of your connection matters. The above assumes it’s set to utf8, for example, via set names like set names utf8

If not, you need an introducer for the literal value

...where title like _utf8'%torun%' collate utf8_unicode_ci 

and of course, the value in the single quotes must actually be utf8 encoded, even if the rest of the sql query isn’t.

Leave a Comment