2012-09-30, 22:27
Sorry for crossposting this, but after posting it in the scraper thread yesterday I've started to think that perhaps this isn't a scraper issue after all but a tag reader issue:
I'm having a curious problem with umlauts. Certain umlauts are not parsed correctly while others are, and in some cases the same identical umlaut is parsed correctly in one place and not in another.
'Motörhead - March ör Die' is corrupted into 'Motörhead - March �r Die'. Notice how the umlaut 'ö' in the band name is correct, but the same umlaut gets corrupted in the album name. The incorrect strings get stored in the database. I'm using a nightly build and I know there have been some changes to the tag readers in the last few days so maybe this is related to those changes?
I'm having a curious problem with umlauts. Certain umlauts are not parsed correctly while others are, and in some cases the same identical umlaut is parsed correctly in one place and not in another.
Code:
01:09:58 T:4684 DEBUG: ADDON::CScraper::FindAlbum: Searching for 'Motörhead - March �r Die' using Universal Album Scraper scraper (path: 'D:\Static\_HTPC\XBMC_SVN\portable_data\addons\metadata.album.universal', content: 'albums', version: '1.3.3')
01:09:58 T:4684 DEBUG: scraper: CreateAlbumSearchUrl returned <url>http://search.musicbrainz.org/ws/2/release/?fmt=xml&query=release:March%20%f6r%20Die%20AND%20artist:Mot%c3%b6rhead</url>
01:09:58 T:4684 DEBUG: CurlFile::Open(08068978) http://search.musicbrainz.org/ws/2/release/?fmt=xml&query=release:March%20%f6r%20Die%20AND%20artist:Mot%c3%b6rhead
'Motörhead - March ör Die' is corrupted into 'Motörhead - March �r Die'. Notice how the umlaut 'ö' in the band name is correct, but the same umlaut gets corrupted in the album name. The incorrect strings get stored in the database. I'm using a nightly build and I know there have been some changes to the tag readers in the last few days so maybe this is related to those changes?