2010-12-06, 20:40
Hi everyone,
I have a problem concerning the encoding of the web site my scraper parses:
This is the meta tag of the web site:
[HTML]<meta http-equiv="content-type" content="text/html; charset=iso-8859-15" />
[/HTML]
There are some umlauts which are not read correctly. Those which are escaped using HTML entities like &auml; work perfectly. Unfortunately, there are some characters which have not been correctly escaped by the website, so there is a raw ä (hex E4 in ISO-8859-15) directly in the source code.
This character is not read correctly if my result XML is UTF-8 encoded. If I return an ISO-8859-15 encoded document, the ä character is displayed correctly, but the HTML entities are broken.
Is there a way to convert the encoding or can it be done by xbmc automatically? Any other ideas how to solve this?
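To make clear what I mean by converting the encoding, here is a rough sketch in plain Python. It is not XBMC scraper code, and the function name `to_utf8` and the sample bytes are made up for illustration. It decodes the raw page bytes as ISO-8859-15, resolves the HTML entities, and re-encodes everything as UTF-8:

```python
import html


def to_utf8(raw: bytes, source_encoding: str = "iso-8859-15") -> bytes:
    """Hypothetical helper: normalise a scraped page fragment to UTF-8."""
    # Interpret the raw bytes using the charset announced in the meta tag,
    # so that the byte 0xE4 becomes the character "ä".
    text = raw.decode(source_encoding)
    # Turn HTML entities such as &auml; into the same real characters.
    text = html.unescape(text)
    # Re-encode the now uniform text as UTF-8 for the result document.
    return text.encode("utf-8")


# Made-up sample: one raw 0xE4 byte and one escaped umlaut.
sample = b"M\xe4rchen und M&auml;dchen"
print(to_utf8(sample).decode("utf-8"))
```

One caveat I am aware of: `html.unescape` also resolves `&amp;` and `&lt;`, so the text would need to be escaped again before it goes into the result XML.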
Kind regards
Larry_Lobster