2008-10-13, 16:01
http://forum.xbmc.org/showthread.php?tid=38378
spiff Wrote: I have considered it, but I do not feel comfortable scraping a site that provides open APIs. That being said, anyone else is of course free to do it.
Hi spiff!
I've already started on the last.fm scraper, but I'm having trouble understanding how the flow and interaction between the scraper and XBMC work.
I started by modifying your allmusic scraper, just to have something to begin with.
I would like to fully understand the flow.
First, the scraper creates the album search URL; I managed to get that working. Once a working URL has been created, XBMC makes the request to it and then applies the regexp to parse the results. What follows is what I don't fully understand: once I have the basic information (album title and URL), how is the request for the album information URL made? I never get a list of albums or anything.
This is my GetAlbumSearchResults:
<GetAlbumSearchResults dest="8">
    <RegExp input="$$5" output="<results>\1</results>" dest="8">
        <RegExp input="$$1" output="<entity><year>2000</year><genre>test</genre><title>\2</title><url>http://www.last.fm/music\1</url></entity>" dest="5">
            <expression repeat="yes"><a href="(.*)">(.*)</a> <span</expression>
        </RegExp>
        <expression noclean="1"></expression>
    </RegExp>
</GetAlbumSearchResults>
I don't know whether all the fields (year, genre, title, etc.) need to exist in the result of the first request. I don't have them available at first, but I would get them from the URL fetched in the regexp.
Perhaps there is something wrong with the regexp. I've tried many simple ones with no result. Is there any way to force a valid result, so I can get to the next step in the scraper?
Well, I don't really know if I'm actually making any sense here, but thank you very much in advance for your help.
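To rule out the expression itself, here is a rough Python sketch of how it could be checked outside XBMC. Everything in it is an assumption made for illustration: the sample HTML is invented (the real last.fm markup will differ), the names `SAMPLE_HTML`, `GREEDY`, `TIGHT`, `build_results` and `count_matches` are hypothetical, and Python's `re` module is only a stand-in for whatever regex engine XBMC uses, so the behavior may not be identical.

```python
import re

# Invented sample of search-result markup, all on one line.
# Assumption: the real last.fm page will look different.
SAMPLE_HTML = (
    '<li><a href="/Artist+One/Album+A">Album A</a> <span class="x">1</span></li>'
    '<li><a href="/Artist+Two/Album+B">Album B</a> <span class="x">2</span></li>'
)

# The expression from the scraper above, unchanged (greedy groups).
GREEDY = r'<a href="(.*)">(.*)</a> <span'

# A tighter variant: each group stops at the next quote or tag.
TIGHT = r'<a href="([^"]*)">([^<]*)</a> <span'


def count_matches(pattern, html):
    """Return how many separate matches the pattern finds in html."""
    return len(re.findall(pattern, html))


def build_results(pattern, html):
    """Roughly mimic repeat="yes": one <entity> per match, wrapped in <results>."""
    entities = [
        "<entity><title>{}</title><url>http://www.last.fm/music{}</url></entity>".format(
            m.group(2), m.group(1)
        )
        for m in re.finditer(pattern, html)
    ]
    return "<results>" + "".join(entities) + "</results>"


if __name__ == "__main__":
    print("greedy matches:", count_matches(GREEDY, SAMPLE_HTML))
    print("tight matches:", count_matches(TIGHT, SAMPLE_HTML))
    print(build_results(TIGHT, SAMPLE_HTML))
```

On this made-up sample, the greedy `(.*)` groups swallow both links into a single match, while the tighter character classes yield one match per album. If the same thing happens on the real page, that alone could explain why no usable album list comes out.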
Pyro-X