German IMDB scraper, please test it and give feedback
#9
Eisbahn Wrote:@vdrfan:
Hmmm, sorry. Do we have a spec showing which tags are mandantory/optional? If not: how can I figure out which tags are supported? The IMDB com scraper fetches no infos about sound, subtitle, video-format (if I looked right), in several screenshots I could see infos about these things... So the answer: please do reverse engineering because everybody can implement tags however he/she likes is a bit contra productive and shows kind of quick-and-dirty-hacking without any concept? Is this the XBMC style?
What about:
Code:
<details>
    <title></title>
    <year></year>
    <director></director>
    <top250></top250>
    <mpaa></mpaa>
    <tagline></tagline>
    <runtime></runtime>
    <thumb></thumb>
    <credits></credits>
    <rating></rating>
    <votes></votes>
    <genre></genre>
    <actor>
        <name></name>
        <role></role>
    </actor>
    <outline></outline>
    <plot></plot>
</details>

@donabi: to cut some infos away is not a real problem and done in few seconds. But gathering all possible things is a bit more complicated. So first I would have a scraper which gets all infos.
If you have a decription of the alowed tags, please provide it. Is the order/sequence relevant, what tags are supported, what format is expected and so on. If the german board has active members, why not. But to be honest: think after the scraper my active work is over :=(

@all: Where can I get infos which tags are supported by XBMC? If the skins shows the infos doesn't matter at all, think a "good scrapper" should gather as much as possible. For the result of a scraper: is the order/sequence relevant, what tags are supported, what format is expected and so on. Today all I've done is reverse engineering, but I think thats not the right way...

Eisbahn

All tags are optional , but i would say its best that the TITLE is at least supplied

Code:
<details>
    <title>single instance/Required</title>
    <id>single instance/optional</id>
    <studio>single instance/optional</studio>
    <year>single instance/optional</year>
    <director>multiple instance/optional</director>
    <top250>single instance/optional</top250>
    <mpaa>single instance/optional</mpaa>
    <tagline>single instance/optional</tagline>
    <runtime>single instance/optional</runtime>
    <thumb>multiple instance/optional</thumb>
    <credits></credits>
    <rating>single instance/optional</rating>
    <votes>single instance/optional</votes>
    <genre>multiple instance/optional</genre>
    <actor>
        <name></name>
        <thumb></thumb>
        <role></role>
    </actor>
    <outline>single instance/optional</outline>
    <plot>single instance/optional</plot>
    <premiered>single instance/optional</premiered>
    <set>multiple instance/optional</set>
    <trailer>multiple instance/optional</trailer>
    <streamdetails>
       <audio/>
          <codec></codec>
          <channels></channels>
       </audio>
       <video>
           <codec></codec>
           <height></height>
           <width></width>
      </video>
      <subtitle>
         <language></language>
      </subtitle>
   </streamdetails>
</details>

of course it goes without saying that actor, audio (inside stream info), video(inside stream info) and subtitle(inside stream info) are multiple instance and optional
ScraperXML Open Source Web Scraper Library compatible with XBMC XML Scrapers


I Suck, and if you act now by sending only $19.95 and a self addressed stamped envelop, so can you!

Image
Reply


Messages In This Thread
[No subject] - by Spaggi - 2010-06-05, 00:35
[No subject] - by Eisbahn - 2010-06-05, 09:27
[No subject] - by mkortstiege - 2010-06-05, 10:55
[No subject] - by donabi - 2010-06-05, 10:56
[No subject] - by Eisbahn - 2010-06-05, 18:20
[No subject] - by olympia - 2010-06-05, 21:31
[No subject] - by Eisbahn - 2010-06-05, 23:04
[No subject] - by Nicezia - 2010-06-05, 23:15
[No subject] - by olympia - 2010-06-06, 08:57
[No subject] - by Nicezia - 2010-06-06, 09:11
[No subject] - by Nicezia - 2010-06-06, 09:24
[No subject] - by mkortstiege - 2010-06-06, 11:39
[No subject] - by Nicezia - 2010-06-06, 12:01
[No subject] - by Eisbahn - 2010-06-06, 23:59
[No subject] - by Nicezia - 2010-06-07, 00:44
[No subject] - by spiff - 2010-06-07, 10:25
[No subject] - by Eisbahn - 2010-06-07, 22:38
[No subject] - by Nicezia - 2010-06-10, 21:42
[No subject] - by Eisbahn - 2010-06-12, 12:24
[No subject] - by Eisbahn - 2010-06-13, 18:46
[No subject] - by Eisbahn - 2010-06-18, 17:40
[No subject] - by Eisbahn - 2010-06-18, 20:19
[No subject] - by xsidx - 2010-06-21, 13:34
[No subject] - by Eisbahn - 2010-07-11, 22:58
[No subject] - by krolli - 2010-07-12, 09:48
[No subject] - by mkortstiege - 2010-07-12, 10:19
[No subject] - by Eisbahn - 2010-07-12, 11:34
[No subject] - by mkortstiege - 2010-07-12, 11:44
[No subject] - by Eisbahn - 2010-07-12, 12:08
[No subject] - by mkortstiege - 2010-07-12, 12:25
[No subject] - by olympia - 2010-07-12, 13:57
[No subject] - by Eisbahn - 2010-07-12, 14:23
[No subject] - by Eisbahn - 2010-07-12, 15:28
[No subject] - by theuni - 2010-07-12, 15:38
[No subject] - by Eisbahn - 2010-07-12, 15:53
[No subject] - by olympia - 2010-07-12, 18:11
[No subject] - by Eisbahn - 2010-07-12, 19:30
[No subject] - by Gambler - 2010-07-16, 17:14
[No subject] - by Eisbahn - 2010-07-18, 10:36
[No subject] - by Eisbahn - 2010-07-24, 13:48
[No subject] - by Eisbahn - 2010-08-07, 15:58
[No subject] - by llwmuerte - 2010-08-08, 12:59
[No subject] - by sportsman - 2010-08-24, 11:17
[No subject] - by BurningSky - 2010-08-24, 15:54
[No subject] - by sportsman - 2010-08-24, 16:28
[No subject] - by sportsman - 2010-08-25, 00:13
[No subject] - by BurningSky - 2010-08-25, 07:19
[No subject] - by sportsman - 2010-08-25, 08:10
[No subject] - by schmchris - 2010-08-26, 18:37
[No subject] - by phil65 - 2010-08-31, 06:37
[No subject] - by jackad - 2010-09-02, 17:33
[No subject] - by Squizzy - 2010-09-04, 16:22
[No subject] - by BurningSky - 2010-09-04, 23:02
[No subject] - by Eisbahn - 2010-09-05, 09:21
[No subject] - by Squizzy - 2010-09-05, 13:28
[No subject] - by Eisbahn - 2010-09-05, 19:48
[No subject] - by Squizzy - 2010-09-06, 17:28
[No subject] - by Eisbahn - 2010-09-19, 17:00
[No subject] - by phil65 - 2010-09-28, 06:41
[No subject] - by Eisbahn - 2010-09-28, 23:11
Bug with " in the title - by Hoschie - 2010-10-05, 12:50
[No subject] - by Eisbahn - 2010-10-07, 22:38
[No subject] - by Squizzy - 2010-10-10, 17:39
[No subject] - by tjost - 2010-10-28, 13:54
[No subject] - by gorthaur - 2010-12-28, 16:46
[No subject] - by BurningSky - 2010-12-28, 17:04
The <set> tag - by XBMC-Roger - 2011-01-16, 15:03
[No subject] - by timmi1000 - 2011-01-17, 15:38
[No subject] - by linuxluemmel - 2011-01-26, 11:46
[No subject] - by linuxluemmel - 2011-01-30, 13:48
[No subject] - by segroove - 2011-03-17, 22:19
[No subject] - by Krauti - 2011-03-23, 15:41
[No subject] - by mbosner - 2011-03-24, 13:34
[No subject] - by Krauti - 2011-03-24, 14:50
[No subject] - by nicx76 - 2011-04-14, 15:17
[No subject] - by nicx76 - 2011-04-14, 15:29
[No subject] - by hawi1981 - 2011-04-19, 12:11
modded 3.0.5.1a - by trackel - 2011-04-22, 20:01
[No subject] - by linuxluemmel - 2011-06-20, 20:50
[No subject] - by airmax - 2011-06-26, 14:49
[No subject] - by apoapo - 2011-07-11, 07:43
[No subject] - by mkortstiege - 2011-07-15, 13:19
[No subject] - by daniello - 2011-07-31, 11:34
[No subject] - by trackel - 2011-07-31, 17:39
[No subject] - by daniello - 2011-07-31, 18:02
[No subject] - by trackel - 2011-08-01, 14:40
[No subject] - by trackel - 2011-08-04, 15:04
[release] 3.1.0 - by trackel - 2011-08-06, 15:15
[No subject] - by trackel - 2011-08-07, 13:15
[No subject] - by otcho - 2011-08-21, 15:11
[No subject] - by trackel - 2011-08-21, 16:19
[No subject] - by otcho - 2011-08-21, 16:29
[No subject] - by devkid - 2011-08-25, 12:59
[No subject] - by Anira - 2011-08-28, 16:17
[No subject] - by trackel - 2011-09-02, 17:16
[No subject] - by daniello - 2011-09-26, 17:45
imdb.de with eden beta - by daniello - 2011-12-30, 18:23
Bug with years - by cooper2k4 - 2012-02-17, 20:13
Logout Mark Read Team Forum Stats Members Help
German IMDB scraper, please test it and give feedback1