Req Wikipedia as source
#1
Hi, I'm a bit confused why nobody scrapes wikipedia? IMHO it's a pretty good choice:
  • API support
  • lightweight Wikitext or wikidata structures
  • great coverage of artists, musicians, CDs, albums, movies, photos, ...
  • CC licensed content -> legal offline support/dumps

So what are the reasons that it's currently skipped?
Reply
#2
Wikipedia doesn't have standards for movies and music, so you are not guaranteed to get the correct fields via the API

http://www.onemusicapi.com/blog/2014/09/...m-artwork/

http://stackoverflow.com/questions/55838...c-category

http://stackoverflow.com/questions/45649...-wikipedia

There are many much better alternatives for music and movies scraping

http://en.wikipedia.org/wiki/List_of_onl..._databases

http://en.wikipedia.org/wiki/List_of_online_databases
Reply
#3
I second a wikipedia scraper, PLEX does this for me but i can't find how to export the data to XBMC it's a feature i miss
Yeah, Me, Myself, and I, The Three Musketeers
Image
Reply
#4
I need this!

I have shows from the 50s and foreign countries that are not in TheTVDB or TVRage, but all are in Wikipedia. I've been working around this using local nfo files but this is very tedious and reinventing the wheel since all the data is in the episode lists on Wikipedia. I need a scraper I can set for a specific show that will only "scrape" once (no need to do it on a periodic basis) to load all the info and get Kodi to accept it as a real show. Plex does this perfectly.
Reply

Logout Mark Read Team Forum Stats Members Help
Wikipedia as source0