Login at Kodi Home

pko66 · 2007-10-16, 21:47

I don't know if this has been already suggested (a search for "opensubtitles scraper" here gave me no results), but I think http://www.opensubtitles.org could be a great aid to make a scraper (nothing related with subtitles, although it could be nice to be able to download the subtitles with it).

Opensubtitles has an algorithm to identify a file (it gives a hash for the file) and, based in that hash, users indicate the IMDB ID and then upload a subtitle. Right now, their database contains IDs for 411649 files, which translates into 28561 movies (any TV show episode is referenced to de IMDB ID of the whole TV show, that is one of the reasons for so few movies for so much files). Also, there are many different releases and different (foreign) titles/dubbings for the same movie.

It is of course not exhaustive but could be really valuable as an aid when a file is not correctly identified (because it is not correctly named, or there are some movies with the same title, or maybe it has not the english title).

Also, storing the hash in the database could be great to identify the movie even if it is moved to another path or renamed, since the hash is specific to the file. Also, maybe the hash stored could be used by a script to get the subtitles, without the need to calculate the hash from a samba link in python that I think is problematic...

pko66 · 2007-10-17, 18:45

pko66 Wrote:I don't know if this has been already suggested (a search for "opensubtitles scraper" here gave me no results), but I think http://www.opensubtitles.org could be a great aid to make a scraper (nothing related with subtitles, although it could be nice to be able to download the subtitles with it).

Opensubtitles has an algorithm to identify a file (it gives a hash for the file) and, based in that hash, users indicate the IMDB ID and then upload a subtitle. Right now, their database contains IDs for 411649 files, which translates into 28561 movies (any TV show episode is referenced to de IMDB ID of the whole TV show, that is one of the reasons for so few movies for so much files). Also, there are many different releases and different (foreign) titles/dubbings for the same movie.

It is of course not exhaustive but could be really valuable as an aid when a file is not correctly identified (because it is not correctly named, or there are some movies with the same title, or maybe it has not the english title).

Also, storing the hash in the database could be great to identify the movie even if it is moved to another path or renamed, since the hash is specific to the file. Also, maybe the hash stored could be used by a script to get the subtitles, without the need to calculate the hash from a samba link in python that I think is problematic...

I think this message has been incorrectly moved here by an administrator... I am not speaking about a python script for subtitles but about a scraper to identify movies, and so it would more properly reside in the forum and thread where it was originally posted by me :confused2: the fact it refers to the same website as the script it is here been discussed is merely coincidental.

smokeyfingaz · 2007-12-13, 15:28

hi all, i remember there was discussion of a script to dl subs from opensubtitles.org - has zhis been develloped yet, or has progress been stopped? how hard would it be? is it impossible?

[2ge] · 2007-12-15, 19:09

Hello,

I am checking this thread from time to time, it is pitty, nothing new here Sad

I wish I knew python, so I should help, but I can't. Ok, from my side, I created trac/wiki for opensubtitles, so here you can find some important info:

http://trac.opensubtitles.org/projects/opensubtitles
Look section for developers. Also, in XMLRPC method CheckMovieHash should be used for identifying movie files (getting IMDB ID for video files based on hash).

Is there someone who will code downloading script for XBMC ? I think Scraper.xml is not possible to use (surely not for working with hashes, which is base idea).

jpam · 2007-12-16, 17:06

if you are dutch i've made a subtitle downloader for the xbox long ago
it runs as script on the xbox
you can connect to 2 different sites
http://www.ondertitel.com/
http://www.nlondertitels.com/ (very slow)
you can search via movietitle on your harddisk or search manual

subtitles will be automatic renamed as your movietitles
if you want it i can send you the script

jpam

sheppie · 2008-01-03, 10:50

jpam Wrote:if you are dutch i've made a subtitle downloader for the xbox long ago
it runs as script on the xbox
you can connect to 2 different sites
http://www.ondertitel.com/
http://www.nlondertitels.com/ (very slow)
you can search via movietitle on your harddisk or search manual

subtitles will be automatic renamed as your movietitles
if you want it i can send you the script

jpam

Yes that would be nice..

HarshReality · 2008-01-10, 08:17

Agreed, could/should be easily modified to search for specific lang files (en, esp.). In a perfect world... it would somehow draw the information from the GUI setting of the xbox but that I think would be putting cart before horse...

Gamester17 · 2008-01-10, 12:40

If someone like to volunteer to write this script them may I suggest that you look at the script called "undertexter.se" and then maybe use that as a base to start from. As it is a python script for XBMC that downloads (swedish) subtitles from a swedish website, and those subtitles are compressed as ZIP or RAR so that script might be of good to learn from how to download and extract compressed as ZIP or RAR from a website and then extract and rename them correctly.
http://forum.xbmc.org/showthread.php?tid=23309
http://www.xbmcscripts.com/index.php?opt...me&dir=ASC