Failure to scrape movies (IMDB scraper)
#1
Hi,

I'm having issues scraping a number of movies. Most of the movies scrape correctly but there are a few that don't scrape. I have tried removing from the library movies that have scraped correctly and ones that don't, and after refreshing the library, I get the same results.

A few examples of movies that don't scrape correctly:
The Fish child - http://www.imdb.com/title/tt1235842/
See What I'm Saying The Deaf Entertainers Documentary (2010) - http://www.imdb.com/title/tt1125387/
Journal of a Contract Killer (2008) - http://www.imdb.com/title/tt1043695/

Cheers.

Code:
00:15:49 T:716 M:1002770432   DEBUG: VideoInfoScanner: No NFO file found. Using title search for 'smb://coconut/Volume_1/Video/Movies/Whatever Works (2009).avi'
00:15:49 T:716 M:1002725376   DEBUG: CIMDB::InternalFindMovie: Searching for 'whatever works' using IMDb scraper (path: 'C:\Users\svarela\AppData\Roaming\XBMC\addons\metadata.imdb.com', content: 'movies', version: '2.1.4')
00:15:49 T:716 M:1002725376   DEBUG: scraper: CreateSearchUrl returned <url>http://akas.imdb.com/find?s=tt;q=whatever%20works%20(2009)</url>
00:15:49 T:716 M:1002725376   DEBUG: FileCurl::Open(0610F16C) http://akas.imdb.com/find?s=tt;q=whatever%20works%20(2009)
00:15:50 T:2892 M:1002717184   DEBUG: SECTION:UnloadDelayed(DLL: special://xbmcbin/system/ImageLib.dll)
00:15:50 T:2892 M:1002717184   DEBUG: Win32DllLoader::Unload ImageLib.dll
00:15:53 T:716 M:1003180032   DEBUG: FileCurl::Close(0610F16C) http://akas.imdb.com/title/tt1178663/?fr=c2M9MXxsbT01MDB8ZmI9dXx0dD0xfG14PTIwfGh0bWw9MXxjaD0wfGNvPTB8cG49MHxmdD0wfGt3PTB8cXM9d2hhdGV2ZXIgd29ya3MgKDIwMDkpfHNpdGU9YWthfHE9d2hhdGV2ZXIgd29ya3N8bm09MA__;fc=1;ft=20
00:15:53 T:716 M:1003180032   DEBUG: scraper: GetSearchResults returned <?xml version="1.0" encoding="iso-8859-1" standalone="yes"?><results><entity><title>Whatever Works</title><year>2009</year><url cache="-main.html">http://akas.imdb.com/title//</url><id></id></entity></results>
00:15:53 T:716 M:1003134976   DEBUG: FileCurl::Open(0610F168) http://akas.imdb.com/title//
00:15:53 T:716 M:1003102208 WARNING: XFILE::CFileCurl::CReadState::FillBuffer: curl failed with code 22
00:15:53 T:716 M:1003102208   ERROR: CFileCurl::CReadState::Open, didn't get any data from stream.
00:15:53 T:716 M:1003102208   DEBUG: FileCurl::Close(0610F168) http://akas.imdb.com/title//
00:15:53 T:716 M:1003102208   ERROR: ADDON::CScraper::Run: Unable to parse web site
00:15:53 T:716 M:1003102208   DEBUG: VideoInfoScanner: Adding new item to movies:smb://coconut/Volume_1/Video/Movies/Whatever Works (2009).avi
00:15:53 T:716 M:1003102208   DEBUG: CVideoDatabase::GetMovieId (smb://coconut/Volume_1/Video/Movies/Whatever Works (2009).avi), query = select idMovie from movie where idFile=87
00:15:53 T:716 M:1004269568   DEBUG: Previous line repeats 1 times.
00:15:53 T:716 M:1004269568   DEBUG: CVideoDatabase::GetMovieId (smb://coconut/Volume_1/Video/Movies/Winter's Bone (2010).mkv), query = select idMovie from movie where idFile=867
00:15:54 T:716 M:1004269568   DEBUG: CVideoDatabase::GetMovieId (smb://coconut/Volume_1/Video/Movies/Yes Man (2008).avi), query = select idMovie from movie where idFile=611
00:15:54 T:716 M:1004269568   DEBUG: CVideoDatabase::GetMovieId (smb://coconut/Volume_1/Video/Movies/You Don't Know Jack (2010).avi), query = select idMovie from movie where idFile=88
00:15:54 T:716 M:1004580864   DEBUG: VideoInfoScanner: Finished adding information from dir smb://coconut/Volume_1/Video/Movies/
Reply
#2
Since I had the same issue with file names containing "?", can you perhaps try by removing the parenthesis around the year?
Reply
#3
I had that problem with a couple movies. Some were known on IMDB by alterante titles, either in English, or in another language.

The Fish Child = El niño pez
Journal of.... = Egy bérgyilkos naplója

For that doc, maybe truncated the titel, liek "See What I'm Saying" or soemthing.
Reply
#4
Star 
Skulking around the forums I picked up this useful tidbit...scraper set for IMDB

In the information scraper... pick manual and use the IMDB# "tt1043695" without the quotes. it's exact and grabs precisely that one.. at least in Dharma 10.1

So if the IMDB has it.. you can get it.
Reply

Logout Mark Read Team Forum Stats Members Help
Failure to scrape movies (IMDB scraper)0