2013-02-10, 22:16
Hi Olympia.
I have noticed that contrary to the (now broken) IMDB scraper, the Universal scraper does not bring back matches of the following types:
Note, for the type, it's important to look at the 'suffix' on the http://akas.imdb.com/find?q=... search page, in order to know what string gets matched in the regex. It is sometimes different from what you see on the detail page for the item...
In order to fix the issue, I updated the regular expression on line 55 in universal.xml as follows:
Any chance this gets updated in a future version?
I have noticed that contrary to the (now broken) IMDB scraper, the Universal scraper does not bring back matches of the following types:
- "TV Mini-Series" e.g. http://akas.imdb.com/title/tt0074006/
- "TV Series" e.g. http://akas.imdb.com/title/tt0826760
- "TV Short" e.g. http://akas.imdb.com/title/tt0897387
Note, for the type, it's important to look at the 'suffix' on the http://akas.imdb.com/find?q=... search page, in order to know what string gets matched in the regex. It is sometimes different from what you see on the detail page for the item...
In order to fix the issue, I updated the regular expression on line 55 in universal.xml as follows:
Code:
<expression repeat="yes" noclean="1,2"><td\sclass="result_text">\s<a\shref="/title/([t0-9]*)/[^>]*>(?:&#x22;)?([^<]*?)(?:&#x22;)?</a>\s*(?:\([IV]+\) )?\([^\(]*?([0-9]{4})[^\)]*\)\s(?:\(TV Movie\)\s|\(TV Special\)\s|\(TV Series\)\s|\(TV Mini-Series\)\s|\(TV Short\)\s|\(TV Episode\)\s|\(Video\)\s|\(Short\)\s)?<</expression>
Any chance this gets updated in a future version?