Rotten Tomatoes Scraper
#1
Hello All,

I'm working on creating a Rotten Tomatoes scraper and I've hit a small hurdle. RT redirects based on your IP address, so I only have access to the Australian site. I'm not sure if there's a proxy I could use to get around it somehow, but I thought I'd ask for some help from some of you US and UK folks.

There will be a setting in the scraper to change which rating (MPAA, OFLC or BBFC) is retrieved, so pretty much I need some localized samples of HTML from the RT site. The section of code I am after is

Code:
<div id="movie_stats">
    <div class="fl">
      <p>
        <span class="label">Australian Rating:</span>
        <span class="content">M <a style="font-weight: normal;" class="movie_rating_reason" id="movie_rating_work" href="javascript:void(0);">[See Full Rating]</a>
            <span class="movie_rating_reason" style="display: none"> Frequent action violence</span>        </span>

      </p><p><span class="label">Runtime:</span> <span class="content">2 hrs 33 mins</span></p><p><span class="label">Genre:</span> <span class="content"><a href="/movie/browser.php?genre=200001">Action/Adventure</a></span></p>    </div>
    <div class="fl">
      <p><span class="label">Australian Theatrical Release:</span><br /><span class="content">Jul 16, 2008 Wide</span></p>                  <p><span class="label">US Box Office:</span> <span class="content"><a href="/m/the_dark_knight/numbers.php">$533,316,061</a></span></p>    </div>

  </div>

For those wanting a progress report I have Synopsis, Running Time, Year, Director, Actors, Rating (RT Score in %) and Votes all scraping successfully. I also have Australian Ratings and Rating Reason working.

Any help with the HTML samples is appreciated. Thanks.
Reply


Messages In This Thread
Rotten Tomatoes Scraper - by seedzero - 2009-08-05, 13:11
[No subject] - by seedzero - 2009-08-06, 04:24
[No subject] - by seedzero - 2009-08-06, 12:58
[No subject] - by seedzero - 2009-08-12, 15:54
[No subject] - by spiff - 2009-08-12, 16:07
[No subject] - by seedzero - 2009-08-13, 00:19
[No subject] - by spiff - 2009-08-13, 10:59
[No subject] - by seedzero - 2009-08-13, 15:44
[No subject] - by seedzero - 2009-08-15, 14:44
[No subject] - by blacklist - 2009-08-15, 16:46
[No subject] - by seedzero - 2009-08-16, 01:47
[No subject] - by rausch101 - 2009-10-21, 05:52
[No subject] - by seedzero - 2009-10-21, 06:00
[No subject] - by rausch101 - 2009-10-21, 06:05
[No subject] - by phonics - 2009-10-21, 07:01
[No subject] - by muzzakus - 2009-10-28, 04:58
[No subject] - by seedzero - 2009-10-29, 01:19
[No subject] - by shadylog - 2009-12-17, 17:23
[No subject] - by mkortstiege - 2009-12-17, 18:51
[No subject] - by seedzero - 2009-12-17, 22:48
[No subject] - by shadylog - 2010-01-09, 19:14
[No subject] - by mkortstiege - 2010-01-09, 19:34
[No subject] - by teddy6565 - 2010-04-17, 12:04
[No subject] - by teddy6565 - 2010-04-18, 02:44
[No subject] - by seedzero - 2010-10-19, 12:06
[No subject] - by seedzero - 2010-10-25, 01:33
[No subject] - by rausch101 - 2010-10-25, 04:28
[No subject] - by kneufeld - 2010-10-25, 04:47
[No subject] - by gabbott - 2010-10-25, 04:59
[No subject] - by olympia - 2010-10-25, 06:47
[No subject] - by seedzero - 2010-10-25, 08:38
[No subject] - by olympia - 2010-10-25, 10:50
[No subject] - by booker88 - 2010-11-26, 04:31
[No subject] - by seedzero - 2010-11-26, 08:13
[No subject] - by booker88 - 2010-11-27, 09:41
[No subject] - by mortstar - 2011-03-31, 18:26
[No subject] - by Flicker - 2011-04-15, 10:59
[No subject] - by NotShorty - 2011-04-19, 01:41
[No subject] - by seedzero - 2011-04-19, 04:07
[No subject] - by sourbob - 2011-06-20, 22:07
Logout Mark Read Team Forum Stats Members Help
Rotten Tomatoes Scraper0