2009-08-05, 13:11
Hello All,
I'm working on creating a Rotten Tomatoes scraper and I've hit a small hurdle. RT redirects based on your IP address, so I only have access to the Australian site. I'm not sure if there's a proxy I could use to get around it somehow, but I thought I'd ask for some help from some of you US and UK folks.
There will be a setting in the scraper to choose which rating (MPAA, OFLC or BBFC) is retrieved, so I need localized samples of the HTML from the US and UK versions of the RT site. The section of code I am after is:
Code:
<div id="movie_stats">
<div class="fl">
<p>
<span class="label">Australian Rating:</span>
<span class="content">M <a style="font-weight: normal;" class="movie_rating_reason" id="movie_rating_work" href="javascript:void(0);">[See Full Rating]</a>
<span class="movie_rating_reason" style="display: none"> Frequent action violence</span> </span>
</p><p><span class="label">Runtime:</span> <span class="content">2 hrs 33 mins</span></p><p><span class="label">Genre:</span> <span class="content"><a href="/movie/browser.php?genre=200001">Action/Adventure</a></span></p> </div>
<div class="fl">
<p><span class="label">Australian Theatrical Release:</span><br /><span class="content">Jul 16, 2008 Wide</span></p> <p><span class="label">US Box Office:</span> <span class="content"><a href="/m/the_dark_knight/numbers.php">$533,316,061</a></span></p> </div>
</div>
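To show what the scraper would do with a block like this, here is a minimal sketch in Python (not the scraper's actual code) that pulls out the rating and the rating reason for a chosen rating system. Only the "Australian Rating:" label is confirmed by the sample above. The `RATING_LABELS` mapping, the MPAA and BBFC label strings, and the `extract_rating` function name are assumptions made for illustration, which is exactly why the US and UK samples are needed.

```python
import re

# Hypothetical mapping from the scraper setting to the label text on the page.
# Only "Australian Rating:" is confirmed by the sample; the other two are guesses.
RATING_LABELS = {
    "OFLC": "Australian Rating:",
    "MPAA": "MPAA Rating:",  # assumption, needs a US sample to confirm
    "BBFC": "UK Rating:",    # assumption, needs a UK sample to confirm
}

def extract_rating(html, system="OFLC"):
    """Return (rating, reason) for the given system, or None if the label is absent.

    reason is None when the paragraph has no hidden movie_rating_reason span.
    """
    label = re.escape(RATING_LABELS[system])
    pattern = (
        r'<span class="label">' + label + r'</span>\s*'
        # The rating itself is the first run of text inside the content span.
        r'<span class="content">\s*([^<]+?)\s*<'
        # Optionally pick up the hidden reason span, without leaving this <p>.
        r'(?:(?:(?!</p>).)*?<span class="movie_rating_reason"[^>]*>'
        r'\s*([^<]*?)\s*</span>)?'
    )
    match = re.search(pattern, html, re.DOTALL)
    if match is None:
        return None
    return match.group(1), match.group(2)
```

Called on the Australian sample above with the default `system="OFLC"`, this should give the rating `M` and the reason `Frequent action violence`. If the US and UK pages turn out to use different markup rather than just a different label, the pattern would need adjusting as well.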
For those wanting a progress report, I have Synopsis, Running Time, Year, Director, Actors, Rating (RT Score in %) and Votes all scraping successfully. I also have Australian Ratings and Rating Reason working.
Any help with the HTML samples is appreciated. Thanks.