Excalibur Scraper - Close to being the best (with a little help) for +18 Adult Movies
Good evening. After looking at the Excalibur scraper, I realized it needs to be updated. Of course, I'm not able to correct the strings myself, but with a little help this scraper can become the best one for adult movies. The first thing I did was update the thumb link: much better quality, without the blue and white borders, and higher resolution =).

Result in XBMC:


As you can see, the new thumb is wonderful (not photoshopped, grabbed directly from Excalibur; I simply changed the thumb link in excalibur.xml).

Download the scraper: (updated 26 March 09)
Alright, gotta head to work but I'll post what I've done so far:
- Added the film runtime.
- Fixed the "Year" tag so it only picks up the year and not the whole date. (Notice your screenshot has a (6) next to the title because it's picking up the month part of the date.)
- Fixed the "Plot" tag so it doesn't pick up all the extra bs afterwards.
- Fixed the "Director" tag so it picks up the name of the director based on the name displayed, not the link. (I did this because there are a few low budget directors who Excalibur doesn't have profiles for; therefore, there aren't any links for them so the original script would pick up nothing.)
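To illustrate the "Year" fix from the list above, here is a minimal Python sketch (not scraper XML). It assumes the page shows the release date as mm/dd/yyyy; the sample string and the function names are made up for illustration, and the actual expression used in the scraper may differ.

```python
import re

# Hypothetical snippet; the real Excalibur markup is an assumption.
SAMPLE = "Released: 06/25/2008"

def extract_year(text):
    """Return the four-digit year from an mm/dd/yyyy date, or None."""
    m = re.search(r"\d{1,2}/\d{1,2}/(\d{4})", text)
    return m.group(1) if m else None

def first_number(text):
    """A too-loose pattern: grabs the first run of digits, i.e. the month."""
    m = re.search(r"(\d+)", text)
    return m.group(1) if m else None
```

Anchoring the capture group on the last part of the date is what keeps the month from ending up in the year field.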

Minor annoyances
- For some reason, I can't get the last quote of the plot to stop being picked up. Not really a big deal so I left it alone.

When I get back home I'll clean up a few things and see what else I can do (rating, cast, etc.).


<scraper name="Excalibur" content="movies" thumb="excalibur.jpg">
<NfoUrl dest="3">
<RegExp input="$$1" output="&lt;url&gt;http://www.excaliburfilms.com/AdultDVD/\1.htm&lt;/url&gt;" dest="3">
<expression noclean="1">excaliburfilms.com/AdultDVD/([^.]*)</expression>

<CreateSearchUrl dest="3">
<RegExp input="$$1" output="&lt;url&gt;http://search.excaliburfilms.com/SmartSearch_result.cfm?title_in=\1&lt;/url&gt;" dest="3">
<expression noclean="1"></expression>


<GetSearchResults dest="6">
<RegExp input="$$5" output="&lt;?xml version=&quot;1.0&quot; encoding=&quot;iso-8859-1&quot; standalone=&quot;yes&quot;?&gt;&lt;results&gt;\1&lt;/results&gt;" dest="6">
<RegExp input="$$1" output="\1" dest="4">
<expression>&lt;a href=&quot;([^/]*).htm&quot;&gt;</expression>
<RegExp input="$$1" output="&lt;entity&gt;&lt;title&gt;\1&lt;/title&gt;&lt;url&gt;http://www.excaliburfilms.com/AdultDVD/$$4.htm&lt;/url&gt;&lt;/entity&gt;" dest="5">
<expression clear="yes">Buy ([^D]*),</expression>
<RegExp input="$$1" output="&lt;entity&gt;&lt;title&gt;\2&lt;/title&gt;&lt;url&gt;http://www.excaliburfilms.com/AdultDVD/\1.htm&lt;/url&gt;&lt;/entity&gt;" dest="5+">
<expression repeat="yes">&lt;a href=&quot;http://www\.excaliburfilms\.com/AdultDVD/([^.]*)\.htm&quot; class=&quot;searchTitle12&quot;&gt;&lt;font class=&quot;searchTitle12&quot;&gt;([^&lt;]*)</expression>
<expression noclean="1"></expression>


<GetDetails dest="3">
<RegExp input="$$5" output="&lt;details&gt;\1&lt;/details&gt;" dest="3">
<RegExp input="$$1" output="&lt;thumb&gt;http://images.excaliburfilms.com/DVD/reviews/imagesCH3WY/largemoviepic/\1&lt;/thumb&gt;" dest="5">
<RegExp input="$$1" output="&lt;title&gt;\1&lt;/title&gt;" dest="5+">
<expression>Buy ([^D]*) </expression>
<RegExp input="$$1" output="&lt;actor&gt;&lt;name&gt;\1&lt;/name&gt;&lt;/actor&gt;" dest="5+">
<expression repeat="yes">&lt;a[^&gt;]*&gt; ([^&lt;]*)</expression>
<RegExp input="$$1" output="&lt;genre&gt;\1&lt;/genre&gt;" dest="5+">
<RegExp input="$$1" output="&lt;year&gt;\1&lt;/year&gt;" dest="5+">
<RegExp input="$$1" output="&lt;runtime&gt;\1 mins.&lt;/runtime&gt;" dest="5+">
<expression>Time: &lt;/font&gt;&lt;[^&gt;]*&gt;([0-9]*)</expression>
<RegExp input="$$1" output="&lt;plot&gt;\1&lt;/plot&gt;" dest="5+">
<RegExp input="$$1" output="&lt;studio&gt;\1&lt;/studio&gt;" dest="5+">
<RegExp input="$$1" output="&lt;director&gt;\1&lt;/director&gt;" dest="5+">
<expression>&gt;Director: &lt;/font&gt;[^&gt;]*&gt;([^&lt;]*)</expression>
<expression noclean="1"></expression>
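For anyone who wants to try the runtime and director expressions from the scraper above outside XBMC, here is a rough Python sketch. The two patterns are copied from the scraper with the XML entities decoded; the `PAGE` fragment is a made-up example, since the real Excalibur markup is only assumed here.

```python
import html
import re

# Patterns copied from the scraper, with &lt; / &gt; decoded back to < / >.
RUNTIME = html.unescape(r"Time: &lt;/font&gt;&lt;[^&gt;]*&gt;([0-9]*)")
DIRECTOR = html.unescape(r"&gt;Director: &lt;/font&gt;[^&gt;]*&gt;([^&lt;]*)")

# Hypothetical page fragment (assumption, not copied from the site).
PAGE = (
    '<font class="label">Time: </font><font class="value">135 mins.</font>'
    '<font class="label">Director: </font><font class="value">Jane Doe</font>'
)

def first_group(pattern, text):
    """Return the first capture group of the first match, or None."""
    m = re.search(pattern, text)
    return m.group(1) if m else None

runtime = first_group(RUNTIME, PAGE)
director = first_group(DIRECTOR, PAGE)
```

Because the director pattern skips over whatever tag follows the label and captures the text after it, it does not care whether that tag is a profile link or plain formatting.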
Whoa!! So much better. Can't wait to see the finished work.

- The "year" is OK now. I understand why I was getting numbers like "6" before, thanks.
- About the cast: are scrapers able to grab actress pictures, for example? I don't know whether scrapers have a feature for grabbing info from pages like http://www.excaliburfilms.com/pornlist/s..._Santi.htm

Also, I noticed 2 issues:

- The plot is not complete.

On your screenshot, we have:

Sophia Santi, Kortni Kiss, Sabrina Rose, Evie Delatosso, Savanah Gold. With Stoya, juicy boy/girl romps, anal, tons of tease and titty, My First Porn 10 has it all. Enjoy some first-class, good time Digital Playground humor interlaced between all the delicious stars, all of which makes My First Porn 10 insanely satisfying and extremely entertaining."

But here is the complete plot:

The’ My First Porn’ series is a comical sex-spectacle of Jack directing his first porn movie. Volume 10 is over 2 hours and features Stoya in a brand-new, amazing ass-a-liscious anal scene. The A-list cast includes sensational fuck stars, Sophia Santi, Kortni Kiss, Sabrina Rose, Evie Delatosso, Savanah Gold. With Stoya, juicy boy/girl romps, anal, tons of tease and titty, My First Porn 10 has it all. Enjoy some first-class, good time Digital Playground humor interlaced between all the delicious stars, all of which makes My First Porn 10 insanely satisfying and extremely entertaining.

2nd issue: could you look at the cast when you have time? The old Excalibur scraper missed the actress "Stoya", for example. I don't know why.

I have excalibur.jpg if you need it.

Thanks again, Bleckshire, for your help. Can't wait to see what you do next.

I'll take a look at the casting. The most I'd be able to grab from the star info would be a thumbnail for each, and that depends on how they file them. I'll look into it though. And the scraper actually pulls the complete plot. The reason that screenshot only shows the second half of it is that the plot scrolls in that view, so I took the shot once it had scrolled to the end. I didn't even look at the casting before because I just figured it was already picking up everything.

Guys: thank you for your effort and your fantastic scraper for adult content!

Okay, this is probably about where I'll stop. Let me go over a few of the ups and downs of what I've done with this scraper:

- As I said in my previous post, runtime was added, the year has been changed to detect the year and not the month (or the whole mm/dd/yyyy date as it originally was), the plot should pick up JUST the plot and nothing extra, and the director should pick up correctly now (see below for a comment on this).

What I've added since:

- I was able to get the stars' thumbnails for those listed in the cast. It should display the main picture on that star's profile page. Here's the image displayed on Sabrina Rose's profile page:


Here's a screenshot of the cast listing:


- That damn quote at the end of the plot was annoying the hell out of me so I rewrote the plot detection and figured out why it was doing that. Unfortunately, I couldn't remove it (at least not without compromising any other possible film plots I haven't seen yet) so instead, I got it to leave the opening quote, so now it detects the plot just as it is on the page, inside quotes.
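One way to picture the quote problem described above is as a clean-up step after the plot has been captured. This is only a Python sketch of the idea, not something the scraper does; the function name is made up. It removes the quotes only when both the opening and closing quote are present, so a plot that merely ends or starts with a quote is left alone.

```python
def strip_wrapping_quotes(plot):
    """Remove one pair of surrounding double quotes, if both are present."""
    plot = plot.strip()
    if len(plot) >= 2 and plot[0] == '"' and plot[-1] == '"':
        return plot[1:-1]
    return plot
```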

Now, for the bad:

- Artik, I did figure out why it wasn't picking up Stoya when pulling the cast. It's related to the same issue I was mentioning with the director. The original script was using links (a href tags) to pull cast and directors. As with the director, if Excalibur does not have a profile for the star/director, then the page obviously won't create a link and just has the star/director listed in plain text. So I did what I did with the director and was able to detect Stoya in our example film. Problem with that was that it wasn't detecting any of the other stars. With the way Excalibur lists the cast, there's really no way of hopping around each star if some have profiles and some don't (someone more proficient in regexp might be able to do it, but it still seems like it'd be a lot of work). Now the reason my change to the director works is because they have only one director listed. I'm sure if they had multiple directors listed, the script wouldn't pull both. So, I left it as is. It will detect all stars in the cast that have profile pages. Granted, there are some films that have a complete cast with no profiles at all. If it hits one of those, it won't pull any of the cast obviously.

- Couldn't pull the rating either since Excalibur doesn't actually list a numbered rating to pull. They just have 'star-5.jpg' for example. Not much I can do with that.
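For what it's worth, here is a rough Python sketch of the two limitations above, in case someone wants to take them further. It assumes the cast sits in one comma-separated run after a "Starring:" label and ends at a `<br>`, and that the rating image is named like `star-5.jpg`; the label, the sample markup, the names, and the image path are all made up for illustration. It is plain Python, not scraper XML, so it only shows the idea.

```python
import re

# Hypothetical cast and rating markup; the real Excalibur layout is an assumption.
SAMPLE = (
    'Starring: <a href="/pornlist/starpgs/Jane_Doe.htm">Jane Doe</a>, '
    'Sam Poe, <a href="/pornlist/starpgs/Alex_Roe.htm">Alex Roe</a><br>'
    '<img src="/images/star-5.jpg">'
)

# Same idea as the scraper's actor expression: profile slug, then display name.
PROFILE_LINK = re.compile(r'<a href="/pornlist/starpgs/([^.]*)[^>]*>([^<]*)')

def parse_cast(page):
    """Return (name, profile_slug or None) for linked and plain-text names."""
    m = re.search(r"Starring:\s*(.*?)<br>", page, re.S)
    if not m:
        return []
    segment = m.group(1)
    # Remember which display names have a profile page.
    slugs = {name.strip(): slug for slug, name in PROFILE_LINK.findall(segment)}
    plain = re.sub(r"<[^>]+>", "", segment)  # drop tags, keep the text
    names = [n.strip() for n in plain.split(",") if n.strip()]
    return [(n, slugs.get(n)) for n in names]

def rating_from_image(page):
    """Read the number out of a file name such as star-5.jpg, or None."""
    m = re.search(r"star-(\d+)\.jpg", page)
    return int(m.group(1)) if m else None
```

The cast is handled in two passes: first isolate the whole cast run, then strip the tags and split on commas, so names without a profile link are kept and simply get no thumbnail slug. Whether the number in the image name can be used directly as the rating depends on the scale Excalibur uses, which is not confirmed here.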

Looks like that's just about everything I can change. Hope that helps ya out, artik and anyone else who wanted to use this.

<scraper name="Excalibur" content="movies" thumb="excalibur.jpg">
<NfoUrl dest="3">
<RegExp input="$$1" output="&lt;url&gt;http://www.excaliburfilms.com/AdultDVD/\1.htm&lt;/url&gt;" dest="3">
<expression noclean="1">excaliburfilms.com/AdultDVD/([^.]*)</expression>

<CreateSearchUrl dest="3">
<RegExp input="$$1" output="&lt;url&gt;http://search.excaliburfilms.com/SmartSearch_result.cfm?title_in=\1&lt;/url&gt;" dest="3">
<expression noclean="1"></expression>


<GetSearchResults dest="6">
<RegExp input="$$5" output="&lt;?xml version=&quot;1.0&quot; encoding=&quot;iso-8859-1&quot; standalone=&quot;yes&quot;?&gt;&lt;results&gt;\1&lt;/results&gt;" dest="6">
<RegExp input="$$1" output="\1" dest="4">
<expression>&lt;a href=&quot;([^/]*).htm&quot;&gt;</expression>
<RegExp input="$$1" output="&lt;entity&gt;&lt;title&gt;\1&lt;/title&gt;&lt;url&gt;http://www.excaliburfilms.com/AdultDVD/$$4.htm&lt;/url&gt;&lt;/entity&gt;" dest="5">
<expression clear="yes">Buy ([^D]*),</expression>
<RegExp input="$$1" output="&lt;entity&gt;&lt;title&gt;\2&lt;/title&gt;&lt;url&gt;http://www.excaliburfilms.com/AdultDVD/\1.htm&lt;/url&gt;&lt;/entity&gt;" dest="5+">
<expression repeat="yes">&lt;a href=&quot;http://www\.excaliburfilms\.com/AdultDVD/([^.]*)\.htm&quot; class=&quot;searchTitle12&quot;&gt;&lt;font class=&quot;searchTitle12&quot;&gt;([^&lt;]*)</expression>
<expression noclean="1"></expression>


<GetDetails dest="3">
<RegExp input="$$5" output="&lt;details&gt;\1&lt;/details&gt;" dest="3">
<RegExp input="$$1" output="&lt;thumb&gt;http://images.excaliburfilms.com/DVD/reviews/imagesCH3WY/largemoviepic/\1&lt;/thumb&gt;" dest="5">
<RegExp input="$$1" output="&lt;title&gt;\1&lt;/title&gt;" dest="5+">
<expression>Buy ([^D]*) </expression>
<RegExp input="$$1" output="&lt;actor&gt;&lt;name&gt;\2&lt;/name&gt;&lt;thumb&gt;http://images.excaliburfilms.com/pornlist/starpicsCH3WY/\1.jpg&lt;/thumb&gt;&lt;/actor&gt;" dest="5+">
<expression repeat="yes">&lt;a href=&quot;/pornlist/starpgs/([^\.]*)[^&gt;]*&gt;([^&lt;]*)</expression>
<RegExp input="$$1" output="&lt;genre&gt;\1&lt;/genre&gt;" dest="5+">
<RegExp input="$$1" output="&lt;year&gt;\1&lt;/year&gt;" dest="5+">
<RegExp input="$$1" output="&lt;runtime&gt;\1 mins&lt;/runtime&gt;" dest="5+">
<expression>Time: &lt;/font&gt;&lt;[^&gt;]*&gt;([0-9]*)</expression>
<RegExp input="$$1" output="&lt;plot&gt;\1&lt;/plot&gt;" dest="5+">
<RegExp input="$$1" output="&lt;studio&gt;\1&lt;/studio&gt;" dest="5+">
<RegExp input="$$1" output="&lt;director&gt;\1&lt;/director&gt;" dest="5+">
<expression>&gt;Director: &lt;/font&gt;[^&gt;]*&gt;([^&lt;]*)</expression>
<expression noclean="1"></expression>
Amazing work, Bleckshire, many thanks again. But the cast doesn't display the actress pictures with the current code. Did you update it, or is there something special I need to do?

I'm pretty sure I pasted the correct one. I just ran a couple test movies and they all pulled up the pictures fine. Let me know which particular movie you noticed it on and I'll check it out. I'll paste again just in case I somehow did accidentally change it.

<scraper name="Excalibur" content="movies" thumb="excalibur.jpg">
<NfoUrl dest="3">
<RegExp input="$$1" output="&lt;url&gt;http://www.excaliburfilms.com/AdultDVD/\1.htm&lt;/url&gt;" dest="3">
<expression noclean="1">excaliburfilms.com/AdultDVD/([^.]*)</expression>

<CreateSearchUrl dest="3">
<RegExp input="$$1" output="&lt;url&gt;http://search.excaliburfilms.com/SmartSearch_result.cfm?title_in=\1&lt;/url&gt;" dest="3">
<expression noclean="1"></expression>


<GetSearchResults dest="6">
<RegExp input="$$5" output="&lt;?xml version=&quot;1.0&quot; encoding=&quot;iso-8859-1&quot; standalone=&quot;yes&quot;?&gt;&lt;results&gt;\1&lt;/results&gt;" dest="6">
<RegExp input="$$1" output="\1" dest="4">
<expression>&lt;a href=&quot;([^/]*).htm&quot;&gt;</expression>
<RegExp input="$$1" output="&lt;entity&gt;&lt;title&gt;\1&lt;/title&gt;&lt;url&gt;http://www.excaliburfilms.com/AdultDVD/$$4.htm&lt;/url&gt;&lt;/entity&gt;" dest="5">
<expression clear="yes">Buy ([^D]*),</expression>
<RegExp input="$$1" output="&lt;entity&gt;&lt;title&gt;\2&lt;/title&gt;&lt;url&gt;http://www.excaliburfilms.com/AdultDVD/\1.htm&lt;/url&gt;&lt;/entity&gt;" dest="5+">
<expression repeat="yes">&lt;a href=&quot;http://www\.excaliburfilms\.com/AdultDVD/([^.]*)\.htm&quot; class=&quot;searchTitle12&quot;&gt;&lt;font class=&quot;searchTitle12&quot;&gt;([^&lt;]*)</expression>
<expression noclean="1"></expression>


<GetDetails dest="3">
<RegExp input="$$5" output="&lt;details&gt;\1&lt;/details&gt;" dest="3">
<RegExp input="$$1" output="&lt;thumb&gt;http://images.excaliburfilms.com/DVD/reviews/imagesCH3WY/largemoviepic/\1&lt;/thumb&gt;" dest="5">
<RegExp input="$$1" output="&lt;title&gt;\1&lt;/title&gt;" dest="5+">
<expression>Buy ([^D]*) </expression>
<RegExp input="$$1" output="&lt;actor&gt;&lt;name&gt;\2&lt;/name&gt;&lt;thumb&gt;http://images.excaliburfilms.com/pornlist/starpicsCH3WY/\1.jpg&lt;/thumb&gt;&lt;/actor&gt;" dest="5+">
<expression repeat="yes">&lt;a href=&quot;/pornlist/starpgs/([^\.]*)[^&gt;]*&gt;([^&lt;]*)</expression>
<RegExp input="$$1" output="&lt;genre&gt;\1&lt;/genre&gt;" dest="5+">
<RegExp input="$$1" output="&lt;year&gt;\1&lt;/year&gt;" dest="5+">
<RegExp input="$$1" output="&lt;runtime&gt;\1 mins&lt;/runtime&gt;" dest="5+">
<expression>Time: &lt;/font&gt;&lt;[^&gt;]*&gt;([0-9]*)</expression>
<RegExp input="$$1" output="&lt;plot&gt;\1&lt;/plot&gt;" dest="5+">
<RegExp input="$$1" output="&lt;studio&gt;\1&lt;/studio&gt;" dest="5+">
<RegExp input="$$1" output="&lt;director&gt;\1&lt;/director&gt;" dest="5+">
<expression>&gt;Director: &lt;/font&gt;[^&gt;]*&gt;([^&lt;]*)</expression>
<expression noclean="1"></expression>
Please post a Trac ticket (diff), then I will include it in SVN.
Bleckshire Wrote:I'm pretty sure I pasted the correct one. I just ran a couple test movies and they all pulled up the pictures fine. Let me know which particular movie you noticed it on and I'll check it out. I'll paste again just in case I somehow did accidentally change it.

I tried it on My First Porn 10.
artik Wrote:I tried it on My First Porn 10.

[offtopic] artik: mind telling me which skin you were using when you took the screenshots in your posts? It looks like a really cool skin.

Thanks.
azido Wrote:[offtopic] artik: mind telling me which skin you were using when you took the screenshots in your posts? It looks like a really cool skin.

Thanks.

MediaStream 0.99, amazing skin.

artik Wrote:MediaStream 0.99, amazing skin.


Indeed! Thanks for the link, bro.
spiff Wrote:Please post a Trac ticket (diff), then I will include it in SVN.

Let's work on it a little more, and then that will be done!
Bleckshire Wrote:I'm pretty sure I pasted the correct one. I just ran a couple test movies and they all pulled up the pictures fine. Let me know which particular movie you noticed it on and I'll check it out. I'll paste again just in case I somehow did accidentally change it.

It doesn't seem to work here (I refreshed My First Porn 10 and tried a few other movies), but plot, runtime, director, and everything else work:

