[Release] TheAudioDb.com Music Video Scraper

  Thread Rating:
  • 4 Vote(s) - 2.5 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Post Reply
roura97311 Offline
Junior Member
Posts: 12
Joined: Mar 2017
Reputation: 0
Location: French Guiana
Post: #256
Thanks for your reply but it does not work either with shy m

So When i modify the scrapper's code like this

<RegExp input="$$5" output="\1" dest="3">
<RegExp input="$$1" output="\1\\&apos;\2" dest="1">
<expression repeat="yes">((?:[^%]*(?:%20))*[^%]*)(?:%27)((?:[^%]*(?:%20))*[^%]*)</expression>
</RegExp>


<RegExp input="$$5" output="\1" dest="3">
<RegExp input="$$1" output="\1\\&apos;\2" dest="1">
<expression repeat="yes">((?:[^%]*(?:%20))*[^%]*)(?:')((?:[^%]*(?:%20))*[^%]*)</expression>
</RegExp>

Artist sh'ym was found. What do you think ?
(This post was last modified: 2017-03-09 17:45 by roura97311.)
find quote
roura97311 Offline
Junior Member
Posts: 12
Joined: Mar 2017
Reputation: 0
Location: French Guiana
Post: #257
(2017-03-09 16:52)DarkHelmet Wrote:  You could try leaving the apostrophe away from the name and make Shy m out of it. I got "Guns N' Roses" tracks to work as "Guns N Roses". However Noel Gallagher's High Flying Birds fails whatever i do.

With my modify

Noel Gallagher's High Flying Birds - Everybody's on the Run

is found and scrapp
find quote
docwra Offline
Fan
Posts: 482
Joined: Nov 2014
Reputation: 20
Post: #258
@roura97311 Thanks!

If people can agree that the REGEX is improved this way, then I will merge it into the Add-on. We need some more testing though.

EDIT: Also I fixed the albums you reported, it was because the apostrophe uses a strange UTF8 character.

Code:
http://www.theaudiodb.com/api/v1/json/18626d636d76696473706d/searchalbum.php?s=Keen%27v&a=L%C3%A0%20o%C3%B9%20le%20vent%20me%20m%C3%A8ne
Now works.

[Image: cook.png] Formally chef Zag
(This post was last modified: 2017-03-10 17:52 by docwra.)
find quote
ferent Offline
Fan
Posts: 400
Joined: Sep 2010
Reputation: 0
Post: #259
Cmon! we are getting closer! Great team work, this is a real community!
find quote
roura97311 Offline
Junior Member
Posts: 12
Joined: Mar 2017
Reputation: 0
Location: French Guiana
Post: #260
(2017-03-10 17:49)docwra Wrote:  @roura97311 Thanks!

If people can agree that the REGEX is improved this way, then I will merge it into the Add-on. We need some more testing though.

EDIT: Also I fixed the albums you reported, it was because the apostrophe uses a strange UTF8 character.

Code:
http://www.theaudiodb.com/api/v1/json/18626d636d76696473706d/searchalbum.php?s=Keen%27v&a=L%C3%A0%20o%C3%B9%20le%20vent%20me%20m%C3%A8ne
Now works.

Thanks too with my albums reported. now it's fine.
But i have a question about the link between musiclibrary database and musicvideo database.

because the singer with apostrophe appear not link in the two base respectly

heuuu sorry for my stupid question but where i download the news version of scrapper :-)

ONE THING VERY IMPORTANT
the apostrophe is like this Shy'm and not like this Shy’m

the code is not the same %27 for the first and %E2%80%99 for the second
(This post was last modified: 2017-03-10 21:49 by roura97311.)
find quote
docwra Offline
Fan
Posts: 482
Joined: Nov 2014
Reputation: 20
Post: #261
Has anyone tested this modification yet with a large library? How many results do they get?

I will test at some point this week and release a new version of this scraper if successful.

[Image: cook.png] Formally chef Zag
find quote
ferent Offline
Fan
Posts: 400
Joined: Sep 2010
Reputation: 0
Post: #262
I didn't count how many I had before and how many after, but now it works much much better finding the music videos. Only the videos with only numbers and with apostrophes remains.

Good work.
find quote
roura97311 Offline
Junior Member
Posts: 12
Joined: Mar 2017
Reputation: 0
Location: French Guiana
Post: #263
For my part, I have no problem with apostrophes for now.
Can you give me the link of the new version of the scrapper. Thank you
(This post was last modified: 2017-03-15 14:22 by roura97311.)
find quote
DarkHelmet Offline
Posting Freak
Posts: 822
Joined: Jul 2008
Reputation: 6
Location: Hamburg, Germany
Post: #264
(2017-03-09 16:49)DarkHelmet Wrote:  
(2017-03-09 16:29)docwra Wrote:  
(2017-03-09 16:20)DarkHelmet Wrote:  I have two tracks too that I cannot get to be scraped:

Prince - 1999.mp4
Phoenix - 1901.mp4

This is just a wild guess, but maybe the song title, a number in this case, gets misinterpreted as a year?

http://pastebin.com/9ePFzw52 log for the Prince title

http://pastebin.com/R1ckHtzX for the Phoenix title

Yep looks like the track name is empty when sent to the TADB API.

I'm guessing the scraper regex is removing numbers for some reason.

Can you bring this to the right guy's attention or should I? I'd make a trac report then.

I created a ticket on trac.

http://trac.kodi.tv/ticket/17395
find quote
DarkHelmet Offline
Posting Freak
Posts: 822
Joined: Jul 2008
Reputation: 6
Location: Hamburg, Germany
Post: #265
All scraper related get automatically assigned to Olympia. If I may ask, is Olympia still active in scraper develeopment? I checked the last scraper related bug reports on trac and at least on trac there was no reaction by anyone to these reports.
find quote
docwra Offline
Fan
Posts: 482
Joined: Nov 2014
Reputation: 20
Post: #266
OK finally got some time to test this modified regex, Results as follows:

C:\Encode\MViDS\a\*.mp4 = 758 Items
Apostrophe in artist name = 2
Apostrophe in track name = 20
No Apostrophe = 736

Kodi with modified regex = 683 scraped successfully [94 Seconds]
Kodi with unmodified regex = 681 scraped successfully [94 Seconds]

So yes this modification fixes the artist name apostrophe with no adverse effects. The title apostrophe was already working as far as I can tell.

Before
[Image: screenshot003.jpg]

After
[Image: screenshot002.png]

[Image: cook.png] Formally chef Zag
(This post was last modified: 2017-03-21 23:32 by docwra.)
find quote
ferent Offline
Fan
Posts: 400
Joined: Sep 2010
Reputation: 0
Post: #267
Woooow, congratulations, much much better now, it detected 26 new music videos in my library.

Many thanks.
(This post was last modified: 2017-03-21 23:26 by ferent.)
find quote
docwra Offline
Fan
Posts: 482
Joined: Nov 2014
Reputation: 20
Post: #268
No problems, nice team effort!

New version uploaded here:

https://github.com/zag2me/metadata.music...udiodb.com

[Image: cook.png] Formally chef Zag
find quote
ferent Offline
Fan
Posts: 400
Joined: Sep 2010
Reputation: 0
Post: #269
http://www.theaudiodb.com/album/2130009
I think there is a problem with that album, could you review it please? none of the songs are detected and is the only one that remains in my library.

Thanks
(This post was last modified: 2017-03-23 04:36 by ferent.)
find quote
Ferkner Offline
Junior Member
Posts: 47
Joined: Oct 2015
Reputation: 0
Location: Ottawa, ON
Post: #270
I have two videos that won't scrape. Both AC/DC songs (Thunderstruck and Back in Black). In the filename it's spelled ACDC, which is the alternate title for the band on theaudiodb, but the scraper isn't finding them. I know this was an issue in the past, but I didn't see if it ever got fixed.

Two years ago Zag addressed it with the following API links:

Code:
http://www.theaudiodb.com/api/v1/json/1/searchtrack.php?s=ac/dc&t=thunderstruck

If you go to that one in the browser, you see it returns a bunch of info. But the below one which was said to have worked, doesn't anymore:

Code:
http://www.theaudiodb.com/api/v1/json/1/searchtrack.php?s=acdc&t=thunderstruck

For reference, the filenames for the videos are ACDC - Thunderstruck and ACDC - Back in Black (both .mp4). Anyone get these working? If I make an NFO for these two, so I have to get these two videos to be scraped locally, and provide the album art myself?
find quote
Post Reply