2017-06-25, 18:13
What help would you need from my side?
<ParseMBAlbumTitle dest="5">
<RegExp input="$$2" output="<details>\1</details>" dest="5">
<RegExp input="$$1" output="<musicBrainzAlbumID>\1</musicBrainzAlbumID><title>\2</title>" dest="2">
<expression noclean="1"><release id="([^"]*)"><title>([^<]*)<</expression>
</RegExp>
<!--MBID - release group-->
<RegExp input="$$1" output="<musicbrainzreleasegroupid>\1</musicbrainzreleasegroupid>" dest="2+">
<expression noclean="1"><release-group\stype="[^"]*"\sid="([^"]*)"</expression>
</RegExp>
<RegExp input="$$1" output="<musicbrainzreleasegroupid>\1</musicbrainzreleasegroupid>" dest="2+">
<expression noclean="1"><release-group\stype="[^"]*"\stype-id="[^"]*"\sid="([^"]*)"</expression>
</RegExp>
<RegExp input="$$1" output="<musicbrainzreleasegroupid>\1</musicbrainzreleasegroupid>" dest="2+">
<expression noclean="1"><release-group\sid="([^"]*)"\stype="[^"]*"</expression>
</RegExp>
<RegExp input="$$1" output="<musicbrainzreleasegroupid>\1</musicbrainzreleasegroupid>" dest="2+">
<expression noclean="1"><release-group\sid="([^"]*)"\stype-id="[^"]*"</expression>
</RegExp>
<RegExp input="$$1" output="<musicbrainzreleasegroupid>\1</musicbrainzreleasegroupid>" dest="2+">
<expression noclean="1"><release-group\stype-id="[^"]*"\sid="([^"]*)"</expression>
</RegExp>
<RegExp input="$$1" output="<musicbrainzreleasegroupid>\1</musicbrainzreleasegroupid>" dest="2+">
<expression noclean="1"><release-group\stype-id="[^"]*"\stype="[^"]*"\sid="([^"]*)"</expression>
</RegExp>
<expression noclean="1">(.+)</expression>
</RegExp>
</ParseMBAlbumTitle>
<GetAlbumSearchResults dest="8">
<RegExp input="$$5" output="<results sorted="yes">\1</results>" dest="8">
<RegExp input="$$1" output="<entity><year>\5-\4-T#\6</year><artist>\3</artist><title>\2</title><url cache="mb-\1-album.xml">$INFO[mbsite]/ws/2/release/\1?inc=recordings+release-groups+artists+labels+ratings</url></entity>" dest="5">
<expression repeat="yes">id="([^"]*)"><title>([^<]*)</title><status>Official</status>(?:<packaging>[^<]*</packaging>)?<text-representation>(?:<language>[^<]*</language>)*(?:<script>[^<]*</script>)*</text-representation><artist-credit><name-credit(?:>)*(?:\sjoinphrase="[^"]*">)*(?:<name>[^<]*</name)*(?:>)*<artist\sid="[^"]*"><name>([^<]*)</name>(?:<sort-name>[^<]*</sort-name>)*(?:<disambiguation>[^<]*</disambiguation>)*(?:<alias-list>(?:<alias[^<]*</alias>)*</alias-list>)*</artist></name-credit>(?:<name-credit(?:>)*(?:\sjoinphrase="[^"]*">)*(?:<name>[^<]*</name)*(?:>)*<artist\sid="[^"]*"><name>[^<]*</name>(?:<sort-name>[^<]*</sort-name>)*(?:<disambiguation>[^<]*</disambiguation>)*(?:<alias-list>(?:<alias[^<]*</alias>)*</alias-list>)*</artist></name-credit>)*</artist-credit><release-group\stype="Album"\sid="[^"]*">(?:<primary-type>[^<]*</primary-type>)*(?:<secondary-type-list>(?:<secondary-type>[^<]*</secondary-type>)+</secondary-type-list>)*</release-group><date>(\d{4})[^<]*</date>(?:<country>)*([^<]*)?.*?<track-list\scount="(\d+)</expression>
</RegExp>
<RegExp input="$$1" output="<entity><year>\5-\4-T#\6</year><artist>\3</artist><title>\2</title><url cache="mb-\1-album.xml">$INFO[mbsite]/ws/2/release/\1?inc=recordings+release-groups+artists+labels+ratings</url></entity>" dest="5+">
<expression repeat="yes">id="([^"]*)"><title>([^<]*)</title><status>Official</status>(?:<packaging>[^<]*</packaging>)?<text-representation>(?:<language>[^<]*</language>)*(?:<script>[^<]*</script>)*</text-representation><artist-credit><name-credit(?:>)*(?:\sjoinphrase="[^"]*">)*(?:<name>[^<]*</name)*(?:>)*<artist\sid="[^"]*"><name>([^<]*)</name>(?:<sort-name>[^<]*</sort-name>)*(?:<disambiguation>[^<]*</disambiguation>)*(?:<alias-list>(?:<alias[^<]*</alias>)*</alias-list>)*</artist></name-credit>(?:<name-credit(?:>)*(?:\sjoinphrase="[^"]*">)*(?:<name>[^<]*</name)*(?:>)*<artist\sid="[^"]*"><name>[^<]*</name>(?:<sort-name>[^<]*</sort-name>)*(?:<disambiguation>[^<]*</disambiguation>)*(?:<alias-list>(?:<alias[^<]*</alias>)*</alias-list>)*</artist></name-credit>)*</artist-credit><release-group(?:\stype="[^"]*")*\sid="[^"]*">(?:<primary-type>[^<]*</primary-type>)*(?:<secondary-type-list>(?:<secondary-type>[^<]*</secondary-type>)+</secondary-type-list>)*</release-group><date>(\d{4})[^<]*</date>(?:<country>)*([^<]*)?.*?<track-list\scount="(\d+)</expression>
</RegExp>
<RegExp input="$$1" output="<entity><year>\6-\5-T#\7</year><artist>\4</artist><title>\3</title><relevance scale="100">\2</relevance><url cache="mb-\1-album.xml">$INFO[mbsite]/ws/2/release/\1?inc=recordings+release-groups+artists+labels+ratings</url></entity>" dest="5+">
<expression repeat="yes">id="([^"]*)" ext:score="(\d*)"><title>([^<]*)</title><status>Official</status>(?:<disambiguation>[^<]*</disambiguation>)*(?:<packaging>[^<]*</packaging>)?<text-representation>(?:<language>[^<]*</language>)*(?:<script>[^<]*</script>)*</text-representation><artist-credit><name-credit(?:>)*(?:\sjoinphrase="[^"]*">)*(?:<name>[^<]*</name)*(?:>)*<artist\sid="[^"]*"><name>([^<]*)</name>(?:<sort-name>[^<]*</sort-name>)*(?:<disambiguation>[^<]*</disambiguation>)*(?:<alias-list>(?:<alias[^<]*</alias>)*</alias-list>)*</artist></name-credit>(?:<name-credit(?:>)*(?:\sjoinphrase="[^"]*">)*(?:<name>[^<]*</name)*(?:>)*<artist\sid="[^"]*"><name>[^<]*</name>(?:<sort-name>[^<]*</sort-name>)*(?:<disambiguation>[^<]*</disambiguation>)*(?:<alias-list>(?:<alias[^<]*</alias>)*</alias-list>)*</artist></name-credit>)*</artist-credit><release-group\sid="[^"]*"\stype="Album">(?:<primary-type>[^<]*</primary-type>)*(?:<secondary-type-list>(?:<secondary-type>[^<]*</secondary-type>)+</secondary-type-list>)*</release-group><date>(\d{4})[^<]*</date>(?:<country>)*([^<]*)?.*?<track-list\scount="(\d+)</expression>
</RegExp>
<RegExp input="$$1" output="<entity><year>\6-\5-T#\7</year><artist>\4</artist><title>\3</title><relevance scale="100">\2</relevance><url cache="mb-\1-album.xml">$INFO[mbsite]/ws/2/release/\1?inc=recordings+release-groups+artists+labels+ratings</url></entity>" dest="5+">
<expression repeat="yes">id="([^"]*)" ext:score="(\d*)"><title>([^<]*)</title><status>Official</status>(?:<disambiguation>[^<]*</disambiguation>)*(?:<packaging>[^<]*</packaging>)?<text-representation>(?:<language>[^<]*</language>)*(?:<script>[^<]*</script>)*</text-representation><artist-credit><name-credit(?:>)*(?:\sjoinphrase="[^"]*">)*(?:<name>[^<]*</name)*(?:>)*<artist\sid="[^"]*"><name>([^<]*)</name>(?:<sort-name>[^<]*</sort-name>)*(?:<disambiguation>[^<]*</disambiguation>)*(?:<alias-list>(?:<alias[^<]*</alias>)*</alias-list>)*</artist></name-credit>(?:<name-credit(?:>)*(?:\sjoinphrase="[^"]*">)*(?:<name>[^<]*</name)*(?:>)*<artist\sid="[^"]*"><name>[^<]*</name>(?:<sort-name>[^<]*</sort-name>)*(?:<disambiguation>[^<]*</disambiguation>)*(?:<alias-list>(?:<alias[^<]*</alias>)*</alias-list>)*</artist></name-credit>)*</artist-credit><release-group\sid="[^"]*"(?:\stype="[^"]*")*>(?:<primary-type>[^<]*</primary-type>)*(?:<secondary-type-list>(?:<secondary-type>[^<]*</secondary-type>)+</secondary-type-list>)*</release-group><date>(\d{4})[^<]*</date>(?:<country>)*([^<]*)?.*?<track-list\scount="(\d+)</expression>
</RegExp>
<RegExp input="$$1" output="<entity><year>\5-\4-T#\6</year><artist>\3</artist><title>\2</title><url cache="mb-\1-album.xml">$INFO[mbsite]/ws/2/release/\1?inc=recordings+release-groups+artists+labels+ratings</url></entity>" dest="5+">
<expression repeat="yes">id="([^"]*)"><title>([^<]*)</title><status>(?!Official)[^<]*</status>(?:<disambiguation>[^<]*</disambiguation>)*(?:<packaging>[^<]*</packaging>)?<text-representation>(?:<language>[^<]*</language>)*(?:<script>[^<]*</script>)*</text-representation><artist-credit><name-credit(?:>)*(?:\sjoinphrase="[^"]*">)*(?:<name>[^<]*</name)*(?:>)*<artist\sid="[^"]*"><name>([^<]*)</name>(?:<sort-name>[^<]*</sort-name>)*(?:<disambiguation>[^<]*</disambiguation>)*(?:<alias-list>(?:<alias[^<]*</alias>)*</alias-list>)*</artist></name-credit>(?:<name-credit(?:>)*(?:\sjoinphrase="[^"]*">)*(?:<name>[^<]*</name)*(?:>)*<artist\sid="[^"]*"><name>[^<]*</name>(?:<sort-name>[^<]*</sort-name>)*(?:<disambiguation>[^<]*</disambiguation>)*(?:<alias-list>(?:<alias[^<]*</alias>)*</alias-list>)*</artist></name-credit>)*</artist-credit><release-group\stype="Album"\sid="[^"]*">(?:<primary-type>[^<]*</primary-type>)*(?:<secondary-type-list>(?:<secondary-type>[^<]*</secondary-type>)+</secondary-type-list>)*</release-group><date>(\d{4})[^<]*</date>(?:<country>)*([^<]*)?.*?<track-list\scount="(\d+)</expression>
</RegExp>
<RegExp input="$$1" output="<entity><year>\5-\4-T#\6</year><artist>\3</artist><title>\2</title><url cache="mb-\1-album.xml">$INFO[mbsite]/ws/2/release/\1?inc=recordings+release-groups+artists+labels+ratings</url></entity>" dest="5+">
<expression repeat="yes">id="([^"]*)"><title>([^<]*)</title><status>(?!Official)[^<]*</status>(?:<disambiguation>[^<]*</disambiguation>)*(?:<packaging>[^<]*</packaging>)?<text-representation>(?:<language>[^<]*</language>)*(?:<script>[^<]*</script>)*</text-representation><artist-credit><name-credit(?:>)*(?:\sjoinphrase="[^"]*">)*(?:<name>[^<]*</name)*(?:>)*<artist\sid="[^"]*"><name>([^<]*)</name>(?:<sort-name>[^<]*</sort-name>)*(?:<disambiguation>[^<]*</disambiguation>)*(?:<alias-list>(?:<alias[^<]*</alias>)*</alias-list>)*</artist></name-credit>(?:<name-credit(?:>)*(?:\sjoinphrase="[^"]*">)*(?:<name>[^<]*</name)*(?:>)*<artist\sid="[^"]*"><name>[^<]*</name>(?:<sort-name>[^<]*</sort-name>)*(?:<disambiguation>[^<]*</disambiguation>)*(?:<alias-list>(?:<alias[^<]*</alias>)*</alias-list>)*</artist></name-credit>)*</artist-credit><release-group(?:\stype="[^"]*")*\sid="[^"]*">(?:<primary-type>[^<]*</primary-type>)*(?:<secondary-type-list>(?:<secondary-type>[^<]*</secondary-type>)+</secondary-type-list>)*</release-group><date>(\d{4})[^<]*</date>(?:<country>)*([^<]*)?.*?<track-list\scount="(\d+)</expression>
</RegExp>
<RegExp input="$$1" output="<entity><year>\6-\5-T#\7</year><artist>\4</artist><title>\3</title><relevance scale="100">\2</relevance><url cache="mb-\1-album.xml">$INFO[mbsite]/ws/2/release/\1?inc=recordings+release-groups+artists+labels+ratings</url></entity>" dest="5+">
<expression repeat="yes">id="([^"]*)" ext:score="(\d*)"><title>([^<]*)</title><status>(?!Official)[^<]*</status>(?:<disambiguation>[^<]*</disambiguation>)*(?:<packaging>[^<]*</packaging>)?<text-representation>(?:<language>[^<]*</language>)*(?:<script>[^<]*</script>)*</text-representation><artist-credit><name-credit(?:>)*(?:\sjoinphrase="[^"]*">)*(?:<name>[^<]*</name)*(?:>)*<artist\sid="[^"]*"><name>([^<]*)</name>(?:<sort-name>[^<]*</sort-name>)*(?:<disambiguation>[^<]*</disambiguation>)*(?:<alias-list>(?:<alias[^<]*</alias>)*</alias-list>)*</artist></name-credit>(?:<name-credit(?:>)*(?:\sjoinphrase="[^"]*">)*(?:<name>[^<]*</name)*(?:>)*<artist\sid="[^"]*"><name>[^<]*</name>(?:<sort-name>[^<]*</sort-name>)*(?:<disambiguation>[^<]*</disambiguation>)*(?:<alias-list>(?:<alias[^<]*</alias>)*</alias-list>)*</artist></name-credit>)*</artist-credit><release-group\sid="[^"]*"\stype="Album">(?:<primary-type>[^<]*</primary-type>)*(?:<secondary-type-list>(?:<secondary-type>[^<]*</secondary-type>)+</secondary-type-list>)*</release-group><date>(\d{4})[^<]*</date>(?:<country>)*([^<]*)?.*?<track-list\scount="(\d+)</expression>
</RegExp>
<RegExp input="$$1" output="<entity><year>\6-\5-T#\7</year><artist>\4</artist><title>\3</title><relevance scale="100">\2</relevance><url cache="mb-\1-album.xml">$INFO[mbsite]/ws/2/release/\1?inc=recordings+release-groups+artists+labels+ratings</url></entity>" dest="5+">
<expression repeat="yes">id="([^"]*)" ext:score="(\d*)"><title>([^<]*)</title><status>(?!Official)[^<]*</status>(?:<disambiguation>[^<]*</disambiguation>)*(?:<packaging>[^<]*</packaging>)?<text-representation>(?:<language>[^<]*</language>)*(?:<script>[^<]*</script>)*</text-representation><artist-credit><name-credit(?:>)*(?:\sjoinphrase="[^"]*">)*(?:<name>[^<]*</name)*(?:>)*<artist\sid="[^"]*"><name>([^<]*)</name>(?:<sort-name>[^<]*</sort-name>)*(?:<disambiguation>[^<]*</disambiguation>)*(?:<alias-list>(?:<alias[^<]*</alias>)*</alias-list>)*</artist></name-credit>(?:<name-credit(?:>)*(?:\sjoinphrase="[^"]*">)*(?:<name>[^<]*</name)*(?:>)*<artist\sid="[^"]*"><name>[^<]*</name>(?:<sort-name>[^<]*</sort-name>)*(?:<disambiguation>[^<]*</disambiguation>)*(?:<alias-list>(?:<alias[^<]*</alias>)*</alias-list>)*</artist></name-credit>)*</artist-credit><release-group\sid="[^"]*"(?:\stype="[^"]*")*>(?:<primary-type>[^<]*</primary-type>)*(?:<secondary-type-list>(?:<secondary-type>[^<]*</secondary-type>)+</secondary-type-list>)*</release-group><date>(\d{4})[^<]*</date>(?:<country>)*([^<]*)?.*?<track-list\scount="(\d+)</expression>
</RegExp>
<expression noclean="1" />
</RegExp>
</GetAlbumSearchResults>
(2017-02-24, 21:52)Merwenus Wrote: Musicbrainz:
For example: if your requests are coming in at 4 requests per second, we don't honour 25% of them and decline the other 75% - we decline 100% of them, until the rate drops to 1 per second or lower.
And the scraper is an idiot, it tries to download 10 per second instead of 1 per 1.5 seconds....
I replied
(2017-02-25, 00:12)DaveBlake Wrote: I don't think it is the scraper doing more than 1 per sec. Probably an addon that is hammering Musicbrainz.
and
(2017-02-26, 10:02)DaveBlake Wrote: The scraper has a 1s sleep built into it, if you are sure there are more requests than that happening then it is from an addon.
Well, I was wrong. Sorry about that Merwenus, and everyone else.
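To make the rate limit quoted above concrete, here is a minimal Python sketch of a throttle that keeps consecutive requests at least one second apart. It is not the scraper's or Kodi's actual code; the class name `MinIntervalThrottle` and its parameters are made up for illustration. The only figure taken from the thread is the 1 request per second limit.

```python
import time


class MinIntervalThrottle:
    """Hypothetical helper: keep consecutive calls at least `interval` seconds apart."""

    def __init__(self, interval=1.0, clock=time.monotonic, sleep=time.sleep):
        self.interval = interval
        self._clock = clock    # injectable so the logic can be tested without waiting
        self._sleep = sleep
        self._last = None      # time of the previous request, None before the first one

    def wait(self):
        """Call immediately before every request that goes to the same server."""
        now = self._clock()
        if self._last is not None:
            remaining = self.interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now
```

A sleep like this only helps if every request to the server goes through the same instance. Separate callers that each sleep for 1s on their own can still add up to more than one request per second.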
(2017-06-26, 17:11)DaveBlake Wrote: Thanks @olympia
Examples of albums in the MB database (I tested the query URL manually) for which the scraper fails to parse the search results:
Don’t Get Me Wrong, Frances Black
The Best of Frances Black, Frances Black
The Sky Road, Frances Black
The Smile on Your Face, Frances Black
Spirits Colliding, Paul Brady
Treasure the Questions, Martyn Joseph
Last Look, Torcuato Mariano
Here are the links:
Code:
http://musicbrainz.org/ws/2/release/?fmt=xml&query=release:%22Don%27t%20Get%20Me%20Wrong%22%20AND%20(artistname:%22Frances%20Black%22%20OR%20artist:%22Frances%20Black%22)
http://musicbrainz.org/ws/2/release/?fmt=xml&query=release:%22The%20Best%20of%20Frances%20Black%22%20AND%20(artistname:%22Frances%20Black%22%20OR%20artist:%22Frances%20Black%22)
http://musicbrainz.org/ws/2/release/?fmt=xml&query=release:%22The%20Sky%20Road%22%20AND%20(artistname:%22Frances%20Black%22%20OR%20artist:%22Frances%20Black%22)
http://musicbrainz.org/ws/2/release/?fmt=xml&query=release:%22The%20Smile%20on%20Your%20Face%22%20AND%20(artistname:%22Frances%20Black%22%20OR%20artist:%22Frances%20Black%22)
http://musicbrainz.org/ws/2/release/?fmt=xml&query=release:%22Spirits%20Colliding%22%20AND%20(artistname:%22Paul%20Brady%22%20OR%20artist:%22Paul%20Brady%22)
http://musicbrainz.org/ws/2/release/?fmt=xml&query=release:%22Treasure%20the%20Questions%22%20AND%20(artistname:%22Martyn%20Joseph%22%20OR%20artist:%22Martyn%20Joseph%22)
http://musicbrainz.org/ws/2/release/?fmt=xml&query=release:%22Last%20Look%22%20AND%20(artistname:%22Torcuato%20Mariano%22%20OR%20artist:%22Torcuato%20Mariano%22)
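For anyone who wants to test other albums by hand, query URLs of this shape can be built with a few lines of Python. This is only a sketch: the function name `release_search_url` is made up, and the query layout is copied from the links above rather than from the scraper source. It does not escape characters that have special meaning in the search syntax, such as a double quote inside an album title.

```python
from urllib.parse import quote

# Base of the search URLs listed above.
MB_SEARCH = "http://musicbrainz.org/ws/2/release/?fmt=xml&query="


def release_search_url(album, artist):
    """Build a release search URL in the same form as the links above."""
    query = (
        f'release:"{album}" AND '
        f'(artistname:"{artist}" OR artist:"{artist}")'
    )
    # Keep ':' and the parentheses literal, percent-encode everything else.
    return MB_SEARCH + quote(query, safe=":()")
```

Calling `release_search_url("Don't Get Me Wrong", "Frances Black")` reproduces the first link in the list.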
(2017-07-01, 00:12)DaveBlake Wrote:
(2017-06-30, 22:23)olympia Wrote: These albums have no year. The scraper currently expects a year, which is why the results are not parsed. I will need to see how to resolve this most efficiently.
Well spotted @olympia
I stared at the data for ages (until regexp of xml melted my brain) and I couldn't see what the common element was.
Thanks also for checking and merging the scraper PR, now I just need to sort out the throttling in core.
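The missing-year problem can be shown with a much simplified pattern. The Python sketch below is not the scraper's expression and not the actual fix. The pattern name `ENTRY` and the two sample releases are made up for illustration. It only shows the general idea of wrapping the `<date>` part in an optional group, so that a release without a date still matches.

```python
import re

# Simplified, hypothetical pattern: id, title, and an OPTIONAL four-digit year.
ENTRY = re.compile(
    r'<release id="([^"]*)"[^>]*><title>([^<]*)</title>'
    r'(?:(?!</release>).)*?</release-group>'   # skip ahead without leaving this release
    r'(?:<date>(\d{4})[^<]*</date>)?',         # the year is allowed to be absent
    re.DOTALL,
)

# Made-up sample data, loosely shaped like a search response.
SAMPLE = (
    '<release id="a1" ext:score="100"><title>With Year</title>'
    '<status>Official</status><release-group id="g1" type="Album">'
    '</release-group><date>1994-05-02</date><country>IE</country></release>'
    '<release id="b2" ext:score="90"><title>No Year</title>'
    '<status>Official</status><release-group id="g2" type="Album">'
    '</release-group><country>IE</country></release>'
)

# Each match is (id, title, year); year is '' when the release has no <date>.
matches = ENTRY.findall(SAMPLE)
```

With a mandatory `<date>` group, as in the current expressions, the second sample release would be skipped entirely. That matches what happens with the albums listed above.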
Quote:
REM GENRE Jazz / Prog
REM DATE 1989
PERFORMER "Perigeo"
TITLE "La Valle dei Templi"
FILE "Perigeo - La Valle dei Templi.flac" WAVE
TRACK 01 AUDIO
TITLE "Tamale"
PERFORMER "Perigeo"
INDEX 00 00:00:00
INDEX 01 00:00:30
TRACK 02 AUDIO
TITLE "La Valle dei Templi"
PERFORMER "Perigeo"
INDEX 00 04:31:30
INDEX 01 04:35:60
TRACK 03 AUDIO
TITLE "Looping"
PERFORMER "Perigeo"
INDEX 00 10:47:60
INDEX 01 10:53:30
TRACK 04 AUDIO
TITLE "Mistero della Firefly"
PERFORMER "Perigeo"
INDEX 00 13:57:46
INDEX 01 14:00:55
TRACK 05 AUDIO
TITLE "Pensieri"
PERFORMER "Perigeo"
INDEX 00 19:57:15
INDEX 01 20:01:25
TRACK 06 AUDIO
TITLE "Periplo"
PERFORMER "Perigeo"
INDEX 00 22:15:70
INDEX 01 22:18:40
TRACK 07 AUDIO
TITLE "Eucalyptus"
PERFORMER "Perigeo"
INDEX 00 27:22:60
INDEX 01 27:27:40
TRACK 08 AUDIO
TITLE "Alba di un Mondo"
PERFORMER "Perigeo"
INDEX 01 28:26:40
TRACK 09 AUDIO
TITLE "Cantilena"
PERFORMER "Perigeo"
INDEX 00 31:20:00
INDEX 01 31:23:30
TRACK 10 AUDIO
TITLE "2000 e due Notti"
PERFORMER "Perigeo"
INDEX 00 35:19:45
INDEX 01 35:24:15
TRACK 11 AUDIO
TITLE "Un cerchio Giallo"
PERFORMER "Perigeo"
INDEX 00 40:59:35
INDEX 01 41:03:65
The flac file (Perigeo - La Valle dei Templi.flac) contains the following tags:
Quote:
Artist Name : Perigeo
Track Title :
Album Title : La Valle dei Templi
Date : 1989
Genre :
Composer :
Performer :
Album Artist :
Track Number :
Total Tracks : 11
Disc Number :
Total Discs : 1
Comment :
<MUSICBRAINZ_ALBUMID> : df2d616a-8fd2-3ceb-b4df-05b3c199787c
<ORIGINALDATE> : 1975
An excerpt from the kodi.log file (debugging enabled): kodi.log