Search and scrape just does not find a best match
#1
I'm having trouble doing a regular scrape on just about any movies on r1013

Movies that it can not auto scrape (force best match) on:
The Heat
The Grandmaster
Elf (2003)

Meatdata is set to IMDB, and poster/fanart TMDB.

This is the only program having difficulty finding movies, Plex and YAMJ have no problems.

Before this it was The Croods. These are not uncommon movies.


Is there anything I'm missing?
Reply
#2
The problem here is, that IMDB has 2 results of "The Heat".
One from 2013 and one from 2006.
Since both are perfect 100% matches, TMM decided not to take any, and it will not be scraped.
(Better skip a movie, than scrape a wrong one - you'll never find it)

In this case, please scrape manually, without forcing a best match.
Or use TMDB - they only have one....

(same for "The Grandmaster" and the others)
tinyMediaManager - THE media manager of your choice :)
Wanna help translate TMM ?
Image
Reply
#3
This is a problem. Your going to find alot of movies that are similar. this is the only scraper that can not handle this well. It would not be too bad if it were just a few out of hundreds but for me its alot worse.
Reply
#4
We don't care about "similar" ones - but it might be a problem with "exact the same" titles.
We already try to filter some results from IMDB, but they have sometimes very weird entries in their DB (contrary to TMDB imho)

When IMDB gives me 2 exact results:
"The Heat" and
"The Heat"

Please tell me what in your opinion TMM should take?
Of course we just could take anyone / the first one (like others?)
But... do you really want the probably wrong movie scraped?
me not.

A workaround for you could be, to scrape only the title first with TheMovieDB,
then you automatically have a valid TMDB & IMDB id, where we can parse the right movie from IMDB directly afterwards....

hth

PS:
how many 'duplicate' titles do you have?
could you send me a list?
tinyMediaManager - THE media manager of your choice :)
Wanna help translate TMM ?
Image
Reply
#5
maybe I need to reinstall this? Here is a quick list of force best match,

127 hours (found)
1408 (no movie found)
17 Again (no movie Found)
2012 (no movie found)
21 (no movie found)
21 & over (found)
21 jump street (no movie found)
28 Days Later... (found)
2 fast 2 furious (no movie found)
3:10 to Yuma (no movie found)
300 (no movie found)
Reply
#6
Is it possible to seach the movie on google and then scrape the first imdb link?

the heat site:imdb.com
1408 site:imdb.com
17 Again site:imdb.com

find the first link like below...
"www.imdb.com/title/tt"
and then scrape that?

Trying to find an alternative to find a best match.

The first link in a google search for any of those movies looks correct to me.
Reply
#7
works for me
1408 - found
17 again - found
2012 - found
2f2f - found
21 - 2 exact matches
21 jump street - 2 exact matches

What language settigns do you have setup for scraper & GUI? IMDB is very... well, there often problems with it Smile
Please use the bug report feature of TMM and upload the logfile - thanks.
tinyMediaManager - THE media manager of your choice :)
Wanna help translate TMM ?
Image
Reply
#8
I deleted everything and redownload and now it doesn't start.

I get a little empty dialog box and the log below.

2013-10-05 10:40:29,185 INFO [main] org.tinymediamanager.TinyMediaManager:172 - =====================================================
2013-10-05 10:40:29,189 INFO [main] org.tinymediamanager.TinyMediaManager:173 - === tinyMediaManager © 2012-2013 Manuel Laggner ===
2013-10-05 10:40:29,189 INFO [main] org.tinymediamanager.TinyMediaManager:174 - =====================================================
2013-10-05 10:40:29,189 INFO [main] org.tinymediamanager.TinyMediaManager:176 - os.name : Linux
2013-10-05 10:40:29,190 INFO [main] org.tinymediamanager.TinyMediaManager:177 - os.version : 3.8.0-25-generic
2013-10-05 10:40:29,190 INFO [main] org.tinymediamanager.TinyMediaManager:178 - os.arch : amd64
2013-10-05 10:40:29,190 INFO [main] org.tinymediamanager.TinyMediaManager:179 - java.version : 1.7.0_25
2013-10-05 10:40:29,204 WARN [main] org.tinymediamanager.ui.TmmUIHelper:61 - cannot open init filedialogorg.eclipse.swt.widgets.FileDialog
2013-10-05 10:40:29,205 INFO [main] org.tinymediamanager.TinyMediaManager:584 - default encoding : UTF-8 | UTF8 | UTF-8
2013-10-05 10:40:29,206 INFO [main] org.tinymediamanager.TinyMediaManager:584 - set encoding to : UTF-8 | UTF8 | UTF-8
2013-10-05 10:40:29,651 INFO [main] org.tinymediamanager.TinyMediaManager:206 - System language : en_en
2013-10-05 10:40:29,651 INFO [main] org.tinymediamanager.TinyMediaManager:207 - GUI language : en_US
2013-10-05 10:40:29,651 INFO [main] org.tinymediamanager.TinyMediaManager:208 - Scraper language : English
2013-10-05 10:40:29,652 INFO [main] org.tinymediamanager.TinyMediaManager:209 - TV Scraper lang : English
2013-10-05 10:40:42,430 ERROR [main] org.tinymediamanager.TinyMediaManager:418 - start of tmm
java.lang.NullPointerException: null
at org.tinymediamanager.core.Utils.deleteOldBackupFile(Utils.java:910) ~[tmm.jar:2.4.1 (r1030)]
at org.tinymediamanager.TinyMediaManager$1.doStartupTasks(TinyMediaManager.java:567) ~[tmm.jar:2.4.1 (r1030)]
at org.tinymediamanager.TinyMediaManager$1.run(TinyMediaManager.java:228) ~[tmm.jar:2.4.1 (r1030)]
at java.awt.event.InvocationEvent.dispatch(InvocationEvent.java:251) ~[na:1.7.0_25]
at java.awt.EventQueue.dispatchEventImpl(EventQueue.java:733) ~[na:1.7.0_25]
at java.awt.EventQueue.access$200(EventQueue.java:103) ~[na:1.7.0_25]
at java.awt.EventQueue$3.run(EventQueue.java:694) ~[na:1.7.0_25]
at java.awt.EventQueue$3.run(EventQueue.java:692) ~[na:1.7.0_25]
at java.security.AccessController.doPrivileged(Native Method) ~[na:1.7.0_25]
at java.security.ProtectionDomain$1.doIntersectionPrivilege(ProtectionDomain.java:76) ~[na:1.7.0_25]
at java.awt.EventQueue.dispatchEvent(EventQueue.java:703) ~[na:1.7.0_25]
at java.awt.EventDispatchThread.pumpOneEventForFilters(EventDispatchThread.java:242) ~[na:1.7.0_25]
at java.awt.EventDispatchThread.pumpEventsForFilter(EventDispatchThread.java:161) ~[na:1.7.0_25]
at java.awt.EventDispatchThread.pumpEventsForHierarchy(EventDispatchThread.java:150) ~[na:1.7.0_25]
at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:146) ~[na:1.7.0_25]
at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:138) ~[na:1.7.0_25]
at java.awt.EventDispatchThread.run(EventDispatchThread.java:91) ~[na:1.7.0_25]
Reply
#9
ok, another problem Smile
Where did you have installed TMM?

The problem might be, that you don't have write permissions in that folder (only as admin)
Move your complete TMM folder somewhere else.
tinyMediaManager - THE media manager of your choice :)
Wanna help translate TMM ?
Image
Reply
#10
I'm using Linux and its under my home folder right now,

/home/draztik/tmm


launcher.log

2013/10/05 11:01:40:106 INFO m.a: ------------------ VM Info ------------------
2013/10/05 11:01:40:108 INFO m.a: -- OS Name: Linux
2013/10/05 11:01:40:108 INFO m.a: -- OS Arch: amd64
2013/10/05 11:01:40:108 INFO m.a: -- OS Vers: 3.8.0-25-generic
2013/10/05 11:01:40:109 INFO m.a: -- Java Vers: 1.7.0_25
2013/10/05 11:01:40:109 INFO m.a: -- Java Home: /usr/lib/jvm/java-7-openjdk-amd64/jre
2013/10/05 11:01:40:109 INFO m.a: -- User Name: draztik
2013/10/05 11:01:40:109 INFO m.a: -- User Home: /home/draztik
2013/10/05 11:01:40:109 INFO m.a: -- Cur dir: /home/draztik/tmm
2013/10/05 11:01:40:110 INFO m.a: ---------------------------------------------
2013/10/05 11:01:40:147 INFO m.a: ---------------- Proxy Info -----------------
2013/10/05 11:01:40:147 INFO m.a: -- Proxy Host: null
2013/10/05 11:01:40:147 INFO m.a: -- Proxy Port: null
2013/10/05 11:01:40:147 INFO m.a: ---------------------------------------------
2013/10/05 11:01:40:149 INFO m.a: Skipping [quals=windows, osname=linux, osarch=amd64, key=resource, value=[windows] tinyMediaManagerCMD.exe]
2013/10/05 11:01:40:151 INFO m.a: Skipping [quals=linux-arm, osname=linux, osarch=amd64, key=resource, value=[linux-arm] native/linux-arm/libmediainfo.so]
2013/10/05 11:01:40:151 INFO m.a: Skipping [quals=linux-arm, osname=linux, osarch=amd64, key=resource, value=[linux-arm] native/linux-arm/libzen.so]
2013/10/05 11:01:40:151 INFO m.a: Skipping [quals=linux-i386, osname=linux, osarch=amd64, key=resource, value=[linux-i386] native/linux-i386/libmediainfo.so]
2013/10/05 11:01:40:151 INFO m.a: Skipping [quals=linux-i386, osname=linux, osarch=amd64, key=resource, value=[linux-i386] native/linux-i386/libzen.so]
2013/10/05 11:01:40:152 INFO m.a: Skipping [quals=linux-i686, osname=linux, osarch=amd64, key=resource, value=[linux-i686] native/linux-i686/libmediainfo.so]
2013/10/05 11:01:40:152 INFO m.a: Skipping [quals=linux-i686, osname=linux, osarch=amd64, key=resource, value=[linux-i686] native/linux-i686/libzen.so]
2013/10/05 11:01:40:152 INFO m.a: Skipping [quals=mac os x, osname=linux, osarch=amd64, key=resource, value=[mac os x] native/mac-x86_64/libmediainfo.dylib]
2013/10/05 11:01:40:152 INFO m.a: Skipping [quals=windows-amd64, osname=linux, osarch=amd64, key=resource, value=[windows-amd64] native/windows-amd64/MediaInfo.dll]
2013/10/05 11:01:40:153 INFO m.a: Skipping [quals=windows-x64, osname=linux, osarch=amd64, key=resource, value=[windows-x64] native/windows-x64/MediaInfo.dll]
2013/10/05 11:01:40:153 INFO m.a: Skipping [quals=windows-x86, osname=linux, osarch=amd64, key=resource, value=[windows-x86] native/windows-x86/MediaInfo.dll]
2013/10/05 11:01:40:153 INFO m.a: Skipping [quals=windows-x86, osname=linux, osarch=amd64, key=resource, value=[windows-x86] native/windows-x86/mingwm10.dll]
2013/10/05 11:01:40:154 INFO m.a: Skipping [quals=mac os x, osname=linux, osarch=amd64, key=jvmarg, value=[mac os x] -Dapple.awt.graphics.UseQuartz=true]
2013/10/05 11:01:40:165 INFO m.a: Able to lock for updates: true
2013/10/05 11:01:40:243 INFO m.a: Verifying application: http://tinymediamanager.googlecode.com/svn/dist/
2013/10/05 11:01:40:243 INFO m.a: Version: -1
2013/10/05 11:01:40:244 INFO m.a: Class: org.tinymediamanager.TinyMediaManager
2013/10/05 11:01:40:246 INFO m.a: Dropping status 'm.validating'.
2013/10/05 11:01:40:271 INFO m.a: Attempting to refetch 'digest.txt' from 'http://tinymediamanager.googlecode.com/svn/dist/digest.txt'.
2013/10/05 11:01:40:408 INFO m.a: No signers, not verifying file [path=digest.txt]
2013/10/05 11:01:40:427 INFO m.a: Resources verified.
2013/10/05 11:01:40:431 INFO m.a: Didn't find any custom environment variables, not setting any.
2013/10/05 11:01:40:431 INFO m.a: Running /usr/lib/jvm/java-7-openjdk-amd64/jre/bin/java
-classpath
/home/draztik/tmm/./tmm.jar:/home/draztik/tmm/./lib/asm.jar:/home/draztik/tmm/./lib/betterbeansbinding-core.jar:/home/draztik/tmm/./lib/betterbeansbinding-el.jar:/home/draztik/tmm/./lib/betterbeansbinding-swingbinding.jar:/home/draztik/tmm/./lib/commons-codec.jar:/home/draztik/tmm/./lib/commons-io.jar:/home/draztik/tmm/./lib/commons-lang3.jar:/home/draztik/tmm/./lib/DJNativeSwing.jar:/home/draztik/tmm/./lib/DJNativeSwing-SWT.jar:/home/draztik/tmm/./lib/fanarttvapi.jar:/home/draztik/tmm/./lib/forms.jar:/home/draztik/tmm/./lib/glazedlists.jar:/home/draztik/tmm/./lib/httpclient.jar:/home/draztik/tmm/./lib/httpcore.jar:/home/draztik/tmm/./lib/httpmime.jar:/home/draztik/tmm/./lib/imgscalr-lib.jar:/home/draztik/tmm/./lib/jackson-annotations.jar:/home/draztik/tmm/./lib/jackson-core.jar:/home/draztik/tmm/./lib/jackson-databind.jar:/home/draztik/tmm/./lib/jcl-over-slf4j.jar:/home/draztik/tmm/./lib/jdom.jar:/home/draztik/tmm/./lib/jmte-unbundled.jar:/home/draztik/tmm/./lib/jna.jar:/home/draztik/tmm/./lib/jsoup.jar:/home/draztik/tmm/./lib/JSplitButton.jar:/home/draztik/tmm/./lib/JTattoo.jar:/home/draztik/tmm/./lib/l2fprod-common-buttonbar.jar:/home/draztik/tmm/./lib/l2fprod-common-shared.jar:/home/draztik/tmm/./lib/log4j-over-slf4j.jar:/home/draztik/tmm/./lib/logback-classic.jar:/home/draztik/tmm/./lib/logback-core.jar:/home/draztik/tmm/./lib/objectdb.jar:/home/draztik/tmm/./lib/platform.jar:/home/draztik/tmm/./lib/resources.jar:/home/draztik/tmm/./lib/Scaling-bin.jar:/home/draztik/tmm/./lib/slf4j-api.jar:/home/draztik/tmm/./lib/themoviedbapi.jar:/home/draztik/tmm/./lib/thetvdbapi.jar:/home/draztik/tmm/./lib/xmlrpc-client.jar
-Dcom.threerings.getdown=true
-Xms64m
-Xmx512m
-Xss512k
-splashConfusedplashscreen.png
-Djna.nosys=true
org.tinymediamanager.TinyMediaManager
Reply
#11
I've just found the bug Wink
just create a dir called backup in the tmm install dir - this will prevent the crash (a fix is being comitted soon)

after that please send us a bug report right after scraping several of your non found movies (in the log we will see the exact search pattern for imdb)

thanks
manuel
tinyMediaManager - THE media manager of your choice - available for Windows, macOS and Linux
Help us translate tinyMediaManager at Weblate
Found a bug or want to submit a feature request? Contact us at GitLab
Image
Reply
#12
I submitted a bug....
Reply
#13
well, I think you misunderstood me Wink
if you send a bug report from inside tmm, we get all relevant logs (menu "contact us" -> "send bug report")
tinyMediaManager - THE media manager of your choice - available for Windows, macOS and Linux
Help us translate tinyMediaManager at Weblate
Found a bug or want to submit a feature request? Contact us at GitLab
Image
Reply



Logout Mark Read Team Forum Stats Members Help
Search and scrape just does not find a best match0
This forum uses Lukasz Tkacz MyBB addons.