Posts: 46
Joined: Jun 2011
Reputation:
0
Well, perhaps someone will develop an open source program that will be able to identify TV Shows and Movies by their audio just like Shazam and provide an API. It would be nice to have a way to identify the files this way since we could also use the information to rename and file them.
Posts: 793
Joined: Oct 2010
Reputation:
17
Echo nest has an open source audio ID software, server and all. Just need a good source of confirmed finger prints. Maybe an addon that checks the current library for a semi positive ID (assuming that the majority of people won't let Ann item stay misidentified long), fingerprints the file and submits the ID info.
Posts: 90
Joined: Aug 2010
Reputation:
2
There are some questions concerning this method to identify tvshows/movies:
1. The lookup time will most likely be increased compared to todays solution due to actual sampling of the tvshow (in other words: xbmc have to open and play the video file before it can be identified)
2. Movies and tvshows usually have intros (studios involved, identical intros for several episodes during a season etc),how precise would this method be?
3. Does this give greater functionality than what already exists today with filename comparison?
Posts: 90
Joined: Aug 2010
Reputation:
2
2012-09-23, 16:11
(This post was last modified: 2012-09-23, 16:12 by texaco.)
maybe i have mistanke how shazam works but to retrieve the sample you have to play it back, meaning if you want a 10s second sample you effectively add 10 seconds per file just to identify it. if you then want to add a whole season with 20 episodes you have to wait 200 seconds before every file has been identified.
This methods adds a lot of overhead just to retrieve waht already is widely used in the filename: showname.s01e01.
Im not saying its a cool though, but why reinvent something that already works great? Identification with shazam makes more sense when identifying music (more variations, mixes, new versions etc) than to identify a movie or tvshows which has more static content.
Posts: 793
Joined: Oct 2010
Reputation:
17
No doesn't need to be played back, it just reads a chunk of the data from the file. The server knows what a given sound should look like given a certain encoding.
It's actually pretty quick, most of the time in shazaam is spent cleaning up the microphone audio and submitting. Echo nest claims their analysis should take about a tenth of a second. So 20 episodes would only take 2 seconds plus network time. They could all be submitted and retrieved at once so call that 3 seconds. 23 seconds might even be faster than the current scrapers =)
Posts: 793
Joined: Oct 2010
Reputation:
17
Yes shazaam is proprietary, but I've probably said 10 times now that echo nest is open source, both client and server. The amount of languages is irrelevant as it only needs to identify one of them. Although it would be a nice site benefit for a scraper to be able to detect available languages.
Musicbrainz doesn't help because they don't allow tv shows or movies. The database will have to be grown in the same way the tv db and all the others were, user contribution. The one powerful advantage in this area is that it can build on their success by pulling automated contributions from user's libraries