Kodi Community Forum
Scrape data from the files themselves? - Printable Version

+- Kodi Community Forum (https://forum.kodi.tv)
+-- Forum: Support (https://forum.kodi.tv/forumdisplay.php?fid=33)
+--- Forum: Supplementary Tools for Kodi (https://forum.kodi.tv/forumdisplay.php?fid=116)
+---- Forum: Ember Media Manager (https://forum.kodi.tv/forumdisplay.php?fid=195)
+---- Thread: Scrape data from the files themselves? (/showthread.php?tid=335576)



Scrape data from the files themselves? - Kalamar - 2018-09-15

I have a series of files for which none of the sources can find any data (french kids anime). But the files are already named partially correctly (S02E01 xxxx ). Is there a way to make Ember "scrape the data from the files", so that it can fetch the titles/episodes from the file names to fill the meta data (so that I can have properly filled nfo files and rename the files using the same naming convention as other files)?
(this also applies when the data from tvdb is incorrect,
e.g.: 
The episode list from here https://fr.wikipedia.org/wiki/Les_As_de_la_jungle#Seconde_s%C3%A9rie_:_Les_As_de_la_jungle_%C3%A0_la_rescousse_(depuis_2014) is consistent with my files, but not with the data from https://www.thetvdb.com/series/the-jungle-bunch/seasons/2 )

I would like to be able to get the initial data from the files themselves and then fetch possibly the plot/screenshots from tvdb later


RE: Scrape data from the files themselves? - calexicoca - 2019-11-21

(2018-09-15, 15:48)Kalamar Wrote: I have a series of files for which none of the sources can find any data (french kids anime). But the files are already named partially correctly (S02E01 xxxx ). Is there a way to make Ember "scrape the data from the files", so that it can fetch the titles/episodes from the file names to fill the meta data (so that I can have properly filled nfo files and rename the files using the same naming convention as other files)?
(this also applies when the data from tvdb is incorrect,
e.g.: 
The episode list from here https://fr.wikipedia.org/wiki/Les_As_de_la_jungle#Seconde_s%C3%A9rie_:_Les_As_de_la_jungle_%C3%A0_la_rescousse_(depuis_2014) is consistent with my files, but not with the data from https://www.thetvdb.com/series/the-jungle-bunch/seasons/2 )

I would like to be able to get the initial data from the files themselves and then fetch possibly the plot/screenshots from tvdb later

Too bad no one of experts responded to this. I also want to be able to use file name instead of generic SXXEXX Ember uses. I just wonder how.


RE: Scrape data from the files themselves? - DanCooper - 2019-11-21

(2018-09-15, 15:48)Kalamar Wrote: I have a series of files for which none of the sources can find any data (french kids anime). But the files are already named partially correctly (S02E01 xxxx ). Is there a way to make Ember "scrape the data from the files", so that it can fetch the titles/episodes from the file names to fill the meta data (so that I can have properly filled nfo files and rename the files using the same naming convention as other files)?
(this also applies when the data from tvdb is incorrect,
e.g.: 
The episode list from here https://fr.wikipedia.org/wiki/Les_As_de_la_jungle#Seconde_s%C3%A9rie_:_Les_As_de_la_jungle_%C3%A0_la_rescousse_(depuis_2014) is consistent with my files, but not with the data from https://www.thetvdb.com/series/the-jungle-bunch/seasons/2 )

I would like to be able to get the initial data from the files themselves and then fetch possibly the plot/screenshots from tvdb later
Sorry for my late response. Looks like you use the "DVD" and not the "aired" order for your files. You can set a whole tv show source or a single tv show to another order and rescrape it. The TVDb scraper provides "Official/Aired", "DVD" and "Absolute" order. Looks like TMDb also have implemented that in one of the last updates, but I haven't updatet the TMDb scraper to get this without an enabled TVDb scraper. I'm not shure if TVDb is still broken in Ember, so you have to test it.

Image


RE: Scrape data from the files themselves? - DanCooper - 2019-11-21

(2019-11-21, 13:24)calexicoca Wrote: Too bad no one of experts responded to this. I also want to be able to use file name instead of generic SXXEXX Ember uses. I just wonder how.
You can disable the generic SxxExx title creating for episodes without NFO. Than the filename will be cleaned (removes tv show title if the same text is in the episode file name like the tv show folder name, removes all SxxExx and also all known tags like dvd, bluray, codec ect.) and used as episode title. The season and episode information is still known and Ember can scrape it if you want. This is a system wide settings for all tv shows, but it only affects episodes that do not have NFO when they are read. It's basically just a cosmetic effect and a help to spot episodes that have not been scraped yet.

Image


RE: Scrape data from the files themselves? - calexicoca - 2019-11-23

THANKS Dan!