2010-06-16, 00:37
MaDDoGo Wrote:Hi,So... the options are:
I know what happens. The problem with this movie is related to non-standard plots in filmaffinity. The scraper uses the (FILMAFFINITY) word in almost every films on this page to take plots. It takes everything from the start of the plot to this word.
In this movie, there is no final tag so the scraper doesn't know what it has to scrape. The answer to this is to scrape everything including (FILMAFFINITY) but I don't thing this is a good way to do things.
I'll answer if it's possible to do different rules depending of the emptiness of fields.
I expect you to understand me because of my "English".
a) To scrape everything including (FILMAFFINITY). This way will ALWAYS work the argumento.
b) Use "(FILMAFFINITY)" word to take plots but, this way, sometimes the argumento doesnt work.
I prefer to scrape everything and this way always work argumento.
Or, maybe, scrape everything and the try to eliminate "(FILMAFFINITY)" word
doing something like:
Code:
argumento = argumento.split("(FILMAFFINITY)")
Entiendo el problema, pero creo que tiene que tener alguna solucion facil... esperar a que los de FA arreglen su base de datos puede ser largo.
Bye