scraping java script page
#1
hi.
i am trying to get html content but i get:
"Javascript is required. Please enable javascript before you are allowed to see this page"
any way to get the html content of this page?
url is http://www.sparo.pw/series
thanks for all your help.
Reply
#2
you'd have to execute those scripts through some js interpreter. i believe there are some python modules floating around to achieve that, but portability might suffer, see e.g. https://forum.kodi.tv/showthread.php?tid=266858
Reply
#3
See my example in python for windows operating system, try to adapt to the kodi:

https://pypi.python.org/pypi/selenium#downloads
https://sites.google.com/a/chromium.org/.../downloads
http://phantomjs.org/download.html
https://github.com/mozilla/geckodriver/releases

directory
Code:
[dir]selenium
    [dir]common
    [dir]webdriver
    [file]chromedriver.exe
    [file]geckodriver.exe
    [file]phantomjs.exe
    [file]__init__.py

[file]test.py

------------

test.py
Code:
from selenium import webdriver
chromedriver = "selenium\chromedriver.exe"

options = webdriver.ChromeOptions()
options.add_argument('--window-size=1,111')
options.add_argument('--window-position=-100,120')
#options.add_argument("--silent-launch")
#options.add_argument("--start-maximized")
#options.add_argument("--kiosk")
#driver = webdriver.Chrome(chrome_options=options)

def get_FramePlayer(url):
    #driver = webdriver.PhantomJS(r"selenium\phantomjs.exe")
    driver = webdriver.Chrome(executable_path=chromedriver, chrome_options=options)
    try:
        driver.get(url)
        lnkPlayer = driver.find_element_by_id('Playerholder')
        tagIFRAME = lnkPlayer.find_element_by_name('Player')
        myFrame = tagIFRAME.get_attribute('src')
        
        print 'iframe: ' + myFrame

        driver.quit()
    except:
        driver.quit()
        print 'Error msg'

get_FramePlayer("http://www.redecanais.com/pacto-secreto-dublado-2009-1080p_7e9cef777.html")
Reply
#4
[font][font]I tried to use [/font][/font]Selenium[font][font], but it didn't work.[/font][/font]
Reply
#5
(2017-09-04, 04:41)antrrax Wrote: See my example in python for windows operating system, try to adapt to the kodi:

https://pypi.python.org/pypi/selenium#downloads
https://sites.google.com/a/chromium.org/.../downloads
http://phantomjs.org/download.html
https://github.com/mozilla/geckodriver/releases

directory
Code:
[dir]selenium
[dir]common
[dir]webdriver
[file]chromedriver.exe
[file]geckodriver.exe
[file]phantomjs.exe
[file]__init__.py

[file]test.py

------------

test.py
Code:
from selenium import webdriver
chromedriver = "selenium\chromedriver.exe"

options = webdriver.ChromeOptions()
options.add_argument('--window-size=1,111')
options.add_argument('--window-position=-100,120')
#options.add_argument("--silent-launch")
#options.add_argument("--start-maximized")
#options.add_argument("--kiosk")
#driver = webdriver.Chrome(chrome_options=options)

def get_FramePlayer(url):
#driver = webdriver.PhantomJS(r"selenium\phantomjs.exe")
driver = webdriver.Chrome(executable_path=chromedriver, chrome_options=options)
try:
driver.get(url)
lnkPlayer = driver.find_element_by_id('Playerholder')
tagIFRAME = lnkPlayer.find_element_by_name('Player')
myFrame = tagIFRAME.get_attribute('src')

print 'iframe: ' + myFrame

driver.quit()
except:
driver.quit()
print 'Error msg'

get_FramePlayer("http://www.redecanais.com/pacto-secreto-dublado-2009-1080p_7e9cef777.html")

Take a look at this CLI I created for https://redecanais.pictures/ ...
Reply

Logout Mark Read Team Forum Stats Members Help
scraping java script page0