Selenium get source


Selenium is a web automation module that can be used to get a webpages html code. In this article we will show how to achieve that.

You may need to set the path to chromium:

export PATH=$PATH:/usr/lib/chromium/

Related course
Selenium WebDriver and Python: WebTest Automation Course

Get html source
We import webdriver from the selenium module. A webdriver object is created (chromium) and we can optionally specify if we want to ignore certificate errors. Once the web browser started we navigate it to a webpage URL using the get() module. Then we get the page source.

from selenium import webdriver
import time
 
options = webdriver.ChromeOptions()
options.add_argument('--ignore-certificate-errors')
options.add_argument("--test-type")
options.binary_location = "/usr/bin/chromium"
driver = webdriver.Chrome(chrome_options=options)
driver.get('https://python.org')
 
html = driver.page_source
print(html)

It will output the webpage source, which is stored in the variable html.

selenium chromium

Selenium will start the chromium browser automatically

selenium webdriver
selenium textbox
This entry was posted in Selenium and tagged . Bookmark the permalink.