Python WebDriver come stampare l'intera pagina sorgente (html)

Sto usando Python 2.7 con Selenium WebDriver. La mia domanda è come stampare l'intera pagina sorgente con il metodo print. C'è del metodo WebDriver page_source ma restituisce WebDriver e non so come convertirlo in stringa o semplicemente stamparlo nel terminalePython WebDriver come stampare l'intera pagina sorgente (html)

fonte

2014-12-10 wmarchewka

.page_source su un'istanza webdriver è quello che vi serve:

>>> from selenium import webdriver 
>>> driver = webdriver.Firefox() 
>>> driver.get('http://google.com') 
>>> print(driver.page_source) 
<!DOCTYPE html> 
<html xmlns="http://www.w3.org/1999/xhtml" lang="en" itemtype="http://schema.org/WebPage" itemscope=""><head><meta name="descri 
... 
:before,.vscl.vslru div.vspib{top:-4px}</style></body></html>

fonte

2014-12-10 22:20:04 alecxe

Grazie, questo è esattamente ciò di cui ho bisogno! Questa è stata colpa mia perché ho fatto in modo errato 'print driver.page_source' (driver.page_source non era tra parentesi) – wmarchewka

È può anche ottenere l'origine della pagina HTML senza utilizzare un browser. Il modulo delle richieste ti consente di farlo.

import requests 

res = requests.get('https://google.com') 
res.raise_for_status() # this line trows an exception if an error on the 
         # connection to the page occurs. 
print(res.text)

fonte

2017-07-25 13:59:23 Myke

Python WebDriver come stampare l'intera pagina sorgente (html)

risposta

Problemi correlati