不要等待在 Python 中使用 Selenium 加载页面

时间：2023-07-04

本文介绍了不要等待在 Python 中使用 Selenium 加载页面的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着跟版网的小编来一起学习吧！

问题描述

如何让 selenium 在页面完全加载之前点击元素并抓取数据?我的互联网连接非常糟糕，所以有时需要很长时间才能完全加载页面，这有什么问题吗?

How do I make selenium click on elements and scrape data before the page has fully loaded? My internet connection is quite terrible so it sometimes takes forever to load the page entirely, is there anyway around this?

推荐答案

ChromeDriver 77.0(支持 Chrome 77 版)现在支持 eager作为 pageLoadStrategy.

ChromeDriver 77.0 (which supports Chrome version 77) now supports eager as pageLoadStrategy.

已解决的问题 1902:支持急切页面加载策略 [Pri-2]

Resolved issue 1902: Support eager page load strategy [Pri-2]

<小时>

当你提到在页面完全加载之前点击元素并抓取数据在这种情况下，我们可以利用属性pageLoadStrategy.当 Selenium 默认加载页面/url 时，它遵循默认配置，将 pageLoadStrategy 设置为 normal.Selenium 可以从不同的文档就绪状态开始执行下一行代码.目前 Selenium 支持 3 种不同的 Document readiness state，我们可以通过 pageLoadStrategy 配置如下:

As you question mentions of click on elements and scrape data before the page has fully loaded in this case we can take help of an attribute pageLoadStrategy. When Selenium loads a page/url by default it follows a default configuration with pageLoadStrategy set to normal. Selenium can start executing the next line of code from different Document readiness state. Currently Selenium supports 3 different Document readiness state which we can configure through the pageLoadStrategy as follows:

无(未定义)
eager(页面变为交互式)
正常(完成页面加载)

这是配置pageLoadStrategy的代码块:

Here is the code block to configure the pageLoadStrategy:

from selenium import webdriver from selenium.webdriver.common.desired_capabilities import DesiredCapabilities binary = r'C:Program FilesMozilla Firefoxfirefox.exe' caps = DesiredCapabilities().FIREFOX # caps["pageLoadStrategy"] = "normal" # complete caps["pageLoadStrategy"] = "eager" # interactive # caps["pageLoadStrategy"] = "none" # undefined driver = webdriver.Firefox(capabilities=caps, firefox_binary=binary, executable_path="C:\Utility\BrowserDrivers\geckodriver.exe") driver.get("https://google.com")

这篇关于不要等待在 Python 中使用 Selenium 加载页面的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持跟版网！

上一篇：如何让 Python 中的 Selenium WebDriver 休眠几毫秒 下一篇：如何在 python webdriver 中为 chrome 设置代理?

相关文章

通过 Selenium 和 python 切换到 iframe

为什么 Python 看不到环境变量?

如何读取 Windows 环境变量值?

os.getenv 和 os.environ.get 的区别

解析配置文件、环境和命令行参数，以获取单个选项集合

打开命令行时Windows环境变量发生变化?

在 Windows 中将 DJANGO_SETTINGS_MODULE 永久设置为环境变量

cron运行python脚本时的环境变量

如何在 Amazon Elastic Beanstalk (Python) 中设置环境变量

python - os.getenv 和 os.environ 看不到我的 bash shell 的环境变量