用selenium刮网（selenium怎么用）

25-02-27 10

想了解用selenium刮网的新动态吗？本文将为您提供详细的信息，我们还将为您解答关于selenium怎么用的相关问题，此外，我们还将为您介绍关于c#–是否可以在不安装SeleniumServer的情

想了解用selenium刮网的新动态吗？本文将为您提供详细的信息，我们还将为您解答关于selenium怎么用的相关问题，此外，我们还将为您介绍关于c# – 是否可以在不安装Selenium Server的情况下使用ISelenium / DefaultSelenium？、java+selenium, python+selenium浏览界面,滑动页面、Python selenium —— 一定要会用selenium的等待，三种等待方式解读、python+selenium十：selenium的二次封装的新知识。

本文目录一览：

用selenium刮网（selenium怎么用）
c# – 是否可以在不安装Selenium Server的情况下使用ISelenium / DefaultSelenium？
java+selenium, python+selenium浏览界面,滑动页面
Python selenium —— 一定要会用selenium的等待，三种等待方式解读
python+selenium十：selenium的二次封装

用selenium刮网（selenium怎么用）

我正尝试在此网站上搜索companynames, code, industry, sector, mkt cap, etcselenium表中的清单。我是新手，并编写了以下代码：

path_to_chromedriver = r''C:\Documents\chromedriver''browser = webdriver.Chrome(executable_path=path_to_chromedriver)url = r''http://sgx.com/wps/portal/sgxweb/home/company_disclosure/stockfacts''browser.get(url)time.sleep(15)output = browser.page_sourceprint(output)

但是，我可以获取以下标签，但不能获取其中的数据。

            <div>                <table>                    <thead>                        <tr></tr>                    </thead>                    <tbody></tbody>                </table>            </div>            <div></div>

我以前也尝试过BS4进行刮擦，但失败了。任何帮助深表感谢。

答案1

小编典典

该 结果是在一个iframe -切换到它，然后得到.page_source：

iframe = driver.find_element_by_css_selector("#mainContent iframe")driver.switch_to.frame(iframe)

我还要添加一个等待表加载的方法：

from selenium.webdriver.common.by import Byfrom selenium.webdriver.support.ui import WebDriverWaitfrom selenium.webdriver.support import expected_conditions as ECwait = WebDriverWait(driver, 10)# locate and switch to the iframeiframe = driver.find_element_by_css_selector("#mainContent iframe")driver.switch_to.frame(iframe)# wait for the table to loadwait.until(EC.visibility_of_element_located((By.CSS_SELECTOR, ''.companyName'')))print(driver.page_source)

c# – 是否可以在不安装Selenium Server的情况下使用ISelenium / DefaultSelenium？

我之前使用IWebDriver控制IE进行测试.但是IWebDriver和IWebElement支持的方法非常有限.我发现属于Selenium命名空间的ISelenium / DefaultSelenium非常有用.如何在不安装Selenium Server的情况下使用它们来控制IE？

这是DefaultSelenium的构造函数：

ISelenium sele = new DefaultSelenium(**serveraddr**,**serverport**,browser,url2test);
sele.Start();
sele.open();
...

似乎我必须在创建ISelenium对象之前安装Selenium Server.

我的情况是,我正在尝试使用C#Selenium构建一个.exe应用程序,它可以在不同的PC上运行,并且不可能在所有PC上安装Selenium Server(你永远不知道哪个是下一个运行应用程序).

有没有人知道如何在不安装服务器的情况下使用ISelenium / DefaultSelenium？
谢谢！

解决方法

在不使用RC Server的情况下,Java中有一些解决方案：

1)对于selenium浏览器启动：

DesiredCapabilities capabilities = new DesiredCapabilities();
capabilities.setbrowserName("safari");
CommandExecutor executor = new SeleneseCommandExecutor(new URL("http://localhost:4444/"),new URL("http://www.google.com/"),capabilities);
WebDriver driver = new RemoteWebDriver(executor,capabilities);

2)对于selenium命令：

// You may use any WebDriver implementation. Firefox is used here as an example
WebDriver driver = new FirefoxDriver();

// A "base url",used by selenium to resolve relative URLs
 String baseUrl = "http://www.google.com";

// Create the Selenium implementation
Selenium selenium = new WebDriverBackedSelenium(driver,baseUrl);

// Perform actions with selenium
selenium.open("http://www.google.com");
selenium.type("name=q","cheese");
selenium.click("name=btnG");

// Get the underlying WebDriver implementation back. This will refer to the
// same WebDriver instance as the "driver" variable above.
WebDriver driverInstance = ((WebDriverBackedSelenium) selenium).getWrappedDriver();

//Finally,close the browser. Call stop on the WebDriverBackedSelenium instance
//instead of calling driver.quit(). Otherwise,the JVM will continue running after
//the browser has been closed.
selenium.stop();

描述于此：http://seleniumhq.org/docs/03_webdriver.html

谷歌在C#中有类似的东西.没有其他方法可以实现这一目标.

java+selenium, python+selenium浏览界面,滑动页面

用到selenium的时候,有时需要滑动屏幕,模拟浏览,这时候就可以用到下边的代码了:

首先是java的selenium滑屏代码(不知道为啥后边就乱了排版,自己调整下就好~)

private static Random random = new Random();

private void pretendScrolling(JavascriptExecutor js, int sum) throws InterruptedException {
    sum--;
 int step;
 int length;
 switch (sum) {
 case 0:
 step = random.nextInt(30) + 1 + 50;
 length = random.nextInt(3000) + 1 + 3000;
 smoothlyScrolling(js, 0, step, length, true);
 smoothlyScrolling(js, 0, step, length, false);

 smoothlyScrolling(js, 0, step, length, true);
 smoothlyScrolling(js, length, step, 8000, true);

 step = random.nextInt(20) + 1 + 80;
 smoothlyScrolling(js, 0, step, 8000, false);

 length = random.nextInt(2000) + 1 + 3000;
 smoothlyScrolling(js, 0, step, length, true);
 smoothlyScrolling(js, 0, 20, length, false);
 break;
 case 1:
 step = random.nextInt(10) + 1 + 40;
 smoothlyScrolling(js, 0, step, 6000, true);
 Thread.sleep(3000);
 smoothlyScrolling(js, 0, step, 6000, false);

 length = random.nextInt(1500) + 1 + 1000;
 step = random.nextInt(10) + 1 + 40;
 Thread.sleep(3000);
 smoothlyScrolling(js, 0, step, length, true);
 Thread.sleep(5000);
 step = random.nextInt(10) + 1 + 40;
 smoothlyScrolling(js, length, step, 7000, true);
 Thread.sleep(3000);

 step = random.nextInt(20) + 1 + 80;
 smoothlyScrolling(js, 0, step, 8000, false);
 Thread.sleep(3000);
 break;
 case 2:
 step = random.nextInt(10) + 1 + 30;
 length = random.nextInt(1300) + 1 + 1000;
 smoothlyScrolling(js, 0, step, length, true); Thread.sleep(8000); step = random.nextInt(10) + 1 + 20; smoothlyScrolling(js, length, step, 6000, true); Thread.sleep(3000); length = random.nextInt(1500) + 1 + 1500; step = random.nextInt(20) + 1 + 40; smoothlyScrolling(js, length, step, 6000, false); Thread.sleep(3000); step = random.nextInt(30) + 1 + 70;
 smoothlyScrolling(js, length, step, 6000, true);
 Thread.sleep(4000);
 step = random.nextInt(20) + 1 + 30;
 smoothlyScrolling(js, 0, step, 6000, false);
 Thread.sleep(3000);
 break;
 case 3:
 step = random.nextInt(10) + 1 + 30;
 smoothlyScrolling(js, 0, step, 7000, true);
 Thread.sleep(3000);
 step = random.nextInt(30) + 1 + 20;
 smoothlyScrolling(js, 0, step, 7000, false);
 Thread.sleep(3000);
 break;
 case 4:
 step = random.nextInt(10) + 1 + 50;
 smoothlyScrolling(js, 0, step, 7000, true);
 Thread.sleep(3000);
 length = random.nextInt(1500) + 1 + 2000;
 step = random.nextInt(25) + 1 + 20;
 smoothlyScrolling(js, length, step, 7000, false);
 Thread.sleep(3000);
 step = random.nextInt(20) + 1 + 30;
 smoothlyScrolling(js, length, step, 7000, true);
 Thread.sleep(3000);
 step = random.nextInt(20) + 1 + 40;
 smoothlyScrolling(js, 0, step, 7000, false);
 Thread.sleep(2000);
 break;
 case 5:
 step = random.nextInt(20) + 1 + 30;
 smoothlyScrolling(js, 0, step, 2500, true);
 Thread.sleep(3000);
 step = random.nextInt(30) + 1 + 40; 
 smoothlyScrolling(js, 0, step, 2500, false);
 Thread.sleep(3000);
 step = random.nextInt(20) + 1 + 30;
 smoothlyScrolling(js, 0, step, 7000, true);
 Thread.sleep(6000);
 step = random.nextInt(20) + 1 + 30;
 smoothlyScrolling(js, 0, step, 7000, false);
 Thread.sleep(2000);
 break;
 default:
 case 6:
 step = random.nextInt(30) + 1 + 30;
 length = random.nextInt(1500) + 1 + 2500;
 smoothlyScrolling(js, 0, step, length, true);
 step = random.nextInt(30) + 1 + 70;
 smoothlyScrolling(js, 0, step, length, false);
 step = random.nextInt(20) + 1 + 40;
 smoothlyScrolling(js, 0, step, 7000, true);
 step = random.nextInt(25) + 1 + 20;
 smoothlyScrolling(js, 0, step, 7000, false);
 Thread.sleep(3000);
 step = random.nextInt(25) + 1 + 20;
 length = random.nextInt(1500) + 1 + 3000;
 smoothlyScrolling(js, 0, step, length, true);
 Thread.sleep(3000);
 step = random.nextInt(15) + 1 + 30;
 smoothlyScrolling(js, 0, step, length, false);
 break;
 }
}

private void smoothlyScrolling(JavascriptExecutor js, int floor, int step, int celling, boolean direction) {
 if (direction) {
 for (int i = floor; i <= celling; i += step) {
 js.executeScript("scrollTo(0," + i + ")");
 }
 } else {
 for (int i = celling; i >= floor; i -= step) {
 js.executeScript("scrollTo(0," + i + ")");
 }
 }
}

用法如下:



两个参数,其中js表示的是driver, sum是一个随机数,表示随机的从7中浏览方式之间选一种


再然后是在python selenium中的应用,我专门写了一个类出来存放浏览界面的代码,然后在需要用到它的地方导入,引用

import random
import time


class PretendScrolling:
    @staticmethod
 def pretend_scrolling(driver, sum):
 step = 1
 length = 1
 if sum == 1:
 step = random.randint(0, 30) + 1 + 50
 length = random.randint(0, 3000) + 1 + 3000
 smoothlyScrolling(driver, 0, step, length, True)
 smoothlyScrolling(driver, 0, step, length, False)

 smoothlyScrolling(driver, 0, step, length, True)
 smoothlyScrolling(driver, length, step, 8000, False)
 step = random.randint(0, 20)+ 1+80
 smoothlyScrolling(driver, 0 ,step, 8000, False)

 length = random.randint(0, 2000) + 1 + 3000
 smoothlyScrolling(driver, 0 , step, length, True)
 smoothlyScrolling(driver, 0 , 20, length, False)
 elif sum == 2:
 step = random.randint(0, 10) + 1 + 40
 smoothlyScrolling(driver, 0, step, 6000, True)
 time.sleep(3)
 smoothlyScrolling(driver, 0, step, 6000, False)

 length = random.randint(0, 1500) + 1 + 1000
 step = random.randint(0, 10) + 1 + 40
 time.sleep(3)
 smoothlyScrolling(driver, 0, step, length, True)
 time.sleep(5)
 step = random.randint(0, 10)+1+40
 smoothlyScrolling(driver, length, step, 7000, True)
 time.sleep(3)

 step = random.randint(0, 20)+1+80
 smoothlyScrolling(driver, 0, step, 8000, False)
 time.sleep(3)
 elif sum == 3:
 step = random.randint(0, 10) + 1 + 30
 length = random.randint(0, 1300) + 1 + 1000
 smoothlyScrolling(driver, 0, step, length, True)
 time.sleep(8)
 step = random.randint(0, 10) + 1 + 20
 smoothlyScrolling(driver, length, step, 6000, True)
 time.sleep(3)
 length = random.randint(0, 1500) + 1 + 1500
 step = random.randint(0, 20) + 1 + 40
 smoothlyScrolling(driver, length, step, 6000, False)
 time.sleep(3)
 step = random.randint(0, 30) + 1 + 70
 smoothlyScrolling(driver, length, step, 6000, True)
 time.sleep(4)
 step = random.randint(0, 20) + 1 + 30
 smoothlyScrolling(driver, 0, step, 6000, False)
 time.sleep(3)
 elif sum == 4:
 step = random.randint(0, 10) + 1 + 30
 smoothlyScrolling(driver, 0, step, 7000, True)
 time.sleep(3)
 step = random.randint(0, 30) + 1 + 20
 smoothlyScrolling(driver, 0, step, 7000, False)
 time.sleep(3)
 elif sum == 5:
 step = random.randint(0, 10) + 1 + 50
 smoothlyScrolling(driver, 0, step, 7000, True)
 time.sleep(3)
 length = random.randint(0, 1500) + 1 + 2000
 step = random.randint(0, 25) + 1 + 20
 smoothlyScrolling(driver, length, step, 7000, False)
 time.sleep(3)
 step = random.randint(0, 20) + 1 + 30
 smoothlyScrolling(driver, length, step, 7000, True) time.sleep(3) step = random.randint(0, 20) + 1 + 40 smoothlyScrolling(driver, 0, step, 7000, False) time.sleep(2) elif sum == 6: step = random.randint(0, 20) + 1 + 30 smoothlyScrolling(driver, 0, step, 2500, True) time.sleep(3)
 step = random.randint(0, 30) + 1 + 40
 smoothlyScrolling(driver, 0, step, 2500, False)
 time.sleep(3)
 step = random.randint(0, 20) + 1 + 30
 smoothlyScrolling(driver, 0, step, 7000, True)
 time.sleep(6)
 step = random.randint(0, 20) + 1 + 30
 smoothlyScrolling(driver, 0, step, 7000, False)
 time.sleep(2)
 elif sum == 7:
 step = random.randint(0, 30) + 1 + 30
 length = random.randint(0, 1500) + 1 + 2500
 smoothlyScrolling(driver, 0, step, length, True)
 step = random.randint(0, 30) + 1 + 70
 smoothlyScrolling(driver, 0, step, length, False)
 step = random.randint(0, 20) + 1 + 40
 smoothlyScrolling(driver, 0, step, 7000, True)
 step = random.randint(0, 25) + 1 + 20
 smoothlyScrolling(driver, 0, step, 7000, False)
 time.sleep(3)
 step = random.randint(0, 25) + 1 + 20
 length = random.randint(0, 1500) + 1 + 3000
 smoothlyScrolling(driver, 0, step, length, True)
 time.sleep(3)
 step = random.randint(0, 15) + 1 + 30
 smoothlyScrolling(driver, 0, step, length, False)
 else:
 pass



#(不知道为啥会乱排版,这是另一个方法)def smoothlyScrolling(driver, floor, step, celling, direction):
 if direction is True:
 i = floor
 while i <= celling:
 i += step
 jscript = "window.scrollTo(1, {height_i})".format(height_i=i)
 driver.execute_script(jscript)
 else:
 i = celling
 while i >= floor:
 i -= step
 jscript = "window.scrollTo(1, {height_i})".format(height_i=i)
 driver.execute_script(jscript)





在python的其它类中引用如下:
首先先在相关类中引入滚动界面的类

然后调用:

以上就是selenium里边滑动界面的一组方法了,支持java和python.

 





因为图片好像没有加载出来,所以再放一下.然后也不知道为啥排版会乱,要用的话自己调整一下即可.

Python selenium —— 一定要会用selenium的等待，三种等待方式解读

发现太多人不会用等待了，博主今天实在是忍不住要给大家讲讲等待的必要性。

很多人在群里问，这个下拉框定位不到、那个弹出框定位不到…各种定位不到，其实大多数情况下就是两种问题：1 有frame，2 没有加等待。殊不知，你的代码运行速度是什么量级的，而浏览器加载渲染速度又是什么量级的，就好比闪电侠和凹凸曼约好去打怪兽，然后闪电侠打完回来之后问凹凸曼你为啥还在穿鞋没出门？凹凸曼分分中内心一万只羊驼飞过，欺负哥速度慢，哥不跟你玩了，抛个异常撂挑子了。

那么怎么才能照顾到凹凸曼缓慢的加载速度呢？只有一个办法，那就是等喽。说到等，又有三种等法，且听博主一一道来：

1. 强制等待
第一种也是最简单粗暴的一种办法就是强制等待sleep(xx)，强制让闪电侠等xx时间，不管凹凸曼能不能跟上速度，还是已经提前到了，都必须等xx时间。

看代码：

# -*- coding: utf-8 -*-
from selenium import webdriver
from time import sleep

driver = webdriver.Firefox()
driver.get(''https://huilansame.github.io'')

sleep(3) # 强制等待3秒再执行下一步

print driver.current_url
driver.quit()
1
2
3
4
5
6
7
8
9
10
11
这种叫强制等待，不管你浏览器是否加载完了，程序都得等待3秒，3秒一到，继续执行下面的代码，作为调试很有用，有时候也可以在代码里这样等待，不过不建议总用这种等待方式，太死板，严重影响程序执行速度。

2. 隐性等待
第二种办法叫隐性等待，implicitly_wait(xx)，隐性等待的意义是：闪电侠和凹凸曼约定好，不论闪电侠去哪儿，都要等凹凸曼xx秒，如果凹凸曼在这段时间内来了，则俩人立即出发去打怪兽，如果凹凸曼在规定时间内没到，则闪电侠自己去，那自然就等着凹凸曼给你抛异常吧。

看代码：

# -*- coding: utf-8 -*-
from selenium import webdriver

driver = webdriver.Firefox()
driver.implicitly_wait(30) # 隐性等待，最长等30秒
driver.get(''https://huilansame.github.io'')

print driver.current_url
driver.quit()
1
2
3
4
5
6
7
8
9
隐形等待是设置了一个最长等待时间，如果在规定时间内网页加载完成，则执行下一步，否则一直等到时间截止，然后执行下一步。注意这里有一个弊端，那就是程序会一直等待整个页面加载完成，也就是一般情况下你看到浏览器标签栏那个小圈不再转，才会执行下一步，但有时候页面想要的元素早就在加载完成了，但是因为个别js之类的东西特别慢，我仍得等到页面全部完成才能执行下一步，我想等我要的元素出来之后就下一步怎么办？有办法，这就要看selenium提供的另一种等待方式——显性等待wait了。

需要特别说明的是：隐性等待对整个driver的周期都起作用，所以只要设置一次即可，我曾看到有人把隐性等待当成了sleep在用，走哪儿都来一下…

3. 显性等待
第三种办法就是显性等待，WebDriverWait，配合该类的until()和until_not()方法，就能够根据判断条件而进行灵活地等待了。它主要的意思就是：程序每隔xx秒看一眼，如果条件成立了，则执行下一步，否则继续等待，直到超过设置的最长时间，然后抛出TimeoutException。

先看个代码示例：

# -*- coding: utf-8 -*-
from selenium import webdriver
from selenium.webdriver.support.wait import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
from selenium.webdriver.common.by import By

driver = webdriver.Firefox()
driver.implicitly_wait(10) # 隐性等待和显性等待可以同时用，但要注意：等待的最长时间取两者之中的大者
driver.get(''https://huilansame.github.io'')
locator = (By.LINK_TEXT, ''CSDN'')

try:
WebDriverWait(driver, 20, 0.5).until(EC.presence_of_element_located(locator))
print driver.find_element_by_link_text(''CSDN'').get_attribute(''href'')
finally:
driver.close()
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
上例中，我们设置了隐性等待和显性等待，在其他操作中，隐性等待起决定性作用，在WebDriverWait..中显性等待起主要作用，但要注意的是：最长的等待时间取决于两者之间的大者，此例中为20，如果隐性等待时间 > 显性等待时间，则该句代码的最长等待时间等于隐性等待时间。

我们主要用到了WebDriverWait类与expected_conditions模块，下面博主带大家细看一下这两个模块：

WebDriverWait
wait模块的WebDriverWait类是显性等待类，先看下它有哪些参数与方法：

selenium.webdriver.support.wait.WebDriverWait（类）
1
__init__

driver: 传入WebDriver实例，即我们上例中的driver
timeout: 超时时间，等待的最长时间（同时要考虑隐性等待时间）
poll_frequency: 调用until或until_not中的方法的间隔时间，默认是0.5秒
ignored_exceptions: 忽略的异常，如果在调用until或until_not的过程中抛出这个元组中的异常，
则不中断代码，继续等待，如果抛出的是这个元组外的异常，则中断代码，抛出异常。默认只有NoSuchElementException。
1
2
3
4
5
until

method: 在等待期间，每隔一段时间（__init__中的poll_frequency）调用这个传入的方法，直到返回值不是False
message: 如果超时，抛出TimeoutException，将message传入异常
1
2
until_not

与until相反，until是当某元素出现或什么条件成立则继续执行，
until_not是当某元素消失或什么条件不成立则继续执行，参数也相同，不再赘述。
1
2
看了以上内容基本上很清楚了，调用方法如下：

WebDriverWait(driver, 超时时长, 调用频率, 忽略异常).until(可执行方法, 超时时返回的信息)
1
这里需要特别注意的是until或until_not中的可执行方法method参数，很多人传入了WebElement对象，如下：

WebDriverWait(driver, 10).until(driver.find_element_by_id(''kw'')) # 错误
1
这是错误的用法，这里的参数一定要是可以调用的，即这个对象一定有 __call__() 方法，否则会抛出异常：

TypeError: ''xxx'' object is not callable
1
在这里，你可以用selenium提供的 expected_conditions 模块中的各种条件，也可以用WebElement的 is_displayed() 、is_enabled()、is_selected() 方法，或者用自己封装的方法都可以，那么接下来我们看一下selenium提供的条件有哪些：

expected_conditions
expected_conditions是selenium的一个模块，其中包含一系列可用于判断的条件：

selenium.webdriver.support.expected_conditions（模块）
1
以下两个条件类验证title，验证传入的参数title是否等于或包含于driver.title
title_is
title_contains

以下两个条件验证元素是否出现，传入的参数都是元组类型的locator，如(By.ID, ‘kw’)
顾名思义，一个只要一个符合条件的元素加载出来就通过；另一个必须所有符合条件的元素都加载出来才行
presence_of_element_located
presence_of_all_elements_located

以下三个条件验证元素是否可见，前两个传入参数是元组类型的locator，第三个传入WebElement
第一个和第三个其实质是一样的
visibility_of_element_located
invisibility_of_element_located
visibility_of

以下两个条件判断某段文本是否出现在某元素中，一个判断元素的text，一个判断元素的value
text_to_be_present_in_element
text_to_be_present_in_element_value

以下条件判断frame是否可切入，可传入locator元组或者直接传入定位方式：id、name、index或WebElement
frame_to_be_available_and_switch_to_it

以下条件判断是否有alert出现
alert_is_present

以下条件判断元素是否可点击，传入locator
element_to_be_clickable

以下四个条件判断元素是否被选中，第一个条件传入WebElement对象，第二个传入locator元组
第三个传入WebElement对象以及状态，相等返回True，否则返回False
第四个传入locator以及状态，相等返回True，否则返回False
element_to_be_selected
element_located_to_be_selected
element_selection_state_to_be
element_located_selection_state_to_be

最后一个条件判断一个元素是否仍在DOM中，传入WebElement对象，可以判断页面是否刷新了
staleness_of

上面是所有17个condition，与until、until_not组合能够实现很多判断，如果能自己灵活封装，将会大大提高脚本的稳定性。

python+selenium十：selenium的二次封装

python+selenium十：基于原生selenium的二次封装

from selenium import webdriver
from selenium.webdriver.support.wait import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
from selenium.webdriver.support.select import Select
from selenium.webdriver.common.action_chains import ActionChains
from selenium.webdriver.common.by import By
# BY的用法
# driver.find_element("id", "kw")
# driver.find_element(By.ID, "kw")

class Bace():
    ''''''基于原生的selenium做二次封装''''''

    def __init__(self, driver:webdriver.Firefox):  # driver:webdriver.Firefox：映射driver 为webdriver.Firefox
        self.driver = driver
        self.timeout = 10
        self.t = 0.5

    def find(self, locator, value=''''):
        ''''''  定位到元素，返回元素对象，没定位到，Timeout异常 loctor 传元祖，如（"id", "kw"）  ''''''
        if not isinstance(locator, tuple):
            print(''locator参数类型错误，必须传元祖类型：loc = ("id", "value1")'')
        else:
            print(f"正在定位元素信息：定位方式->{locator[0]}, 元素值->{locator[1]}，value值->{value}")
            if value != '''':  # value值定位
                ele = WebDriverWait(self.driver, self.timeout, self.t).until(EC.text_to_be_present_in_element_value(locator, value))
                return ele
            else:  # 默认为此常规定位方法
                ele = WebDriverWait(self.driver, self.timeout, self.t).until(EC.presence_of_element_located(locator))
                if ele:
                    return ele
                else:
                    print(f"定位失败：定位方式->{locator[0]}, value值->{locator[1]}")
                    return False

    def finds(self, locator, value=''''):
        '''''' 定位到元素，返回元素对象，没定位到，Timeout异常  loctor 传元祖，如（"id", "kw"）  ''''''
        if not isinstance(locator, tuple):
            print(''locator参数类型错误，必须传元祖类型：loc = ("id", "value1")'')
        else:
            print(f"正在定位元素信息：定位方式->{locator[0]}, 元素值->{locator[1]}，value值->{value}")
            if value != '''':  # value值定位
                eles = WebDriverWait(self.driver, self.timeout, self.t).until(EC.text_to_be_present_in_element_value(locator, value))
                return eles
            else:  # 默认为此常规定位方法
                eles = WebDriverWait(self.driver, self.timeout, self.t).until(EC.presence_of_element_located(locator))
                if eles:
                    return eles
                else:
                    print(f"定位失败：定位方式->{locator[0]}, value值->{locator[1]}")
                    return []

    def sendKeys(self, locator, text):
        try:
            self.find(locator).send_keys(text)
        except:
            print(f"输入 {text} 失败")

    def click(self, locator):
        try:
            self.find(locator).click()
        except:
            print("点击失败")

    def clear(self, locator):
        try:
            self.find(locator).clear()
        except:
            print("清空内容失败")

    def isSelected(self, locator, Type=''''):
        '''''' 判断元素是否被选中，返回bool值 及点（选中/取消选中）''''''
        ele = self.find(locator)
        try:
            if Type == '''':  # 如果type参数为空，返回元素是否为选中状态，True/False (默认)
                r = ele.is_selected()
                return r
            elif Type == ''click'':  # 如果type参数为click，执行元素的点击操作
                ele.click()
            else:
                print(f"type参数 {Type} 错误，仅可为click或''''")
        except:
            return False

    def isElementExist(self, locator):
        '''''' 判断单个元素是否在DOM里面 （是否存在）''''''
        try:
            self.find(locator)
            return True
        except:
            return False

    def isElementExists(self, locator):
        '''''' 判断一组元素是否在DOM里面 （是否存在），若不存在，返回一个空的list''''''
        eles = self.finds(locator)
        n = len(eles)
        if n == 0:
            return False
        elif n == 1:
            return True
        else:
            print(f"定位到元素的个数：{n}")
            return True

    def title(self, title, Type=''contains''):
        ''''''  根据传入的type类型判断title  ''''''
        try:
            if Type == ''is'':  # 判断当前网页title名为title   返回bool值
                result = WebDriverWait(self.driver, self.timeout, self.t).until(EC.title_is(title))
                return result
            elif Type == ''contains'':  # 判断当前网页title名含title   返回bool值 (默认)
                result = WebDriverWait(self.driver, self.timeout, self.t).until(EC.title_contains(title))
                return result
            else:
                print(f"type参数 {Type} 错误，仅可为is、contains")
        except:
            return False

    def in_element(self, locator, value, Type=''text''):
        ''''''  根据传入的type判断内容是否在指定元素里面  ''''''
        if not isinstance(locator, tuple):
            print(''locator参数类型错误，必须传元祖类型：loc = ("id", "value1")'')
        try:
            if Type == ''text'':  # 判断当前获取到的text含value   返回bool值 （默认）
                result = WebDriverWait(self.driver, self.timeout, self.t).until(EC.text_to_be_present_in_element(locator, value))
                return result
            elif Type == ''value'':  # 判断当前获取到的value含value 返回bool值, value为空字符串，返回False
                result = self.find(locator, value)
                return result
            else:
                print(f"type参数 {Type} 错误，仅可使用text或value属性定位")
                return False
        except:
            return False

    def alert(self, timeout=3, Type=''''):
        '''''' 根据传入的type判断alert弹窗及操作 ''''''
        result = WebDriverWait(self.driver, timeout, self.t).until(EC.alert_is_present())
        try:
            if Type == '''':  # 判断alert是否存在，如果有，就返回alert对象 （默认）
                if result:
                    return result
                else:
                    print("alert不存在")
                return False
            elif Type == ''yes'':  # 执行alert的确定按钮
                result.accept()
            elif Type == ''no'':  # 执行alert的取消按钮
                result.dismiss()
            else:
                print(f"type参数 {Type} 错误，仅可为yes、no、或''''")
        except:
            return False

    def get(self, locator, Type=''text'', name=''''):
        '''''' 根据传入的type判断获取指定的内容 （title、text、attribute）
        type==attribute:  获取元素属性  name:属性  className、name、text、value···  ''''''
        try:
            if Type == ''title'':  # 获取当前网页 title
                return self.driver.title
            elif Type == ''text'':  # 获取元素文本值（默认）
                return self.find(locator).text
            elif Type == ''attribute'':  # 获取当前元素属性
                return self.find(locator).get_attribute(name)
            else:
                print(f"给的type参数 {Type} 错误，仅可用title、text、attribute")
        except:
            print(f"获取 {Type} 值失败")
            return ''''

    def select(self, locator, value, Type=''index''):
        ''''''  下拉选项框 根据传入的type值判断（index、value、text）  ''''''
        element = self.find(locator)  # 定位select这一栏
        try:
            if Type == ''index'':  # 用下标选择 （默认）
                Select(element).select_by_index(value)
            elif Type == ''value'':  # 根据value值选择
                Select(element).select_by_value(value)
            elif Type == ''text'':  # 根据选项的文本内容选择
                Select(element).select_by_visible_text(value)
            else:
                print(f"给的type参数 {Type} 错误，仅可为：int、text、value")
        except:
            print(f"根据 {value} 操作下拉框失败")

    def iframe(self, id_index_locator):
        '''''' 常规切换 iframe''''''
        try:
            if isinstance(id_index_locator, int):  # 如果传入的是数字，则以该数字为下标取值
                self.driver.switch_to.frame(id_index_locator)
            elif isinstance(id_index_locator, str):  # 如果传入的是字符串，则用iframe名字取值
                self.driver.switch_to.frame(id_index_locator)
            elif isinstance(id_index_locator, tuple):  # 如果是元祖，则根据传入的locator取值
                ele = self.find(id_index_locator)
                self.driver.switch_to.frame(ele)
        except:
             print("iframe切换异常")

    def handle(self, value):
        '''''' 句柄切换，index、句柄名 ''''''
        try:
            if isinstance(value, int):  # 切换到该下标对应的窗口
                handles = driver.window_handles
                self.driver.switch_to.window(handles[value])
            elif isinstance(value, str):  # 切换到该句柄名称对应的窗口
                self.driver.switch_to.window(value)
            else:
                print(f"传入的type参数 {value} 错误，仅可传int、str")
        except:
            print(f"根据 {value} 获取句柄失败")

    def move_to_element(self, locator):
        '''''' 鼠标悬停操作 ''''''
        try:
            ele = self.find(locator)
            ActionChains(self.driver).move_to_element(ele).perform()
        except:
            print("鼠标悬停操作失败")
            return False
    ''''''==============================js与jQuery相关=====================================''''''

    def js_focus_element(self, locator):
        '''''' 聚焦元素 ''''''
        target = self.find(locator)
        self.driver.execute_script("arguments[0].scrollIntoView();", target)

    def js_scroll_top(self):
        '''''' 滚动到顶部 ''''''
        js = "window.scrollTo(0,0)"
        self.driver.execute_script(js)

    def js_scroll_end(self, x=0):
        '''''' 滚动到底部 ''''''
        js = f"window.scrollTo({x},document.body.scrollHeight)"
        self.driver.execute_script(js)

    def js_find(self, action):
        ''''''  js查找元素，并做相应操作（默认id属性） 输入值：value=''XXX''  点击：click() ''''''
        js = f"document.getElementById(“id”).{action}"
        self.driver.execute_script(js)

    def js_finds(self, Type, element, index, action):
        ''''''   js查找元素，并做相应操作   输入值：value=''XXX''     点击：click()
        js定位仅可为：id、Name、TagName、ClassName、Selector（CSS） ''''''
        list = [''Name'', ''TagName'', ''ClassName'', ''Selector'']
        if type in list:
            print(f"正在执行js操作：定位方式->{Type}, 元素值->{element}， 下标值->{index}， 执行操作->{action}")
            if type == ''Selector'':
                js = f''document.query{Type}All("{element}"){index}.{action}''
            else:
                js = f''document.getElementsBy{Type}({element})[{index}].{action};''
            self.driver.execute_script(js)
        else:
            print(f"type参数 {Type} 错误，js定位仅可为：''Name''、''TagName''、''ClassName''、''Selector''（CSS）")

    def js_readonly(self, idElement, value):
        '''''' 去掉只读属性，并输入内容 一般为id ''''''
        js = f''document.getElementById({idElement}).removeAttribute("readonly");document.getElementById({idElement}).value="{value}"''
        driver.execute_script(js)

    def js_iframe(self, Type, element, action, index=''''):
        ''''''  Js处理iframe   无需先切换到iframe上，再切回来操作
        输入值：value=''''         点击：click()           type=id时，index=''''  ''''''
        js = f''document.getElementBy{Type}({element}){index}.contentWindow.document.body.{action}''
        driver.execute_script(js)
    ''''''
        jquery = ''$(CSS).val("XXX");''   # 根据css语法定位到元素，输入内容
        jquery = ''$(CSS).val('''');''      # 清空
        jquery = ''$(CSS).click();''      # 点击
        driver.execute_script(jquery)
    ''''''

    # def switch_alert(self):
    #     '''''' 获取alert弹窗 ''''''
    #     r = self.is_alert()
    #     if not r:
    #         print("alert不存在")
    #     else:
    #         return r

    # def is_title(self, title):
    #     ''''''判断当前title名为title   返回bool值''''''
    #     try:
    #         result = WebDriverWait(self.driver, self.timeout, self.t).until(EC.title_is(title))
    #         return result
    #     except:
    #         return False
    # def is_title_contains(self, title):
    #     ''''''判断当前title名含title   返回bool值''''''
    #     try:
    #         result = WebDriverWait(self.driver, self.timeout, self.t).until(EC.title_contains(title))
    #         return result
    #     except:
    #         return False

    # def is_text_in_element(self, locator, _text=''''):
    #     ''''''判断当前获取到的text含_text=''''   返回bool值''''''
    #     if not isinstance(locator, tuple):
    #         print(''locator参数类型错误，必须传元祖类型：loc = ("id", "value1")'')
    #     try:
    #         result = WebDriverWait(self.driver, self.timeout, self.t).until(EC.text_to_be_present_in_element(locator, _text))
    #         return result
    #     except:
    #         return False
    # def is_value_in_element(self, locator, _value=''''):
    #     ''''''返回bool值, value为空字符串，返回False''''''
    #     if not isinstance(locator, tuple):
    #         print(''locator参数类型错误，必须传元祖类型：loc = ("id", "value1")'')
    #     try:
    #         result = WebDriverWait(self.driver, self.timeout, self.t).until(EC.text_to_be_present_in_element_value(locator, _value))
    #         return result
    #     except:
    #         return False

    # def get_title(self):
    #     ''''''获取title''''''
    #     return self.driver.title
    # def get_text(self, locator):
    #     ''''''获取文本''''''
    #     try:
    #         t = self.find(locator).text
    #         return t
    #     except:
    #         print("获取text失败，返回'''' ")
    #         return ""
    # def get_attribute(self, locator, name):
    #     ''''''获取属性''''''
    #     try:
    #         element = self.find(locator)
    #         return element.get_attribute(name)
    #     except:
    #         print("获取%s属性失败，返回'''' "%name)
    #         return ""

    # def select_by_index(self, locator, index=0):
    #     ''''''通过索引,index是索引第几个，从0开始，默认选第一个''''''
    #         element = self.find(locator)  # 定位select这一栏
    #         Select(element).select_by_index(index)

    # def select_by_value(self, locator, value):
    #     ''''''通过value属性''''''
    #     element = self.find(locator)
    #     Select(element).select_by_value(value)

    # def select_by_text(self, locator, text):
    #     ''''''通过文本值定位''''''
    #     element = self.find(locator)
    #     Select(element).select_by_visible_text(text)

    # def switch_handle_window_name(self, window_name):
    #     '''''' 根据句柄名字切换句柄 ''''''
    #     self.driver.switch_to.window(window_name)
    # def switch_handle_index(self, index):
    #     ''''''  根据句柄下标切换句柄  ''''''
    #     handles = driver.window_handles
    #     self.driver.switch_to.window(handles[index])

    # def js_find(self, action):
    #     ''''''
    #     输入值：value=''XXX''     点击：click()
    #     ''''''
    #     print("正在执行js操作，操作行为：%s"%action)
    #     js = "document.getElementById(“id”).%s"%action
    #     self.driver.execute_script(js)

if __name__ == "__main__":
    driver = webdriver.Firefox()
    driver.get("")
    zentao = Base(driver)
    # loc1 = (By.ID, "account")
    # loc2 = (By.CSS_SELECTOR, "[name=''password'']")
    # loc3 = (By.XPATH, "//*[@id=''submit'']")

    loc1 = ("id", "account")
    loc2 = ("css selector", "[name=''password'']")
    loc3 = ("xpath", "//*[@id=''submit'']")
    zentao.sendKeys(loc2, 123)

    zentao.move_to_element(loc3)

今天关于用selenium刮网和selenium怎么用的介绍到此结束，谢谢您的阅读，有关c# – 是否可以在不安装Selenium Server的情况下使用ISelenium / DefaultSelenium？、java+selenium, python+selenium浏览界面,滑动页面、Python selenium —— 一定要会用selenium的等待，三种等待方式解读、python+selenium十：selenium的二次封装等更多相关知识的信息可以在本站进行查询。

本文标签：