想问一下各位大佬，用xpath爬取淘宝商品价格，结果定位到价格这里是繁体字，没有数字。这种情况该如何爬取。

爬取当前页的内容

    for row in range(1, 16):  # 假设有15行，可以根据实际情况进行调整
        for position in range(1, 5):  # 每行有4个商品，可以根据实际情况进行调整
            title_xpath = f'//*[@id="J_ShopSearchResult"]/div/div[3]/div[{row}]/dl[{position}]/dd[2]/a'
            # 爬取商品价格
            price_xpath = f'//*[@id="J_ShopSearchResult"]/div/div[3]/div[{row}]/dl[{position}]/dd[2]/div/div[1]'
            img_xpath = f'//*[@id="J_ShopSearchResult"]/div/div[3]/div[{row}]/dl[{position}]/dt/a/img'

            title_element = driver.find_element(By.XPATH, title_xpath)
            price_element = driver.find_element(By.XPATH, price_xpath)
            img_element = driver.find_element(By.XPATH, img_xpath)

            title = title_element.text
            price = price_element.text
            img_url = img_element.get_attribute('src')

            if title:
                if not img_url.startswith(('http://', 'https://')):
                    img_url = 'https:' + img_url  # 补全协议部分

                item_list.append({'title': title, 'price': price, 'img_url': img_url})

import re # 定义匹配规则 pattern = r'[\u4e00-\u9fa5]+元' # 爬取商品价格 html = requests.get('https://s.taobao.com/search?q=商品名称') soup = BeautifulSoup(html.content, 'html.parser') items = soup.select('.J_MouserOnverReq') # 提取价格 for item in items: price = item.select_one('.J_CalcPrice').text price = re.findall(pattern, price)[0] print(price)

探索云世界

热门

云计算

大数据

云原生

人工智能

数据库

开发与运维

活动广场

任务中心

训练营

直播

乘风者计划

下载

镜像站

技术资料

想问一下各位大佬，用xpath爬取淘宝商品价格，结果定位到价格这里是繁体字，没有数字。这种情况该如何

爬取当前页的内容