使用python请求和BeatifulSoup在维基百科页面上抓取多个表及其标题？

使用Python请求和BeautifulSoup在维基百科页面上抓取多个表及其标题可以通过以下步骤实现：

导入所需的库：import requests from bs4 import BeautifulSoup
发送HTTP请求获取维基百科页面的HTML内容：url = 'https://zh.wikipedia.org/wiki/维基百科' response = requests.get(url) html_content = response.text
使用BeautifulSoup解析HTML内容：soup = BeautifulSoup(html_content, 'html.parser')
定位需要抓取的表格元素：tables = soup.find_all('table')
遍历每个表格，获取表格标题和内容：for table in tables: # 获取表格标题 caption = table.caption.text.strip() print('表格标题：', caption) # 获取表格内容 rows = table.find_all('tr') for row in rows: cells = row.find_all('td') for cell in cells: print(cell.text.strip())

完整代码示例：

import requests
from bs4 import BeautifulSoup

url = 'https://zh.wikipedia.org/wiki/维基百科'
response = requests.get(url)
html_content = response.text

soup = BeautifulSoup(html_content, 'html.parser')

tables = soup.find_all('table')

for table in tables:
    caption = table.caption.text.strip()
    print('表格标题：', caption)
    
    rows = table.find_all('tr')
    for row in rows:
        cells = row.find_all('td')
        for cell in cells:
            print(cell.text.strip())

这样就可以使用Python请求和BeautifulSoup在维基百科页面上抓取多个表及其标题了。

推荐的腾讯云相关产品和产品介绍链接地址：