Python-爬虫-爬取知乎的标题和当页显示的文字-白红宇

Python-爬虫-爬取知乎的标题和当页显示的文字

阅读量：7212 次

发布时间：2019-06-29

本文共 850 字，大约阅读时间需要 2 分钟。

# coding:utf-8import requestsfrom bs4 import BeautifulSoupquesNumStr = str(input("请输入搜索关键字："))url = 'https://www.zhihu.com/search?type=content&q='+quesNumStrheaders = {    'User-Agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_12_0) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/55.0.2883.95 Safari/537.36'  # your user-Agent here}data = requests.get(url, headers=headers)soup = BeautifulSoup(data.text, 'lxml')liList = soup.select('li')print(len(liList))for li in liList:    try:        temp1 = li.select('a[class="js-title-link"]')        if temp1:            print('The title is :')            print(temp1[0].get_text())        temp2 = li.select('div[class="summary hidden-expanded"]')        if temp2:            print('The content is:')            print(temp2[0].text)    except:        pass

转载于:https://www.cnblogs.com/fredkeke/p/7003923.html

你可能感兴趣的文章

美国网络司令部133支网络部队已拥有初步作战能力

查看>>

如何看待阿里云加入Linux基金会金牌会员？

查看>>

三大应用需求：5G信道编码技术取得突破

查看>>