Coder Social home page Coder Social logo

lgl1227 / liepin_crawler Goto Github PK

View Code? Open in Web Editor NEW

This project forked from panser1636/liepin_crawler

0.0 0.0 0.0 24.91 MB

使用scrapy框架爬虫爬取猎聘求职网站的岗位信息,并存储于Mysql数据库中,可数据展示、数据分析和按职位类别、地点等查询信息

liepin_crawler's Introduction

Liepin_crawler

主要功能

1、使用scrapy框架对猎聘(招聘)网站进行多页爬取,爬取的信息为python行业的工作岗位,其对应的工作地点、工作薪资和工作要求;

2、对爬取到的信息基于管道化存储到Mysql数据库,为了可以导出运行文件,运行文件中为Sqlite3数据库(也可保存到本地数据库中);

3、对Mysql的数据进行处理和数据分析,分析得到每个城市的平均薪酬、每个岗位的平均薪酬以及每个城市的岗位数量图;

4、使用streamlit方法将数据内容和数据分析的结果在网页显示,前端用户可根据岗位名称、岗位地点、薪资字段进行筛选、查询相关数据。

使用手册

1、首先进入工程文件夹 : cd bosspro2 # 工程文件夹的路径

2、然后执行命令: scrapy crawl boss2 即可运行爬虫文件

image

3、打开 show_data 文件夹

直接在命令行执行 streamlit run D:\pachong\show_data\showdata.py # 该文件的路径

左边区域为复选框,选择不同的选项会筛选出三个选项中至少符合其中一个的选项的内容;

将表格数据拉至底端,会出现一个数据展示的按钮,点击该按钮,即可浏览数据分析的图

liepin_crawler's People

Contributors

panser1636 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.