Coder Social home page Coder Social logo

bossboss13 / easyspider Goto Github PK

View Code? Open in Web Editor NEW

This project forked from naibowang/easyspider

1.0 0.0 0.0 81.32 MB

A visual no-code/code-free web crawler/spider一个可视化爬虫软件,可以无代码图形化设计和执行的爬虫任务

License: GNU Affero General Public License v3.0

Shell 0.18% JavaScript 68.20% Python 13.74% TypeScript 0.18% CSS 0.42% HTML 14.67% Batchfile 0.20% Vue 2.41%

easyspider's Introduction

请您Star/Please Star

如果您觉得此工具不错,请轻轻点击此页面右上角Star按钮增加项目曝光度,谢谢!软件完全免费(商用除外),只求大家Star和宣传给其他需要的朋友,谢谢!

If you think this tool is good, please gently click the Star button in the upper right corner at this page to increase the project exposure, thank you! The software is completely free (except for commercial use), only ask everyone to Star and promote it to other friends in need, thank you!

官方网站/Official Website

访问易采集官网:www.easyspider.cn

Visit the official website of EasySpider: www.easyspider.net

易采集/EasySpider: Visual Code-Free Web Crawler

一个可视化爬虫软件,可以使用图形化界面,无代码可视化的设计和执行爬虫任务。只需要在网页上选择自己想要爬的内容并根据提示框操作即可完成爬虫设计和执行。同时软件还可以单独以命令行的方式进行执行,从而可以很方便的嵌入到其他系统中。

A visual code-free/no-code web crawler/spider, just select the content you want to crawl on the web page and operate according to the prompt box to complete the design and execution of the crawler. At the same time, the software can be executed by command line alone, so it can be easily embedded into other systems.

animation_zh

animation_en

下载易采集/Download EasySpider

进入 Releases Page 下载最新版本。如果下载速度慢,可以考虑**境内下载地址:**境内下载地址

加QQ群从群文件下载是国内下载最快的方式,但使用软件的过程中发生了问题求助还是请从GitHub提issue,因为群主不怎么看群,群号:682921940

Refer to the Releases Page to download the latest version of EasySpider.

文档/Documentation

请点此进入教程文档,如有英文可暂时翻译一下,或看作者的硕士毕业论文(主要看第三章和第五章)。

Ebay样例博客:https://blog.csdn.net/ihero/article/details/130805504

Documentation can be found from GitHub Wiki.

视频教程/Video Tutorials

Bilibili/B站视频教程:

EasySpider介绍 - **地震台网采集案例

设置页面向下滚动

如何无代码可视化的爬取需要登录才能爬的网站 - 知乎网站案例

实战采集汽车网文章内容并下载文章内图片

定时执行任务+选中子元素多种模式+将提取值作为变量输入

【重要】自定义条件判断之使用循环项内的JS命令返回值 - 第二弹

流程图执行逻辑解析 - 58同城房源描述采集案例

MacOS系统设计和执行eBay网站爬虫任务教程

如何执行自己写的JS代码和系统代码 (自定义操作)

如何自定义循环和判断条件 - 第一弹

如何对元素和网页截图及命令行执行指南

OCR识别元素内容功能

如何爬需要输入验证码的网站

如何切换IP池和使用隧道IP - 打开详情页采集案例

如何同时执行多个任务(并行多开)

Python代码运算后的结果作为文本框的输入

实例 - 反人类网站文章采集和代码调试

Refer to Youtube Playlist to see the video tutorials of EasySpider.

样例任务/Sample Tasks

从本项目的Examples文件夹中下载样例任务,更名为大于0的数字,导入到EasySpider中的tasks文件夹中,然后在EasySpider中打开即可。

Download sample tasks from the Examples folder of this project, rename them to numbers greater than 0, import them into the tasks folder in EasySpider, and then open them in EasySpider.

声明/Declaration

本软件仅供学习交流使用,严禁使用软件进行任何违法违规的操作,如爬取不允许爬取的政府/军事机关网站等。使用本软件所造成的一切后果由使用者自负,与作者本人无关,作者不会承担任何责任

This software is for learning and communication only. It is strictly forbidden to use the software for any illegal operations, such as crawling government/military websites that are not allowed to be crawled. All consequences caused by the use of this software are at the user's own risk, and the author is not responsible for any consequences.

对于政府和军事机关等网站的爬虫操作,作者将不会进行任何答疑,以免违反国家相关法律法规和政策。

For the crawler operations of government and military websites, the author will not answer any questions in order to avoid violating relevant national laws, regulations and policies.

同时,软件受到专利权保护,如要用于商业用途,请联系杭州天勤知识产权代理有限公司进行专利授权等付费操作。

At the same time, the software is protected by patent rights. If you want to use it for commercial purposes, please contact Hangzhou Tianqin Intellectual Property Agency for patent authorization and other paid operations.

出版物/Publications

编译说明/Compilation Instructions

查看编译说明

Refer to Compilation Instructions.

中文界面截图

软件界面示例

pic

块和子块及表单定义

pic

已选中和待选择示例

pic

京东商品块选择示例:

pic

京东商品标题自动匹配选择示例

pic

分块选择所有子元素示例

pic

同类型元素自动和手动匹配示例

pic

四种选择方式示例

pic

输入文字示例

pic

循环点击58同城房屋标题以进入详情页采集示例

pic

采集元素文本示例

pic

流程图界面介绍

pic

循环选项示例

pic

循环点击下一页示例

pic

条件分支示例

pic

完整采集流程图示例

pic

完整采集流程图转换为常规流程图示例

pic

服务信息示例

pic

服务调用示例

pic

58 同城房源信息采集服务部分采集结果展示

pic

easyspider's People

Contributors

naibowang avatar dependabot[bot] avatar eltociear avatar yfdyh000 avatar

Stargazers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.