trainsong / darknet_chinesetrading

This project forked from s045pd/darknet_chinesetrading

🚇 Monitoring crawler for Chinese-language darknet sites

Home Page: https://gcokedsa123.grafana.net/dashboard/snapshot/2OJ9OtmtitwiGcIqwIVhvzgmTKDBtTkF

License: MIT License

Python 99.72% Shell 0.28%

darknet_chinesetrading's Introduction

DarkNet_ChineseTrading - real-time monitoring crawler for Chinese-language darknet sites

En_Doc

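Monitoring dashboard (built quickly with Grafana)

Features

  • Tor node switching
  • Automatic registration
  • Automatic login
  • Ban avoidance
  • ORM-based database access
  • Recording of event details and sample information
  • Event notifications via Telegram (text and images)

Join us: https://t.me/fordarknetspiderbot

Installation (macOS)

  • Python environment setup

    Download and install Anaconda 3.5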

     pip install -r ./requirements.txt
     pip install -U 'requests[socks]'
    
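  • Tor installation

    Tor currently needs to be updated to version 0.4.0.0; older versions will sometimes fail to retrieve data. If the command below does not install the latest version, compiling and installing the latest source package from the official website is recommended.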

     brew install tor
     	
     cd /usr/local/etc/tor
     cp torrc.sample ./torrc
     vi torrc
    

    Add the following settings to torrc, then run restart_tor.sh to start Tor

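     SOCKSPort 9150                # SOCKS5 proxy address
     Socks5Proxy 127.0.0.1:1086    # upstream proxy used to get past network restrictions (omit if your connection is already unrestricted)
     RunAsDaemon 1                 # run in the background
     ControlPort 9151              # enable the control port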
    
  • OCR (macOS)

    Recognition accuracy is fairly low; the recognizer can be swapped out in get_captcha in parser.py

     brew install tesseract
    

    snum.traineddata

  • Storage environment

    Install Docker, then pull the Redis and MySQL images

  • Running

    Fill in the connection settings and TelegramRobotToken in config_dev.py

     mv config_dev.py conf.py 
     bash restart_tor.sh
     bash restart_task.sh
     python run.py
     
    
  • Execution flow

  • Screenshots of results

    • telegram

    • run.py

  • Additional commands

     python3 run.py --help
    
     Usage: run.py [OPTIONS]
    
     Options:
       --debug        Print debug log
       --domain TEXT  Target domain.
       --save_error   Whether to save the error log
       --update       Whether it has only been updated to crawl
       --help         Show this message and exit.
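
The help output above suggests three boolean flags and one option that takes a value. The following is a minimal sketch of how such options could be declared. It uses the standard-library argparse purely for illustration; it is not taken from run.py, and the defaults and the `build_parser` name are assumptions.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Declare the flags listed by `run.py --help` (illustrative stand-in)."""
    parser = argparse.ArgumentParser(prog="run.py")
    # Flags without a value in the help text are modelled as booleans (assumption).
    parser.add_argument("--debug", action="store_true", help="Print debug log")
    # --domain is shown as taking a TEXT value; the default of None is an assumption.
    parser.add_argument("--domain", type=str, default=None, help="Target domain.")
    parser.add_argument("--save_error", action="store_true",
                        help="Whether to save the error log")
    parser.add_argument("--update", action="store_true",
                        help="Whether it has only been updated to crawl")
    return parser
```

Calling `build_parser().parse_args(["--debug", "--domain", "example.onion"])` yields a namespace with `debug` set, `domain` holding the given string, and the remaining flags left at `False`.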
    
    
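The torrc settings in the Tor installation step expose a SOCKS5 proxy on port 9150. As a rough sketch of how a Python client could use it, the helper below builds a proxy mapping for requests (installed above via `requests[socks]`). The function names are hypothetical and are not part of this project; the `socks5h` scheme is used so that hostnames are resolved by the proxy rather than locally.

```python
def tor_proxies(host: str = "127.0.0.1", port: int = 9150) -> dict:
    """Build a requests-style proxy mapping that points at Tor's SOCKS port.

    The port default matches the SOCKSPort value in the torrc example;
    the host default assumes Tor runs on the same machine.
    """
    url = f"socks5h://{host}:{port}"
    return {"http": url, "https": url}


def fetch_via_tor(url: str, timeout: float = 30.0) -> str:
    """Fetch a page through Tor. Needs requests[socks] and a running Tor."""
    import requests  # imported lazily so tor_proxies() works without it

    response = requests.get(url, proxies=tor_proxies(), timeout=timeout)
    response.raise_for_status()
    return response.text
```

If Tor listens on a different port, pass it explicitly, for example `tor_proxies(port=9050)`.
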

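Before starting the crawler, it can help to confirm that the Redis and MySQL containers from the storage step are reachable. The sketch below only tests whether a TCP port accepts connections. It assumes the default ports 6379 (Redis) and 3306 (MySQL) on localhost, which may differ from the values in your conf.py, and the function names are hypothetical.

```python
import socket


def port_is_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Refused, timed out, or unreachable: treat all as "not available".
        return False


def check_storage(host: str = "127.0.0.1") -> dict:
    """Probe the assumed default Redis and MySQL ports on the given host."""
    services = {"redis": 6379, "mysql": 3306}  # default ports, an assumption
    return {name: port_is_open(host, port) for name, port in services.items()}
```

`check_storage()` returns a dictionary keyed by service name, so a `False` entry shows which container still needs to be started.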