A Generic Web Crawler.
https://drive.google.com/file/d/1QfTQnTM7aYRzOyj_ifgVrS9LRGwCw94_/view?usp=sharing
Port | Service Id | Service Name |
---|---|---|
8761 | service-registry | Service Registry |
9100-9109 | --- | Infrastructure |
9100 | gateway | Gateway Server |
9101 | config | Configuration Server |
9200-9299 | --- | API Service |
9210 | zhihu-api | Zhihu API |
9300-9399 | --- | API Service |
9310 | zhihu-api-throttle | Zhihu API Throttle |
9800 | hub-admin | Topic Administration |
9810 | hub-topic | Topic Hub |
9820 | hub-people | People Hub |
9830 | hub-deed | Deed Hub |
9840 | hub-tag | Tag Hub |
9900 | api-executor | API Request Executor |
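The port layout above can be captured as a small lookup, which is handy for checking that no two services claim the same port. This is only a sketch: the ports, service ids, and range labels are copied from the table, while the names `PORT_GROUPS`, `SERVICE_PORTS`, `group_for_port`, and `find_port_conflicts` are hypothetical and not part of the project.

```python
from typing import Dict, List, Optional

# Port ranges and their group labels, copied from the table above.
PORT_GROUPS = [
    (range(9100, 9110), "Infrastructure"),
    (range(9200, 9300), "API Service"),
    (range(9300, 9400), "API Service"),
]

# Service id -> port, copied from the table above.
SERVICE_PORTS: Dict[str, int] = {
    "service-registry": 8761,
    "gateway": 9100,
    "config": 9101,
    "zhihu-api": 9210,
    "zhihu-api-throttle": 9310,
    "hub-admin": 9800,
    "hub-topic": 9810,
    "hub-people": 9820,
    "hub-deed": 9830,
    "hub-tag": 9840,
    "api-executor": 9900,
}


def group_for_port(port: int) -> Optional[str]:
    """Return the range label for a port, or None if the table lists no range for it."""
    for ports, label in PORT_GROUPS:
        if port in ports:
            return label
    return None


def find_port_conflicts(service_ports: Dict[str, int]) -> Dict[int, List[str]]:
    """Return every port that is claimed by more than one service id."""
    claimed: Dict[int, List[str]] = {}
    for service_id, port in service_ports.items():
        claimed.setdefault(port, []).append(service_id)
    return {port: ids for port, ids in claimed.items() if len(ids) > 1}
```

The table lists no range row for the hub services or the executor, so `group_for_port` returns `None` for those ports rather than guessing a label.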
The collected information is classified into three categories: topic, people, and deed. A tag can be attached to any of the three. Topic: a subject or matter on which people can express opinions. People: an individual who expresses opinions independently. Deed: a verbal action by an individual that shows their attitude towards a topic. Tag: a label added to a Topic, People, or Deed for classification.
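As a rough illustration of how these concepts relate, the sketch below models them as plain data classes. Every class layout and field name here (`topic_id`, `people_id`, `deed_id`, `content`, and so on) is an assumption made for the example; the actual hub schemas are not described in this README.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Tag:
    """A label added to a Topic, People, or Deed for classification."""
    name: str


@dataclass
class Topic:
    """A subject or matter on which people can express opinions."""
    topic_id: str  # hypothetical identifier field
    title: str
    tags: List[Tag] = field(default_factory=list)


@dataclass
class People:
    """An individual who expresses opinions independently."""
    people_id: str  # hypothetical identifier field
    name: str
    tags: List[Tag] = field(default_factory=list)


@dataclass
class Deed:
    """A verbal action by one individual showing their attitude towards a topic."""
    deed_id: str    # hypothetical identifier field
    people_id: str  # who performed the deed
    topic_id: str   # which topic the deed is about
    content: str
    tags: List[Tag] = field(default_factory=list)


# Usage: one person expressing an opinion on one topic.
topic = Topic(topic_id="t1", title="Example question")
person = People(people_id="p1", name="Example member")
deed = Deed(deed_id="d1", people_id=person.people_id,
            topic_id=topic.topic_id, content="Example answer text")
deed.tags.append(Tag(name="example"))
```

The point of the sketch is the shape of the relationship: a deed links exactly one individual to one topic, and tags hang off any of the three.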
- Zhihu Question(知乎问题)
- Zhihu Column(知乎专栏)
- Zhihu Member(知乎会员)
- Zhihu Answer(知乎回答)
- Zhihu Comment(知乎评论)
- Zhihu Article(知乎专栏文章)
- Zhihu Topic(知乎话题)
- Start Kafka Service
docker run -d --network host --env ADVERTISED_HOST=localhost --env ADVERTISED_PORT=9092 transcendence/kafka
- Start Service Registry Server
- Start Config Server
- Start API Request Executor Server
- Start Hub Servers (Topic Hub, People Hub, Deed Hub). Start these before the API Servers: once the API Servers are up, they begin handling any unprocessed messages and try to save the results to the Hub Servers.
- Start API Servers (Zhihu API Server and Zhihu API Throttle Server)
- Start Gateway Server
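The startup order above can be checked with a small script that probes each service's TCP port in sequence. This is a minimal sketch, not part of the project: the ports come from the service table and from `ADVERTISED_PORT` in the Kafka command, the host is assumed to be `localhost`, and the names `STARTUP_ORDER`, `port_is_open`, `wait_for_port`, and `first_not_ready` are hypothetical. An open port only shows that a process is listening, not that the service has finished initializing.

```python
import socket
import time
from typing import Optional

# (service id, port) in the order the steps above start them.
# hub-admin and hub-tag are left out because the steps do not mention them.
STARTUP_ORDER = [
    ("kafka", 9092),
    ("service-registry", 8761),
    ("config", 9101),
    ("api-executor", 9900),
    ("hub-topic", 9810),
    ("hub-people", 9820),
    ("hub-deed", 9830),
    ("zhihu-api", 9210),
    ("zhihu-api-throttle", 9310),
    ("gateway", 9100),
]


def port_is_open(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def wait_for_port(host: str, port: int,
                  deadline_seconds: float = 60.0,
                  poll_seconds: float = 1.0) -> bool:
    """Poll host:port until it accepts connections or the deadline passes."""
    deadline = time.monotonic() + deadline_seconds
    while True:
        if port_is_open(host, port):
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(poll_seconds)


def first_not_ready(host: str = "localhost") -> Optional[str]:
    """Return the first service in STARTUP_ORDER whose port is closed, or None."""
    for service_id, port in STARTUP_ORDER:
        if not port_is_open(host, port):
            return service_id
    return None
```

Calling `first_not_ready()` before launching a service shows which earlier step still needs attention; `wait_for_port("localhost", 9810)` could be used to block until the Topic Hub is listening before the API Servers are started.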