
ddfeiyu's Projects

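string-similarity

Java algorithms (1): computing string similarity with cosine similarity. 1. Requirement: I have recently been using a crawler to fetch news from various relevant websites and store it in the company's database. One technical issue is how to make sure that, once a news item has been crawled, a similar or identical item is not stored in the database again. (Some sites quote news from other sites, or take another site's news, change the content slightly, and publish it as their own.) 2. Solution: in the end I adopted the cosine similarity algorithm to compute the similarity between the body text of two news articles. This blog post summarizes the approach.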
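The deduplication idea described above can be sketched as follows. This is only a minimal illustration, not the project's actual code: it assumes each text is turned into a character-level frequency vector (one dimension per Unicode code point, whitespace ignored), and the class name `CosineSimilarity`, its method names, and the `0.9` threshold are all hypothetical. A real crawler would more likely tokenize into words first.

```java
import java.util.HashMap;
import java.util.Map;

public class CosineSimilarity {

    // Count how often each non-whitespace Unicode code point occurs in the text.
    static Map<Integer, Integer> termFrequency(String text) {
        Map<Integer, Integer> freq = new HashMap<>();
        text.codePoints()
            .filter(cp -> !Character.isWhitespace(cp))
            .forEach(cp -> freq.merge(cp, 1, Integer::sum));
        return freq;
    }

    // Euclidean length of a frequency vector.
    static double norm(Map<Integer, Integer> freq) {
        long sumOfSquares = 0;
        for (int count : freq.values()) {
            sumOfSquares += (long) count * count;
        }
        return Math.sqrt(sumOfSquares);
    }

    // Cosine similarity: dot(a, b) / (|a| * |b|), in the range [0, 1]
    // because all frequencies are non-negative. Empty input yields 0.
    static double similarity(String a, String b) {
        Map<Integer, Integer> fa = termFrequency(a);
        Map<Integer, Integer> fb = termFrequency(b);
        if (fa.isEmpty() || fb.isEmpty()) {
            return 0.0;
        }
        long dot = 0;
        for (Map.Entry<Integer, Integer> entry : fa.entrySet()) {
            Integer other = fb.get(entry.getKey());
            if (other != null) {
                dot += (long) entry.getValue() * other;
            }
        }
        return dot / (norm(fa) * norm(fb));
    }

    public static void main(String[] args) {
        String stored = "Company A announced a new product today";
        String crawled = "Company A announced a new product this morning";
        double score = similarity(stored, crawled);
        // Hypothetical threshold: treat anything above it as a duplicate.
        boolean duplicate = score > 0.9;
        System.out.println("similarity=" + score + ", duplicate=" + duplicate);
    }
}
```

Because the vectors only record character counts, two texts that use the same characters in a different order score as identical, so the threshold should be tuned on real articles before relying on it.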

submarine

[Machine Learning] Submarine is a cloud native machine learning platform.

taier

[袋鼠云 - 数栈: one-stop big data PaaS development platform] Big data platform: a distributed task scheduling system

takin

[Full-Link Stress Testing] Takin is a Java-based, open-source system for full-link performance testing in online environments, especially for microservices. With Takin, middleware and applications can distinguish real online traffic from test traffic and ensure that each enters the right database.

tensorflow

[Machine Learning Framework] An Open Source Machine Learning Framework for Everyone

thingsboard

[IoT] [IoT Platform] Open-source IoT Platform - Device management, data collection, processing and visualization.

tinyid

[Didi] [Distributed Global ID] ID generator: a distributed ID generation system that is easy to use, high-performance, and highly available.

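ttskit

[Text-to-Speech] Text-to-speech toolkit. An easy-to-use Chinese speech synthesis toolbox that includes a speech encoder, a speech synthesizer, a vocoder, and a visualization module.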

useful-scripts

[Common Ops Scripts] Useful scripts for making developers' everyday life easier and happier, involving Java, shell, etc. [Locating and troubleshooting process and thread CPU usage]

whisperx

[Speech to Subtitles] WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

wormhole

[宜信 (CreditEase)] [Data Middle Platform] Wormhole is a SPaaS (Stream Processing as a Service) platform.

xlongc

Open-sourced by 杨步涛: an NIO-based long-connection framework

yapi

[API Management] YApi is a locally deployable, visual API management platform that connects front-end, back-end, and QA.

yt-dlp

A youtube-dl fork with additional features and fixes [a video download tool for YouTube]

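zheng

[Microservice Development Framework] A distributed agile development system architecture based on Spring + SpringMVC + MyBatis. It provides a full set of common microservice modules: centralized permission management (single sign-on), content management, payment center, user management (with third-party login support), WeChat platform, storage system, configuration center, log analysis, tasks, and notifications. It supports service governance, monitoring, and tracing, and aims to give small and medium-sized enterprises a comprehensive J2EE enterprise development solution.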
