Topic: dataset Goto Github
Some thing interesting about dataset
Some thing interesting about dataset
dataset,cluster data collected from production clusters in Alibaba for cluster management research
Organization: alibaba
dataset,Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging SWAR and SIMD on Arm Neon and x86 AVX2 & AVX-512-capable chips to accelerate search, sort, edit distances, alignment scores, etc 🦖
User: ashvardanian
Home Page: https://ashvardanian.com/posts/stringzilla/
dataset,A synthetic data generator for text recognition
User: belval
dataset,大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
User: brightmart
dataset,用于训练中英文对话系统的语料库 Datasets for Training Chatbot System
User: candlewill
dataset,📈 目前最大的工业缺陷检测数据库及论文集 Constantly summarizing open source dataset and critical papers in the field of surface defect research which are of great importance.
User: charmve
Home Page: https://github.com/Charmve/computer-vision-in-action
dataset,中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Organization: cluebenchmark
Home Page: http://www.CLUEbenchmarks.com
dataset,Colour Science for Python
Organization: colour-science
Home Page: https://www.colour-science.org
dataset,Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Organization: cvat-ai
Home Page: https://cvat.ai
dataset,[ECCV 2018] CCPD: a diverse and well-annotated dataset for license plate detection and recognition
User: detectrecog
dataset,Open source annotation tool for machine learning practitioners.
Organization: doccano
dataset,中文医学NLP公开资源整理:术语集/语料库/词向量/预训练模型/知识图谱/命名实体识别/QA/信息抽取/模型/论文/etc
User: ganjinzero
dataset,Semantic Segmentation Suite in TensorFlow. Implement, train, and test new Semantic Segmentation models easily!
User: georgeseif
dataset,Objectron is a dataset of short, object-centric video clips. In addition, the videos also contain AR session metadata including camera poses, sparse point-clouds and planes. In each video, the camera moves around and above the object and captures it from different views. Each object is annotated with a 3D bounding box. The 3D bounding box describes the object’s position, orientation, and dimensions. The dataset contains about 15K annotated video clips and 4M annotated images in the following categories: bikes, books, bottles, cameras, cereal boxes, chairs, cups, laptops, and shoes
Organization: google-research-datasets
dataset,Documentation on how to access and use the Quick, Draw! Dataset.
Organization: googlecreativelab
Home Page: https://quickdraw.withgoogle.com/data
dataset,Label Studio is a multi-type data labeling and annotation tool with standardized output format
Organization: humansignal
Home Page: https://labelstud.io
dataset,Transformer: PyTorch Implementation of "Attention Is All You Need"
User: hyunwoongko
dataset,We are building an open database of COVID-19 cases with chest X-ray or CT images.
User: ieee8023
dataset,A curated list of awesome JSON datasets that don't require authentication.
User: jdorfman
dataset,🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
User: jim-schwoebel
Home Page: https://neurolex.ai/voicebook
dataset,Faker is a Python package that generates fake data for you.
User: joke2k
Home Page: https://faker.readthedocs.io
dataset,医学影像数据集列表 『An Index for Medical Imaging Datasets』
User: linhandev
Home Page: https://linhandev.github.io/posts/Medical-Dataset/
dataset,Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
User: lonepatient
dataset,收集和梳理垂直领域的开源模型、数据集及评测基准。
Organization: luban-agi
dataset,pix2tex: Using a ViT to convert images of equations into LaTeX code.
User: lukas-blecher
Home Page: https://lukas-blecher.github.io/LaTeX-OCR/
dataset,FMA: A Dataset For Music Analysis
User: mdeff
Home Page: https://arxiv.org/abs/1612.01840
dataset,This repository contains compatibility data for Web technologies as displayed on MDN
Organization: mdn
Home Page: https://developer.mozilla.org
dataset,Large list of handpicked color names 🌈
User: meodai
Home Page: https://meodai.github.io/color-names/
dataset,Curated list of Machine Learning, NLP, Vision, Recommender Systems Project Ideas
User: nirantk
Home Page: http://www.nirantk.com/awesome-project-ideas/
dataset,A global, public domain map dataset available at three scales and featuring tightly integrated vector and raster data.
User: nvkelso
Home Page: https://www.naturalearthdata.com/
dataset,Basic Utilities for PyTorch Natural Language Processing (NLP)
User: petrochukm
Home Page: https://pytorchnlp.readthedocs.io
dataset,Maintained collection of OSINT related resources. (All Free & Actionable)
User: ph055a
Home Page: https://osint.team
dataset,A collective list of free APIs
Organization: public-apis
Home Page: http://public-apis.org
dataset,Extract data from a wide range of Internet sources into a pandas DataFrame.
Organization: pydata
Home Page: https://pydata.github.io/pandas-datareader/stable/index.html
dataset,Models, data loaders and abstractions for language processing, powered by PyTorch
Organization: pytorch
Home Page: https://pytorch.org/text
dataset,Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
User: rom1504
dataset,A large annotated semantic parsing corpus for developing natural language interfaces.
Organization: salesforce
dataset,Techniques for deep learning with satellite & aerial imagery
Organization: satellite-image-deep-learning
dataset,Windows Events Attack Samples
User: sbousseaden
Home Page: https://github.com/sbousseaden/EVTX-ATTACK-SAMPLES
dataset,esProc SPL is a scripting language for data processing, with well-designed rich library functions and powerful syntax, which can be executed in a Java program through JDBC interface and computing independently.
User: splware
Home Page: http://doc.scudata.com/esproc/
dataset,TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
Organization: tensorflow
Home Page: https://www.tensorflow.org/datasets
dataset,Collaborate & label any type of data, images, text, or documents, in an easy web interface or desktop app.
Organization: universaldatatool
Home Page: https://universaldatatool.com
dataset,🎁 5,400,000+ Unsplash images made available for research and machine learning
Organization: unsplash
Home Page: https://unsplash.com/data
dataset,中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。
User: wainshine
Home Page: https://open.namemoe.com/
dataset,Waymo Open Dataset
Organization: waymo-research
Home Page: https://www.waymo.com/open
dataset,SQL Translator is a tool for converting natural language queries into SQL code using artificial intelligence. This project is 100% free and open source.
User: whoiskatrin
Home Page: https://www.sqltranslate.app/
dataset,An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
Organization: whylabs
Home Page: https://whylogs.readthedocs.io/
dataset,A MNIST-like fashion product database. Benchmark :point_down:
Organization: zalandoresearch
Home Page: http://fashion-mnist.s3-website.eu-central-1.amazonaws.com/
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.