Coder Social home page Coder Social logo

ohsevin / odd-platform Goto Github PK

View Code? Open in Web Editor NEW

This project forked from opendatadiscovery/odd-platform

0.0 0.0 0.0 20.36 MB

First open-source data discovery and observability platform. ODD Platform is based on ODD Specification.

Home Page: https://opendatadiscovery.org

License: Apache License 2.0

Shell 0.08% JavaScript 0.12% Python 0.10% Java 55.73% Groovy 0.24% TypeScript 43.50% CSS 0.08% HTML 0.14%

odd-platform's Introduction

open-data-discovery-logoย 


Product Hunt

Apache2 Maintenance GitHub contributors Slack GitHub issues by-label


Open Data Discovery Platform: Next-Gen Data Discovery and Observability

Introduction

ODD is an open-source data discovery and observability tool for data teams that helps to efficiently democratise data, power collaboration and reduce time on data discovery through modern user-friendly environment.

Key wins

  • Shorten data discovery phase

  • Have transparency on how and by whom the data is used

  • Foster data culture by continuous compliance and data quality monitoring

  • Accelerate data insights

  • Know the sources of your dashboards and ad hoc reports

  • Deprecate outdated objects responsibly by assessing and mitigating the risks

  • ๐Ÿ‘‰ ODD Platform is a reference implementation of Open Data Discovery Spec.

Features

Data Discovery and Observability

  • Accumulate scattered data insights in Federated Data catalogue
  • Gain observability through E2E Data objects Lineage
  • Benefit from cutting-edge E2E microservices Lineage feature in tracking your data flow through the whole data landscape
  • Be warned and alerted by Pipeline Monitoring tools
  • Store your metadata
  • Use ODD-native modern lightweight UI

ML First citizen

  • Save results of your ML Experiments by automatically logging its parameters

Data Security & Compliance

  • Manage Tags and Labels to prevent any abuse of the data
  • Refer to Tags and Labels to stay compliant with data security standards
  • Have full transparency on how and by whom the data is used

Data Quality

  • Simplify DQ processes by using ODD and Great Expectations compatibility
  • Integrate ODD with any custom DQ framework

Getting Started

Running Locally with Docker Compose

docker-compose -f docker/demo.yaml up -d odd-platform-enricher

Deploying to Kubernetes with Helm Charts

Example configurations

There are various example configurations (via docker-compose) within docker/examples directory.

Contributing

Contributing to ODD Platform is very welcome. For basic contributions, all you need is being comfortable with GitHub and Git. The best ways to contribute are:

  • Work on new adapters
  • Work on documentation

To ensure equal and positive communication, we adhere to our Code of Conduct. Before starting any interactions with this repository, please read it and make sure to follow.

Please before contributing check out our Contributing Guide and issues labeled "good first issue":

GitHub issues by-label


Integrations

ODD Platform works with many of the tools you're already using:

Existing integrations
Airflow
Athena
Cassandra
Clickhouse
DBT
DynamoDB
Elasticsearch
Feast
Glue
Hive
Kafka
Kinesis
Kubeflow
Microsoft ODBC
MongoDB
MSSQL
MySQL
Neo4j
PostgreSQL
Quicksight
Redshift
S3
SageMaker
SageMaker Featurestore
Snowflake
Spark
SQS
Tableau
Tarantool
BigQuery
Cube
Vertica
Trino
Presto
MariaDB
SuperSet

ODD Data Model

ODD operates the following high-level types of entities:

  1. Datasets (collections of data: tables, topics, files, feature groups)
  2. Transformers (transformers of data: ETL or ML training jobs, experiments)
  3. Data Consumers (data consumers: ML models or BI dashboards)
  4. Data Quality Tests (data quality tests for datasets)
  5. Data Inputs (sources of data)
  6. Transformer Runs (executions of ETL or ML training jobs)
  7. Quality Test Runs executions of data quality tests

For more information, please check specification.md.

Contacts

If you have any questions or ideas, please don't hesitate to drop a line to any of us.

Team Member LinkedIn GitHub
German Osin LinkedIn germanosin
Nikita Dementev LinkedIn DementevNikita
Damir Abdullin LinkedIn damirabdul
Alexey Kozyurov LinkedIn Leshe4ka
Pavel Makarichev LinkedIn vixtir
Roman Zabaluev LinkedIn Haarolean

License

ODD Platform uses the Apache 2.0 License.

odd-platform's People

Contributors

dementevnikita avatar leshe4ka avatar evanto avatar damirabdul avatar rustamgimadiev avatar nelydavtyan avatar haarolean avatar alinamiryuk avatar anna-eg avatar chismur avatar denisalik avatar marselahmetov avatar shilgam avatar nikitosinar avatar esmeneev avatar germanosin avatar dependabot[bot] avatar vixtir avatar flyspot avatar r81ack avatar makonakro avatar maksimtereshin avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.