Coder Social home page Coder Social logo

dcbrain's Introduction

dcbrain

We release the following datasets that are collected at Alibaba:

  • Hard drive disks (HDDs) (diskdata/): It includes over 200 thousand HDDs in Alibaba Cloud's data centers.

    • Publication: "Large-Scale Disk Failure Prediction(book)."
      Cheng He, Mengling Feng, Patrick P. C. Lee, Pinghui Wang, Shujie Han, Yi Liu.
      PAKDD 2020 Competition and Workshop, AI Ops 2020, February 7 – May 15, 2020, Revised Selected Papers
  • Solid-state drives (SSDs) (ssd_open_data/): It includes nearly one million SSDs of 11 drive models from three vendors over a two-year span.

    • Publication: "An In-Depth Study of Correlated Failures in Production SSD-Based Data Centers."
      Shujie Han, Patrick P. C. Lee, Fan Xu, Yi Liu, Cheng He, and Jiongzhou Liu.
      Proceedings of the 19th USENIX Conference on File and Storage Technologies (FAST 2021), February 2021.
  • SMART logs of Solid-state drives (SSDs) (ssd_smart_logs/): It includes nearly 500K SSDs of six drive models from three vendors over a two-year span.

    • Publication: "General Feature Selection for Failure Prediction in Large-scale SSD Deployment."
      Fan Xu, Shujie Han, Patrick P. C. Lee, Yi Liu, Cheng He, and Jiongzhou Liu.
      Proceedings of the 51st IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2021), June 2021.

    • See details about the relationships between two datasets of SSDs.

  • DRAM error logs (mcelog) (dramdata/): It includes DRAM errors log, inventory logs, and trouble tickets of server failures due to DRAM errors collected from more than 250K servers over a eight-month span.

    • Publication: "An In-Depth Correlative Study Between DRAM Errors and Server Failures in Production Data Centers."
      Zhinan Cheng, Shujie Han, Patrick P. C. Lee, Xin Li, Jiongzhou Liu, and Zhan Li.
      Proceedings of the 41st International Symposium on Reliable Distributed Systems (SRDS 2022), September 2022.

We release the following prototypes:

  • AIHS Prototype (AIHS_prototype/): A prototype for automated intelligent healing in Alibaba Cloud's data centers.

    • Publication: "Automated Intelligent Healing in Cloud-Scale Data Centers."
      Rui Li, Zhinan Cheng, Patrick P. C. Lee, Pinghui Wang, Yi Qiang, Lin Lan, Cheng He, Jinlong Lu, Mian Wang, Xinquan Ding.
      Proceedings of the 40th International Symposium on Reliable Distributed Systems (SRDS 2021), September 2021.

dcbrain's People

Contributors

alibaba-oss avatar liruivah avatar shujiehan avatar unlearner avatar zncheng avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

dcbrain's Issues

SMART属性问题

请问,在您的数据表里,r_241代表的是写入数据的总量,那具体的怎么换算成写入量呢?我的理解是他代表自硬盘启用后主机向硬盘写入的数据总量,以4个字节表示,每写入64GB字节作为一个单位。那对于图片的这个数据写入量应该是:
495740215769/4 B=123935053942.25 KB=123935053942.25/1024/1024 GB=118193.6778GB,对吗?

image

数据标签问题

您好,在您的关于这篇数据集的论文中,有A1-A6、B1-B3、C1-C2这几种类型,但是在数据集中看到在“manufacturer”那一列只显示A,这是什么原因呢?

关于SMART属性的归一化

您好,请问在smartlog_data_*.csv文件中,smart原始属性是用什么公式进行归一化的呢?谢谢

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.