hpplinux / compilation-analysis

This project forked from tuxml/compilation-analysis

Analysis of 125K+ configurations of the Linux kernel (build/compilation phase)

Jupyter Notebook 99.91% Python 0.05% R 0.01% TeX 0.04%

compilation-analysis's Introduction

Analysis of TUXML builds (including failures)

Analysis of 95K+ configurations of the Linux kernel

Learning and querying data

  • pre-requisite is:
  • ClusterAnalysis.ipynb: cluster analysis and statistical learning about options
  • TUXML*.ipynb
    • sensitivity analysis of decision trees and a rule-extraction facility
    • clustering attempts
  • PDF files are generated from the notebooks; we push them for convenient reviewing
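
To give an idea of what rule extraction from a decision tree looks like, here is a minimal sketch using scikit-learn. It is not taken from the notebooks: the toy data, the option names (`OPTION_A`, `OPTION_B`) and the `failed` column are assumptions made up for illustration.

```python
import pandas as pd
from sklearn.tree import DecisionTreeClassifier, export_text

# Toy data (hypothetical): one row per configuration, one 0/1 column per
# option, and `failed` as the build outcome. Here `failed` simply mirrors
# OPTION_A, so the tree only needs a single split.
data = pd.DataFrame({
    "OPTION_A": [1, 1, 0, 0, 1, 0, 1, 0],
    "OPTION_B": [0, 1, 0, 1, 0, 1, 1, 0],
    "failed":   [1, 1, 0, 0, 1, 0, 1, 0],
})
features = ["OPTION_A", "OPTION_B"]

# Fit a shallow tree so the extracted rules stay readable.
tree = DecisionTreeClassifier(max_depth=3, random_state=0)
tree.fit(data[features], data["failed"])

# export_text renders the learned tree as nested if/else rules.
rules = export_text(tree, feature_names=features)
print(rules)
```

On real data the tree would be deeper, and the printed rules are only as trustworthy as the tree's accuracy on held-out configurations.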

Extraction

We have a database from which we can extract and encode data. The following scripts serve this purpose.

database-to-CSV.py (and genCSV.py + DBCredentials.py)

Script that extracts configuration data from the database into a CSV file (only the configuration and the compilation status/sizes).

  • the script can be parameterized with From and To (the cids at which to start and stop).
  • normally you don't need to use it; beware that it is bandwidth- and time-consuming.
  • we use it to extract several CSVs that are then assembled into one
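
As a rough illustration of the From/To parameterization, the sketch below splits a cid range into batches, each of which could be extracted to its own CSV before assembly. The names `cid_batches` and `csv_name`, the batch size, the file-name pattern and the choice of inclusive bounds are all assumptions made for this example; they are not taken from database-to-CSV.py.

```python
from typing import Iterator, Tuple


def cid_batches(cid_from: int, cid_to: int, batch_size: int) -> Iterator[Tuple[int, int]]:
    """Yield inclusive (start, stop) cid ranges covering [cid_from, cid_to]."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    start = cid_from
    while start <= cid_to:
        # The last batch may be shorter than batch_size.
        stop = min(start + batch_size - 1, cid_to)
        yield start, stop
        start = stop + 1


def csv_name(start: int, stop: int) -> str:
    """Hypothetical naming scheme for one extracted chunk."""
    return f"config_{start}-{stop}.csv"


if __name__ == "__main__":
    for start, stop in cid_batches(30000, 32499, 1000):
        print(csv_name(start, stop))
```

Extracting in batches like this keeps each database request bounded, which helps given how bandwidth- and time-consuming the extraction is.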

see also https://github.com/TuxML/tuxml-datasets for getting data using git-lfs

TUXML-csv-building.ipynb

Notebook that assembles the CSV files into one (aggregation) and exports the result in the pickle format (https://pandas.pydata.org/pandas-docs/stable/user_guide/io.html).
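
The aggregation step can be sketched with pandas as follows. This is a minimal illustration rather than the notebook's actual code: the function name `assemble_csvs`, the glob pattern, and the assumption that a `cid` column identifies a configuration are all hypothetical.

```python
import glob

import pandas as pd


def assemble_csvs(pattern: str, output_pickle: str) -> pd.DataFrame:
    """Concatenate all CSV files matching `pattern` and pickle the result."""
    paths = sorted(glob.glob(pattern))
    if not paths:
        raise FileNotFoundError(f"no CSV files match {pattern!r}")
    frames = [pd.read_csv(path) for path in paths]
    # ignore_index=True renumbers rows so chunk indices do not collide.
    merged = pd.concat(frames, ignore_index=True)
    # Assumption: `cid` identifies a configuration; drop repeated cids
    # in case the extracted ranges overlap.
    if "cid" in merged.columns:
        merged = merged.drop_duplicates(subset="cid").reset_index(drop=True)
    merged.to_pickle(output_pickle)
    return merged
```

A call such as `assemble_csvs("./config_*.csv", "./config_all.pkl")` (file names are placeholders) writes a pickle that can be reloaded with `pd.read_pickle`, which is much faster than re-parsing the CSVs.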

deprecated

  • TUXML-analysis.ipynb: learning and some exploration of bugs out of config_bdd30-100.pkl (pre-requisite: download config_bdd30-100.pkl)
  • bdd-tuxml-facility: database facilities (e.g., logs), typically used to analyze configurations (and their failures)
  • TUXML-basic.ipynb: learning and some exploration of bugs (out of config_10K.csv, see below)
    • if you want to process config_50K.csv instead, just change the assignment TUXML_CSV_FILENAME = './config_bdd10K.csv'
    • there is also a part about kernel sizes, but it's out of scope right now

compilation-analysis's People

Contributors

arnobl, canisback, familiar-project, jualvespereira
