Coder Social home page Coder Social logo

scanningdata's Introduction

Welcome!

Welcome to the ScanningData GitHub Repository! This Repository contains sets of scripts essential for data scanning in support of the seeWaybeyond Platform, although it could be use completely stand-alone. Early days for the Repository.

Scanning the following Data

Structured Data

Structured data usually resides in relational databases (RDBMS). Fields store length-delineated data phone numbers, Social Security numbers, or ZIP codes. Even text strings of variable length like names are contained in records, making it a simple matter to search.

UnStructured Data

Unstructured data has internal structure but is not structured via pre-defined data models or schema. It may be textual or non-textual, and human- or machine-generated. It may also be stored within a non-relational database like NoSQL. Typically includes:

  • Text files: Word processing, spreadsheets, presentations, email, logs.
  • Email: Email has some internal structure thanks to its metadata, and we sometimes refer to it as semi-structured. However, its message field is unstructured and traditional analytics tools cannot parse it.
  • Social Media: Data from Facebook, Twitter, LinkedIn.
  • Website: YouTube, Instagram, photo sharing sites.
  • Mobile data: Text messages, locations.
  • Communications: Chat, IM, phone recordings, collaboration software.
  • Media: MP3, digital photos, audio and video files.
  • Business applications: MS Office documents, productivity applications.
  • Typical machine-generated unstructured data includes:
  • Satellite imagery: Weather data, land forms, military movements.
  • Scientific data: Oil and gas exploration, space exploration, seismic imagery, atmospheric data.
  • Digital surveillance: Surveillance photos and video.
  • Sensor data: Traffic, weather, oceanographic sensors.

Semi-Structured Data

Semi-structured data maintains internal tags and markings that identify separate data elements, which enables information grouping and hierarchies. Amongst others includes:

  • Markup language XML This is a semi-structured document language.
  • Open standard JSON (JavaScript Object Notation) JSON is another semi-structured data interchange format.

Other Data

Loosely grouped for now.

  • 3RD Party Data
  • Cloud Data
  • Big Data

scanningdata's People

Contributors

deonpollard avatar

Stargazers

 avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.