Coder Social home page Coder Social logo

marekhorst / iis Goto Github PK

View Code? Open in Web Editor NEW

This project forked from openaire/iis

0.0 2.0 0.0 57.08 MB

Information Inference Service of the OpenAIRE system

License: Apache License 2.0

Java 96.77% Shell 0.15% Scala 0.90% Python 0.51% PigLatin 1.42% Roff 0.25%

iis's Introduction

About

Information Inference Service (IIS) a flexible data processing system for handling big data based on Apache Hadoop technologies. It is a subsystem of the OpenAIRE system (www.openaire.eu is its public web front-end) - see Fig.1 for a high-level overview.

Fig.1: The center of OpenAIRE system is the Information Space system which stores all information available in the system. IIS ingests data from Information Space, runs processing workflows, and produces inferred data which, in turn, is ingested by Information Space.

The goal of OpenAIRE is to provide an infrastructure for gathering, processing (including de-duplication), and providing unified access to research-related data (papers, datasets, researchers, projects, etc.). The goal of IIS is to provide data/text mining functionality for the OpenAIRE system. In practice, IIS defines data processing workflows that connect various modules, each one with well-defined input and output. A high-level overview of IIS can be found in paper "Information Inference in Scholarly Communication Infrastructures: The OpenAIREplus Project Experience", Procedia Computer Science, vol. 38, 2014, 92-99.

IIS was initially developed during OpenAIREplus project and has been further extended during OpenAIRE2020 project.

The original code was migrated to GitHub from D-NET SVN repository. The public read-only interface of the repository is available at https://svn-public.driver.research-infrastructures.eu/driver/dnet40/modules/ and this is where you can find the history of the code base before the migration (IIS-related Maven projects are the ones matching glob pattern *-iis-*).

Content of the most important subdirectories and files

  • docs - basic documentation
  • iis-core - generic common utilities used by other projects
  • iis-common - OpenAIRE-related common utilities
  • iis-wf - definitions of workflows used in the system
  • CONTRIBUTORS.markdown - list of contributors to the project

License

The code is licensed under Apache License, version 2.0. We also use 3rd party code from other projects compatible with this license. This 3rd party code can be found in directories with names starting with iis-3rdparty-; each directory corresponds to a different source project.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.