DockerPedia: A Knowledge Graph of Software Images and their Metadata

Datasets


  • DockerPedia Dataset v1.0-beta.1

    • Description: DockerPedia is a resource that allows users to inspect Docker images before downloading and deploying them on a host. Normally, Docker images are like a black box: users do not really know what they are deploying. Users only know the name of the image and the name of the main package within it. With this resource, it is now possible to inspect the Docker images stored on DockerHub before deploying them.
    • License: https://creativecommons.org/licenses/by/4.0/legalcode
  • Queries used to illustrate the DockerPedia KG

    • Description: Query results obtained by issuing the SPARQL queries from the paper's listings to the SPARQL endpoint at https://dockerpedia.inf.utfsm.cl/
    • License: https://creativecommons.org/licenses/by/4.0/legalcode
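
As a rough illustration of how such a query could be issued, the sketch below builds a SPARQL 1.1 Protocol GET request in Python. The "/sparql" service path is an assumption (this page only gives the site root), and the query is a generic triple pattern rather than one of the paper's listings, since the DockerPedia vocabulary is not described here. The helper names are hypothetical.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# Assumption: the "/sparql" path is hypothetical; only the site root is documented.
ENDPOINT = "https://dockerpedia.inf.utfsm.cl/sparql"

# Generic query that assumes nothing about the DockerPedia vocabulary.
QUERY = "SELECT ?s ?p ?o WHERE { ?s ?p ?o } LIMIT 5"


def build_request_url(endpoint: str, query: str) -> str:
    """Return a GET URL that carries the query in the `query` parameter."""
    return endpoint + "?" + urlencode({"query": query})


def fetch_json(url: str, timeout: float = 10.0) -> dict:
    """Send the request and decode SPARQL JSON results (needs network access)."""
    request = Request(url, headers={"Accept": "application/sparql-results+json"})
    with urlopen(request, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    # Only prints the URL; call fetch_json(url) to actually contact the server.
    print(build_request_url(ENDPOINT, QUERY))
```

Building the URL separately from sending it makes it easy to check the encoded query before any request is made.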

Software


The pointers for the main software used can be found below:

Experiment reproducibility is the ability to rerun an experiment while introducing changes to it. To allow reproducibility, the scientific community encourages researchers to publish descriptions of these experiments. However, these recommendations do not include an automated way of creating such descriptions: normally, scientists have to annotate their experiments in a semi-automated way. In this paper we propose a system to automatically describe the computational environments used in in-silico experiments. We propose to use Operating System (OS) virtualization (containerization) to distribute software experiments through software images, together with an annotation system that describes these software images. The images are a minimal version of an OS (container) that allows the deployment of multiple isolated software packages within it.

Readme
License: MIT License
Download
go

A Knowledge Graph of Docker images

Readme
Download
javascript, html, css

About the authors


Daniel Garijo

Researcher

Universidad Politécnica de Madrid, University of Southern California

http://w3id.org/people/dgarijo

I am a researcher at Universidad Politécnica de Madrid. My research activities focus on e-Science and the Semantic Web, specifically on how to increase the ease of use of software and scientific workflows using provenance, metadata, intermediate results and Linked Data.

Maximiliano Osorio

Author

Research Programmer

Computer Scientist at the Information Sciences Institute of the University of Southern California.

Idafen Santana-Pérez

Author

Lecturer

https://idafen.wordpress.com/

Lecturer at the DSC department of ULPGC, working on sensor data for light pollution and medical image processing. Idafen is also interested in topics related to Open Science and Linked Data in general.

Carlos Buil

Author

Lecturer

https://www.inf.utfsm.cl/quienes-somos/directorio-de-personas/academicos/15-carlos-buil

Lecturer at the Universidad Técnica Federico Santa María, in Chile. Carlos's work focuses on (graph) databases, the Semantic Web and Knowledge Graphs.