rdf-connect/OP-vocab-feed

EU Publications Office vocabulary feed

This repository contains the architectural configuration to produce and publish a Linked Data Event Stream (LDES) that carries a feed of changes for a configurable controlled vocabulary, such as those managed by the EU Publications Office.

The vocabulary changes are modelled using the W3C Activity Streams 2 vocabulary.
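As a rough illustration of what one feed member could look like, the sketch below builds a minimal Activity Streams 2 activity for a changed concept. The exact shape produced by this pipeline is not documented here, so the chosen properties (`type`, `object`, `published`), the `make_activity` helper, and the concept IRI are assumptions made for the example.

```python
import json
from datetime import datetime, timezone

# Official JSON-LD context IRI of Activity Streams 2.
AS2_CONTEXT = "https://www.w3.org/ns/activitystreams"


def make_activity(kind, concept_iri, published):
    """Build a minimal Activity Streams 2 activity describing one change."""
    if kind not in {"Create", "Update", "Delete"}:
        raise ValueError(f"unsupported activity type: {kind}")
    return {
        "@context": AS2_CONTEXT,
        "type": kind,
        "object": concept_iri,
        # Normalise to UTC so members sort consistently by time.
        "published": published.astimezone(timezone.utc).isoformat(),
    }


# Hypothetical concept IRI, for illustration only.
activity = make_activity(
    "Update",
    "http://example.org/vocab/concept/123",
    datetime(2024, 1, 15, 12, 0, tzinfo=timezone.utc),
)
print(json.dumps(activity, indent=2))
```

`Create`, `Update` and `Delete` are standard Activity Streams 2 activity types. Whether the pipeline uses exactly these three is an assumption.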

The data processing workflow is built as an RDF-Connect pipeline that performs several data transformation steps, which include:

  • Raw vocabulary fetching over HTTP
  • SHACL-based change detection and semantic labeling with Activity Streams 2
  • Fragmentation based on temporal constraints
  • Ingestion into a given data store system
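The change detection and fragmentation steps above can be sketched in plain Python. This is not the RDF-Connect implementation. The actual pipeline uses SHACL-based detection, whereas this simplified stand-in compares two snapshots that map each concept IRI to a content fingerprint, and then groups the resulting changes into per-day buckets. All names and sample values are hypothetical.

```python
from collections import defaultdict
from datetime import datetime, timezone


def detect_changes(previous, current):
    """Compare two snapshots mapping concept IRI -> content fingerprint."""
    changes = []
    for iri in sorted(current.keys() - previous.keys()):
        changes.append(("Create", iri))  # present only in the new snapshot
    for iri in sorted(previous.keys() - current.keys()):
        changes.append(("Delete", iri))  # present only in the old snapshot
    for iri in sorted(current.keys() & previous.keys()):
        if current[iri] != previous[iri]:
            changes.append(("Update", iri))  # fingerprint differs
    return changes


def fragment_by_day(members):
    """Group (timestamp, payload) members into buckets keyed by UTC date."""
    buckets = defaultdict(list)
    for timestamp, payload in members:
        day = timestamp.astimezone(timezone.utc).date().isoformat()
        buckets[day].append(payload)
    return dict(buckets)


# Hypothetical snapshots, for illustration only.
old = {"ex:a": "h1", "ex:b": "h2"}
new = {"ex:a": "h1-changed", "ex:c": "h3"}

changes = detect_changes(old, new)
detected_at = datetime(2024, 1, 15, 9, 30, tzinfo=timezone.utc)
fragments = fragment_by_day([(detected_at, change) for change in changes])
print(fragments)
```

Bucketing by day is only one possible temporal constraint. The granularity used by the real fragmentation step is not stated here.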

Publishing is handled by an instance of the ldes-server, which reads from the same data store that the RDF-Connect pipeline writes to.

System components and architecture

TODO: Diagram and description of pipeline components.

How to run it?

TODO: describe the steps to run without Docker

With Docker

This pipeline and the necessary data storage and interface components are containerized using Docker and can be run all together using docker-compose as follows:

$ docker-compose up --build 

The conf.env file contains the main configuration variables to be set.
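To check the configuration before starting the containers, a small script can parse a `KEY=VALUE` file such as conf.env. The sketch below assumes the simple one-assignment-per-line format commonly used for env files. The variable names in the sample (`VOCAB_URL`, `POLL_INTERVAL`) are hypothetical and do not reflect the actual contents of conf.env.

```python
def parse_env(text):
    """Parse KEY=VALUE lines, skipping blank lines and '#' comments."""
    settings = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#"):
            continue
        key, separator, value = line.partition("=")
        if not separator:
            continue  # not an assignment, ignore it
        settings[key.strip()] = value.strip()
    return settings


# Hypothetical sample content, for illustration only.
sample = """
# vocabulary to watch
VOCAB_URL=http://example.org/vocab.rdf
POLL_INTERVAL=3600
"""

settings = parse_env(sample)
print(settings)
```

To inspect the real file, read it with `open("conf.env", encoding="utf-8").read()` and pass the result to `parse_env`.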
