Skip to content
datacorner edited this page Dec 2, 2023 · 2 revisions

joinTR Transformer

Purpose

The purpose of this transformer is to make a join (equivalent of a SQL join) between 2 data sources. It returns only one dataset in output which is the result of this join.

Configuration by using the SQL directly in the configuration file

The specific configuration (as a Datasource) in the configuration file section parameters includes the following parameters:

  • join: Join type (can be one of this value : inner, left, right or outer)
  • left: Left dataset id and left dataset columns keys (in an sorted array)
  • right: Right dataset id and right dataset columns keys (in an sorted array)

Data sources configuration

  • Inputs : 2 inputs
  • Outputs : 1 output

Configuration example:

    "transformers":  [ { 
        "id": "T1",
        "classname": "pipelite.transformers.joinTR",
        "inputs" : [ "E1", "E2" ],
        "outputs" : [ "L1" ],
        "parameters" : {
            "join" : "inner",
            "left" : { "dsid" : "E1", 
                       "keys" : ["col2"] },
            "right" :  { "dsid" : "E2", 
                          "keys" : ["tcol1"] }
            }
    } ],
    ... ] ...

🏠 Home
πŸ”‘ Main concepts
πŸ’» Installation
πŸ”¨ Configuration
πŸš€ Running

Supported Data Sources
πŸ“„ CSV File
πŸ“‘ XES File
πŸ“ƒ Excel File
πŸ“€ ODBC
🏒 SAP
🎒 ABBYY Timeline

Supported Transformations
πŸ”€ Pass Through
πŸ“Ά Dataset Profiling
πŸ”‚ Concat 2 Data sources
πŸ†– SubString
πŸ†’ Column Transformation
πŸ”ƒ Join data sources
πŸ”ƒ Lookup
πŸ”€ Rename Column Name

Extending pipelite
βœ… how to
βœ… Adding new Data sources
βœ… Adding new Transformers
βœ… Adding new Pipelines

Clone this wiki locally