Skip to content
This repository was archived by the owner on Oct 26, 2023. It is now read-only.

Latest commit

 

History

History
24 lines (17 loc) · 2 KB

File metadata and controls

24 lines (17 loc) · 2 KB

Problem set for ramping up on the shrike package

Contents

This document is a proposal of a list of problems to help people learn how to use shrike.

List of problems

⚠️ Note that these problems are meant to be tackled sequentially, as the solution of problem N+1 builds upon the solution of problem N.

Creating, submitting, and validating pipelines

  • Pipelines Problem 01 Submit a pipeline with a single "Hello, world!"-type component.
  • Pipelines Problem 02 Submit a single-component pipeline where the component operates on a value passed as parameter (pass the parameter value through a config file or via the command line at pipeline submission time).
  • Pipelines Problem 03 Submit a single-component pipeline which consumes a dataset (for example count the number of records).
  • Pipelines Problem 04 Submit a multi-component pipeline where one component's output is the input of a subsequent component.
  • Pipelines Problem 05 Submit a multi-component pipeline which uses a subgraph.
  • Pipelines Problem 06 Submit a pipeline where a component is chosen based on a parameter value.
  • Pipelines Problem 07 Add integration tests to ensure a pipeline does not break.

Logging

  • Logging Problem 01 Submit a pipeline using the compliant logger to log various properties of the dataset consumed by a component (such as number of records or average of a numerical field, for instance).
  • Logging Problem 02 Experiment with the different data categories available to the compliant logger.
  • Logging Problem 03 Experiment with the various options about stack trace prefixing (customize the prefix and the exception message, scrub the exception message unless it is in an allowed list).