This project involves processing and transforming data using PySpark. The data is read from a Parquet file, transformed using window functions, and then saved as a table in Parquet format.
- `solution/run.py` - main program
- `solution/DataProcessor.py` - class for data processing
- `solution/quick_and_dirty` - Databricks notebook for a quick solution
- `tests` - folder with tests