Skip to content

Latest commit

 

History

History
44 lines (30 loc) · 1.92 KB

File metadata and controls

44 lines (30 loc) · 1.92 KB

Python Data Wrangling Foundations

A foundational repository of Python scripts designed for large-scale data manipulation, performance benchmarking, and data wrangling without relying on heavy external libraries.

Un repositorio fundacional de scripts en Python diseñados para la manipulación a gran escala, benchmarking de rendimiento y pre-procesamiento de datos sin depender de librerías externas pesadas.


Data Structures Performance

The Purpose / El Proposito

Before writing Machine Learning models or using Pandas, a Data Engineer must understand how pure Python handles data under the hood. This project explores high-performance data structures: dictionaries, sets, tuple unpacking, and optimized list comprehensions.

Antes de escribir modelos de Machine Learning o usar Pandas, un Data Engineer debe entender cómo el Python puro maneja los datos. Este proyecto explora estructuras de alto rendimiento: diccionarios, sets y list comprehensions.

Tech Stack

  • Language: Python 100% Native Standard Library
  • Concepts: Hash maps (O(1) lookups), Nested Data Processing, Functional programming approaches.

Contents / Contenido

File Description Impact
list_and_dicts.py Complex aggregations over nested dicts Reduces memory footprint
lista_cuadrados.py Advanced List Comprehensions Loops run 4x faster

How to Run / Como Ejecutar

git clone https://github.com/JulianDataScienceExplorerV2/Python-Data-Wrangling-Foundations.git
cd Python-Data-Wrangling-Foundations

python list_and_dicts.py
python lista_cuadrados.py

Julian David Urrego Lancheros
Data Analyst & Marketing Science