QST: Prerequisites to understand pandas-dev/pandas repo #50000

@jmferreirab

Research

  • I have searched the [pandas] tag on StackOverflow for similar questions.

  • I have asked my usage related question on StackOverflow.

Link to question on StackOverflow

https://softwareengineering.stackexchange.com/questions/442574/roadmap-to-understand-pandas-dev-pandas-open-source-project

Question about pandas

tldr:

What topics do I need to study to understand how everything in the pandas
repo meshes together (.py, .pyx, and .c files, cmake) to produce the final
result: a hybrid library that is consumed purely through a Python
API (import pandas as pd)?

Context

I have been trying for a while to understand how pandas is built by looking at the repo.
It seems to keep .pyx files that are later converted by Cython into C files (?), and there are also some .h and .c files under https://github.com/pandas-dev/pandas/tree/main/pandas/_libs.
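To make the "hybrid library" idea concrete, here is a minimal sketch (standard library only) of how one might check whether a module that Python imports is plain source or a compiled extension. The helper name module_kind is hypothetical and not part of pandas or Cython; it only inspects which loader the import system picks.

```python
import importlib.machinery
import importlib.util


def module_kind(name):
    """Classify an importable module by the loader that would load it.

    Hypothetical helper for illustration; returns one of
    'missing', 'extension', 'source', 'builtin', or 'other'.
    """
    try:
        # For dotted names this imports the parent package first.
        spec = importlib.util.find_spec(name)
    except ModuleNotFoundError:
        return "missing"
    if spec is None:
        return "missing"
    loader = spec.loader
    # Compiled C extensions (including ones generated by Cython)
    # are loaded from shared-library files by ExtensionFileLoader.
    if isinstance(loader, importlib.machinery.ExtensionFileLoader):
        return "extension"
    # Ordinary .py files are loaded by SourceFileLoader.
    if isinstance(loader, importlib.machinery.SourceFileLoader):
        return "source"
    # Modules compiled directly into the interpreter, such as sys.
    if spec.origin == "built-in":
        return "builtin"
    return "other"


# File suffixes the running interpreter accepts for compiled modules.
print(importlib.machinery.EXTENSION_SUFFIXES)
print(module_kind("sys"))
```

If pandas is installed, calling module_kind("pandas._libs.lib") would probably report "extension", since the files under pandas/_libs are compiled before they can be imported; that module name is an assumption about the installed layout, not something confirmed here.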

I'm looking for a sequence of topics (or materials) that a Python developer who has never touched Cython, cmake, or similar tools needs to study to gain the knowledge necessary to understand the pandas project source code, or something equivalent. Alternatively, an explanation of how the project internally meshes together would be amazing.

I read the pandas "how to contribute" page, but it was not enough to answer this question.
