Skip to content

pyarrow version #2

@danodonovan

Description

@danodonovan

The file setup.py doesn't include versioning information for packages and I couldn't find a compatible version of pyarrow.

Using the latest versions of pyarrow results in errors that look like;

Traceback (most recent call last):
  File "/Users/.../schema_from_parquet/cell_imager_schema.py", line 13, in <module>
    status = parquet_to_athena(file_path, location, database, table, partition, create_table, workgroup, reqion)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/.../schema_from_parquet/parquet_to_athena/parquet_to_athena.py", line 63, in parquet_to_athena
    physical_col_type = dataframe.schema[col].physical_type
                        ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
AttributeError: 'pyarrow.lib.Field' object has no attribute 'physical_type'

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions