For approximate join queries, duplicate column names in two tables are not allowed.

https://github.com/ddkang/aidb/blob/bef78d2339b0f6f654467f58651f80a21ae1f2da/aidb/engine/approx_aggregate_join_engine.py#L215-L218
The program uses the bare column name as a key because the dataframe retrieved from the database uses this column name rather than the {table}.{column} format. Consequently, when two dataframes are concatenated, the merged dataframe may contain duplicate column names if both tables have a column with the same name.

	if agg_col.split('.')[1] not in agg_col_to_alias:
		agg_col_to_alias[agg_col.split('.')[1]] = f'agg_col__{i}'
		udf_query = udf_query.add_select(f'{agg_col} AS agg_col__{i}')
	agg_list.append((agg_type, agg_col_to_alias[agg_col.split('.')[1]]))
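
To make the collision concrete, here is a minimal standalone sketch of the aliasing loop above. It is not the engine's code: `build_aliases`, `key_fn`, and the table names `objects00` / `objects01` are hypothetical, and a plain list stands in for `udf_query.add_select`. Keying by the full `{table}.{column}` name is shown only as one possible direction, on the assumption that the rest of the engine could be adapted to look columns up that way.

```python
def build_aliases(agg_cols, key_fn):
    """Assign an alias to each aggregation column, deduplicating by key_fn(column)."""
    agg_col_to_alias = {}
    select_exprs = []  # stand-in for the expressions passed to udf_query.add_select
    agg_list = []
    for i, (agg_type, agg_col) in enumerate(agg_cols):
        key = key_fn(agg_col)
        if key not in agg_col_to_alias:
            agg_col_to_alias[key] = f'agg_col__{i}'
            select_exprs.append(f'{agg_col} AS agg_col__{i}')
        agg_list.append((agg_type, agg_col_to_alias[key]))
    return select_exprs, agg_list


# Two hypothetical tables that both have a column named x_min.
agg_cols = [('SUM', 'objects00.x_min'), ('SUM', 'objects01.x_min')]

# Keyed by the bare column name, as in the snippet above:
# the second column is treated as already aliased and is never selected.
bare_selects, bare_aggs = build_aliases(agg_cols, lambda c: c.split('.')[1])

# Keyed by the full {table}.{column} name (an assumed alternative):
# each column gets its own alias.
full_selects, full_aggs = build_aliases(agg_cols, lambda c: c)

print(bare_selects, bare_aggs)
print(full_selects, full_aggs)
```

With the bare-name key, both aggregations end up pointing at `agg_col__0`, so the aggregate over `objects01.x_min` is silently computed on `objects00.x_min`.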

For approximate join queries, duplicate column names in two tables are not allowed. #171