currently, aggregation group-by's logical property is like:
select v1 from t1 group by v1;
Agg group=v1 <- schema=[v1], column_ref=[v1]
Scan t1
but actually, group by could change the distribution of the column, so probably we should set it to derived, or find a way to represent it? if a later join refers to this column, we should treat it differently.