Description
I am using the driver to run a data migration on a table with 24 million rows and 9 columns. Performance is excellent when I fetch 10 thousand rows at a time, and transfer speed is still good at 100 thousand rows. However, when fetching 1 million rows or more, data transfer becomes very slow. A quick test suggests the driver tries to fetch all the data in a single batch. Is there a way to improve this?
This is the connection string:
"token:xxx@host:443$xxx-path?catalog=sample&database=big_table&useCloudFetch=true&maxRows=10000"
I've tried these two settings, but data transfer is still slow for big tables:
- useCloudFetch=true
- maxRows=10000
In forums, users suggest changing spark.driver.maxResultSize. Does that apply here?
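For reference, the connection string above can be assembled like this. This is only a minimal sketch: `buildDSN` is a hypothetical helper, and the token, host, and HTTP path values are placeholders, not real credentials.

```go
package main

import (
	"fmt"
	"net/url"
)

// buildDSN assembles a connection string in the same shape as the one in
// this issue. token, host, and httpPath are placeholders; maxRows controls
// the per-fetch batch size.
func buildDSN(token, host, httpPath string, maxRows int) string {
	q := url.Values{}
	q.Set("catalog", "sample")
	q.Set("database", "big_table")
	q.Set("useCloudFetch", "true")
	q.Set("maxRows", fmt.Sprint(maxRows))
	return fmt.Sprintf("token:%s@%s:443%s?%s", token, host, httpPath, q.Encode())
}

func main() {
	dsn := buildDSN("xxx", "host", "/xxx-path", 10000)
	fmt.Println(dsn)
	// In the real program this DSN is passed to sql.Open with the Databricks
	// driver, and results are consumed by iterating rows.Next() so batches
	// stream one at a time rather than being materialized all at once.
}
```

The intent of the loop over `rows.Next()` is that only one batch of `maxRows` rows is held in memory at a time, which is why the slowdown at 1 million rows was surprising.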