Replies: 3 comments 1 reply
-
This is by design for Hive tables and Spark's built-in file-format tables. Data lake table formats such as Delta and Iceberg could solve your problem.
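As a sketch of the suggestion above: a table stored in a data lake format tracks its files in a transaction log instead of relying on the session's cached file listing, so writes from another session become visible without a manual refresh. This assumes a Spark build with the Delta Lake connector on the classpath, and the table name `t1` is purely illustrative:

```sql
-- Hypothetical table; requires the Delta Lake connector to be configured.
CREATE TABLE t1 (id BIGINT, name STRING) USING delta;

-- A write issued from any other SparkSession:
INSERT INTO t1 VALUES (1, 'a');

-- A subsequent read in this session consults the Delta transaction log,
-- not a cached file listing, so it sees the new row without REFRESH TABLE.
SELECT * FROM t1;
```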
-
I have a PR (it may not solve your problem yet, but it goes in the right direction, and with follow-ups it might achieve that goal). Sadly, it hasn't gotten much attention from the Spark community yet, but I still see a chance to merge it and do the follow-ups.
-
How is apache/spark#37355 related to this issue? This is about the file-listing cache, not the table data cache.
-
For example, with two SparkSessions A and B: first query table T1 in A, then write data into T1 from B; afterwards, a query in A cannot see the data written by B. Currently there are two workarounds:
1. Close SparkSession A and restart it.
2. Run the refresh table command.
Neither is practical, because business users don't know when to perform these operations, and refreshing frequently hurts interactive performance.
Ideally, a centralized cache solution such as Redis would be supported.
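The second workaround above can be sketched in Spark SQL (the session labels and table name `T1` follow the example in this comment and are illustrative):

```sql
-- In SparkSession A, after SparkSession B has written to T1:
REFRESH TABLE T1;   -- invalidates A's cached metadata and file listing for T1
SELECT * FROM T1;   -- the next scan re-lists files and sees the rows B wrote
```

The same refresh can be issued programmatically via `spark.catalog.refreshTable("T1")`, but either way it must be run after every external write, which is exactly the operational burden described above.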