Avoid deadlock on write transaction #942

@goshaQ

Description

I've noticed that writing a relationship table (here) containing tens of millions of rows with large batches (the default value in the configuration, 100000) generates a lot of warnings like this:

```
00:26:41 WARN RetryLogic: Transaction failed and will be retried in 1166ms
org.neo4j.driver.exceptions.TransientException: LockClient[x] can't wait on
resource RWLock[NODE(x), hash=x] since => LockClient[x] <-[:HELD_BY]-
RWLock[NODE(x), hash=x] <-[:WAITING_FOR]- LockClient[x] <-[:HELD_BY]-
RWLock[NODE(x), hash=x]
```

I wonder whether this can be minimized by re-partitioning the Spark DataFrame. Maybe it is not actually necessary to write relationships in parallel at all, given how many transactions end up being retried?
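For illustration, here is a minimal sketch of the repartitioning idea, assuming the connector's Spark DataSource write API (the actual write code is behind the link above, so column names, labels, the `LINKS_TO` relationship type, and the connection URL below are all hypothetical). Repartitioning on the source node key makes every relationship that locks a given source node come from the same task, so writer tasks stop waiting on each other's source-node locks:

```scala
import org.apache.spark.sql.{DataFrame, SaveMode}

def writeRelationships(relDf: DataFrame): Unit = {
  relDf
    // All rows sharing a sourceId land in the same partition, so at most
    // one writer task holds the lock on that source node at a time.
    .repartition(relDf.col("sourceId"))
    .write
    .format("org.neo4j.spark.DataSource")
    .mode(SaveMode.Append)
    .option("url", "bolt://localhost:7687")      // placeholder connection
    .option("relationship", "LINKS_TO")          // hypothetical type
    .option("relationship.save.strategy", "keys")
    .option("relationship.source.labels", ":Node")
    .option("relationship.source.node.keys", "sourceId:id")
    .option("relationship.target.labels", ":Node")
    .option("relationship.target.node.keys", "targetId:id")
    .option("batch.size", "10000")               // smaller than the 100000 default above
    .save()
}
```

Note this only removes contention on source nodes; target nodes can still be shared across partitions, so some retries may remain. Coalescing to a single partition (`relDf.coalesce(1)`) removes lock contention entirely but serializes the whole write, which trades the retry overhead for lost parallelism.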
