@brownbaerchen
Contributor

In tau methods, we need to replace some rows of the matrix with the boundary conditions, and then replace the corresponding entries of the right-hand side with the values at the boundary. This PR makes the second part more efficient. Profiling on GPUs showed that the previous implementation was rather expensive. By caching a mask that streamlines this operation, solving systems with already factored matrices is up to 10x faster on GPUs and 2x faster on CPUs than before, depending on resolution.
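To illustrate the idea, here is a minimal sketch in NumPy, not the code from this PR. The names `build_bc_mask` and `apply_bcs` are hypothetical, and the sketch assumes the boundary-condition rows are known up front and that the boundary values are ordered by increasing row index. The mask is built once and reused for every solve, so each solve only needs a single vectorized assignment.

```python
import numpy as np


def build_bc_mask(n, bc_rows):
    """Build (once) a boolean mask marking the rows that hold boundary conditions."""
    mask = np.zeros(n, dtype=bool)
    mask[list(bc_rows)] = True
    return mask


def apply_bcs(rhs, mask, bc_values):
    """Return a copy of rhs with the masked entries replaced by the boundary values.

    bc_values must be ordered by increasing row index (an assumption of this sketch).
    """
    out = rhs.copy()
    out[mask] = bc_values  # one vectorized write, no Python loop over rows
    return out


# Hypothetical example: 6 unknowns, last row replaced by a boundary condition.
n = 6
mask = build_bc_mask(n, bc_rows=[n - 1])  # cached and reused across solves

rhs = np.arange(n, dtype=float)
result = apply_bcs(rhs, mask, bc_values=np.array([2.0]))
```

The same pattern should carry over to GPU arrays with a NumPy-compatible library, where avoiding per-row indexing on every solve is likely to matter more.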

@pancetta
Member

Please merge master again

@pancetta pancetta merged commit 3d59549 into Parallel-in-Time:master Oct 8, 2024
86 checks passed
@brownbaerchen brownbaerchen deleted the tau_method_performance branch October 8, 2024 16:01