-
Notifications
You must be signed in to change notification settings - Fork 93
Open
Description
Hi,
Thanks for making this library and it's amazing to have these different CRFs wrapped up in a common and easy to use framework.
I've been playing with the LinearChainCRF and one thing I noticed is the memory usage can be very high during loss backward pass on both CPU and GPU. I found the FastLogSemiring
in fast_semirings.py
uses genbmm.logbmm()
and significantly reduce memory usage on GPU if I change the default LogSemiring
used in StructDistribution
class to FastLogSemiring
. However, I haven't seen this being documented anywhere so my questions are:
- Is
FastLogSemiring
ready to be used? It's not being included intest_semirings.py
- If so, what would be the best way to switch between
LogSemiring
andFastLogSemiring
? Is there a plan to introduce a parameter to choose between the semirings inStructDistribution
class?
Metadata
Metadata
Assignees
Labels
No labels