Commit f4342a3
New time-fused LSTM API. Performance improvements.
This change adds a new LSTM API that fuses operations across RNN
time steps. It performs significantly faster than the existing
iterative API.
The existing iterative LSTM API also received performance
improvements.
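The message does not say which operations are fused across time steps. As a hedged illustration of the general idea only, the sketch below assumes the fused work is the input projection, which does not depend on the recurrent state and can therefore be computed for all time steps in one matrix multiply. The function names (`matmul`, `project_iterative`, `project_fused`) are hypothetical and are not part of this library's API.

```python
# Illustrative sketch only: NOT this library's API.
# Assumption: the input projection x_t @ W is independent of the hidden
# state, so it can be computed for every time step in a single matmul.

def matmul(a, b):
    """Multiply matrix a (n x k) by matrix b (k x m); both are lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def project_iterative(xs, w):
    """One small matmul per time step; xs is a list of T (batch x input) matrices."""
    return [matmul(x_t, w) for x_t in xs]

def project_fused(xs, w):
    """Stack all time steps into one (T*batch x input) matrix, multiply once, split."""
    batch = len(xs[0])
    stacked = [row for x_t in xs for row in x_t]
    out = matmul(stacked, w)
    return [out[t * batch:(t + 1) * batch] for t in range(len(xs))]

# Three time steps, batch size 2, input size 2, projected to 3 outputs.
xs = [
    [[1.0, 2.0], [3.0, 4.0]],
    [[5.0, 6.0], [7.0, 8.0]],
    [[0.5, -1.0], [2.0, 0.0]],
]
w = [[1.0, 0.0, 2.0], [0.0, 1.0, -1.0]]

# Both strategies produce the same per-step projections.
assert project_fused(xs, w) == project_iterative(xs, w)
```

On a GPU, the single large multiply typically uses the hardware better than many small ones, which is one plausible source of the speedup described above. The recurrent part still has to run step by step.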
BREAKING CHANGES:
Previously, callers were expected to transpose `h` before passing
it to `BackwardPass`. Callers must not transpose `h` anymore.
The `dv` parameter in `BackwardPass` has been removed and `v` must
now be mutable.
1 parent 0a0222e commit f4342a3
File tree
7 files changed, +611 -228 lines changed
- examples
- frameworks/tf
- lib