Skip to content

Commit 088be20

Browse files
committed
Update Changelog
1 parent ae7c963 commit 088be20

File tree

1 file changed

+9
-4
lines changed

1 file changed

+9
-4
lines changed

CHANGELOG.md

Lines changed: 9 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -4,6 +4,7 @@ This file contains the changelog for the Deeploy project. The changelog is divid
44
## Unreleased (Planned Release Target: v0.2.1)
55

66
### List of Pull Requests
7+
- Support Fully Asynchronous DMAs [#114](https://github.com/pulp-platform/Deeploy/pull/114)
78
- Disallow shape inference [#128](https://github.com/pulp-platform/Deeploy/pull/128)
89
- Remove memory-aware node bindings [#123](https://github.com/pulp-platform/Deeploy/pull/123)
910
- Fix missing const's layout transformation and refactor NCHWtoNHWC passes [#122](https://github.com/pulp-platform/Deeploy/pull/122)
@@ -53,6 +54,8 @@ This file contains the changelog for the Deeploy project. The changelog is divid
5354
- Memory/I/O summaries and input/output logging in deployers
5455
- RequantHelpers.py for Neureka's TileConstraints
5556
- Added assertion that all the graph tensors after lowering have a shape annotated
57+
- Profiling support and optional comments in generated DMA code for better traceability
58+
- Added new waiting-strategy logic with fine-grained `PerTensorWaitingStrategy`
5659

5760
### Changed
5861
- Replaced platform-specific tags (`*-amd64`, `*-arm64`) with direct digest references in `Noelware/docker-manifest-action`.
@@ -89,6 +92,7 @@ This file contains the changelog for the Deeploy project. The changelog is divid
8992
- Removed Wmem variants of bindings and tile constraints from Neureka
9093
- Disabled ICCT_ITA_8 MemPool test because it was using a lowering that created shapeless tensors
9194
- Added missing shape annotation to the testTypeInferenceDifferentTypes
95+
- Refactored DMA code generation (`SnitchDma`, `Mchan`) to correctly overlap transfers and compute in double-buffering mode
9296

9397
### Fixed
9498
- Prevent node duplication for graphs generated via GraphSurgeon
@@ -102,6 +106,7 @@ This file contains the changelog for the Deeploy project. The changelog is divid
102106
- Fixed aliasing
103107
- Missing layout transformation of the const's (bias, mul, add, shift in Conv/RequantizedConv)
104108
- Keep mul/add rank of requantized Neureka tile constraints
109+
- DMA synchronization bug causing reduced DB performance on memory-bound kernels.
105110

106111
### Removed
107112
- Delete outdated and unused `.gitlab-ci.yml` file
@@ -177,9 +182,9 @@ This release containing major architectural changes, new platform support, enhan
177182

178183

179184
### Added
180-
- BatchNorm kernel
181-
- ConvTranspose kernel
182-
- MaxPool1D kernel
185+
- BatchNorm kernel
186+
- ConvTranspose kernel
187+
- MaxPool1D kernel
183188
- Template for 1D Convolution
184189
- Support for float32 data type in the previous kernels
185190
- Float binding for Pad1D kernel
@@ -318,7 +323,7 @@ This release containing major architectural changes, new platform support, enhan
318323

319324
### Changed
320325
- FloatConvTemplate file
321-
- Platform.py file
326+
- Platform.py file
322327
- Bump the CMake version to 3.24 as required for the chimera-sdk
323328
- Bump GVSoC's version and add chimera simulation target
324329
- Rename the generic source util to utils to avoid name collision with chimera-sdk

0 commit comments

Comments
 (0)