@@ -4,6 +4,7 @@ This file contains the changelog for the Deeploy project. The changelog is divid
44## Unreleased (Planned Release Target: v0.2.1)
55
66### List of Pull Requests
7+ - Support Fully Asynchronous DMAs [ #114 ] ( https://github.com/pulp-platform/Deeploy/pull/114 )
78- Disallow shape inference [ #128 ] ( https://github.com/pulp-platform/Deeploy/pull/128 )
89- Remove memory-aware node bindings [ #123 ] ( https://github.com/pulp-platform/Deeploy/pull/123 )
910- Fix missing const's layout transformation and refactor NCHWtoNHWC passes [ #122 ] ( https://github.com/pulp-platform/Deeploy/pull/122 )
@@ -53,6 +54,8 @@ This file contains the changelog for the Deeploy project. The changelog is divid
5354- Memory/I/O summaries and input/output logging in deployers
5455- RequantHelpers.py for Neureka's TileConstraints
5556- Added assertion that all the graph tensors after lowering have a shape annotated
57+ - Profiling support and optional comments in generated DMA code for better traceability
58+ - Added new waiting-strategy logic with fine-grained ` PerTensorWaitingStrategy `
5659
5760### Changed
5861- Replaced platform-specific tags (` *-amd64 ` , ` *-arm64 ` ) with direct digest references in ` Noelware/docker-manifest-action ` .
@@ -89,6 +92,7 @@ This file contains the changelog for the Deeploy project. The changelog is divid
8992- Removed Wmem variants of bindings and tile constraints from Neureka
9093- Disabled ICCT_ITA_8 MemPool test because it was using a lowering that created shapeless tensors
9194- Added missing shape annotation to the testTypeInferenceDifferentTypes
95+ - Refactored DMA code generation (` SnitchDma ` , ` Mchan ` ) to correctly overlap transfers and compute in double-buffering mode
9296
9397### Fixed
9498- Prevent node duplication for graphs generated via GraphSurgeon
@@ -102,6 +106,7 @@ This file contains the changelog for the Deeploy project. The changelog is divid
102106- Fixed aliasing
103107- Missing layout transformation of the const's (bias, mul, add, shift in Conv/RequantizedConv)
104108- Keep mul/add rank of requantized Neureka tile constraints
109+ - DMA synchronization bug causing reduced DB performance on memory-bound kernels.
105110
106111### Removed
107112- Delete outdated and unused ` .gitlab-ci.yml ` file
@@ -177,9 +182,9 @@ This release containing major architectural changes, new platform support, enhan
177182
178183
179184### Added
180- - BatchNorm kernel
181- - ConvTranspose kernel
182- - MaxPool1D kernel
185+ - BatchNorm kernel
186+ - ConvTranspose kernel
187+ - MaxPool1D kernel
183188- Template for 1D Convolution
184189- Support for float32 data type in the previous kernels
185190- Float binding for Pad1D kernel
@@ -318,7 +323,7 @@ This release containing major architectural changes, new platform support, enhan
318323
319324### Changed
320325- FloatConvTemplate file
321- - Platform.py file
326+ - Platform.py file
322327- Bump the CMake version to 3.24 as required for the chimera-sdk
323328- Bump GVSoC's version and add chimera simulation target
324329- Rename the generic source util to utils to avoid name collision with chimera-sdk
0 commit comments