Commit 4191627

[Docs] Update deeprec2306 release images and notes in README.md & RELEASE.md. (#922)
Signed-off-by: candy.dc <[email protected]>
1 parent 8b02c47 commit 4191627

8 files changed, with 94 additions and 10 deletions

README.md

Lines changed: 2 additions & 2 deletions
@@ -95,13 +95,13 @@ $ pip3 install /tmp/tensorflow_pkg/tensorflow-1.15.5+${version}-cp38-cp38m-linux
 #### Image for CPU

 ```
-alideeprec/deeprec-release:deeprec2304-cpu-py38-ubuntu20.04
+alideeprec/deeprec-release:deeprec2306-cpu-py38-ubuntu20.04
 ```

 #### Image for GPU CUDA11.6

 ```
-alideeprec/deeprec-release:deeprec2304-gpu-py38-cu116-ubuntu20.04
+alideeprec/deeprec-release:deeprec2306-gpu-py38-cu116-ubuntu20.04
 ```

 ***

RELEASE.md

Lines changed: 84 additions & 0 deletions
@@ -1,3 +1,87 @@
+# Release r1.15.5-deeprec2306
+
+## **Major Features and Improvements**
+
+### **Embedding**
+
+- Support StaticGPUHashMap to optimize EmbeddingVariable in inference.
+- Update logic of GroupEmbedding in feature_column API.
+- Refine APIs for forward-backward optimization.
+- Move insertion of new features into the backward process when using multi-tier storage.
+- Move insertion of new features into the backward ops.
+- Modify calculation logic of embedding lookup sparse combiner.
+- Add memory and performance tests of EmbeddingVariable.
+
+### **Graph & Grappler Optimization**
+
+- Support IteratorGetNext for SmartStage as a starting node for searching.
+- Reimplement PrefetchRunner in C++.
+
+### **Runtime Optimization**
+
+- Dispatch expensive ops via multiple threads in the threadpool.
+- Enable multi-stream in session_group by default.
+- Support loading saved_model with device information when using p and multi_stream.
+- Make ARENA_ARRAY_SIZE configurable.
+- Optimize EV allocator performance.
+- Integrate HybridBackend in collective training mode.
+
+### **Ops & Hardware Acceleration**
+
+- Disable MatMul fused with LeakyRelu when MKL is disabled.
+
+### **Serving**
+
+- Clear virtual_device configurations before loading a new checkpoint.
+
+### **Environment & Build**
+
+- Update docker images in user documents.
+- Update DEFAULT_CUDA_VERSION and DEFAULT_CUDNN_VERSION in configure.py.
+- Move third-party dependencies from WORKSPACE to workspace.bzl.
+- Update URLs corresponding to colm, ragel, aliyun-oss-sdk and uuid.
+- Update default TF_CUDA_COMPUTE_CAPABILITIES to 7.0,7.5,8.0,8.6.
+- Update SparseOperationKit to v23.5.01 and docker file.
+
+### **BugFix**
+
+- Fix issue of missing params while constructing the ngScope.
+- Fix memory leak to avoid OOM.
+- Fix shape validation in API shared_embedding_columns.
+- Fix the device placement bug of stage_subgraph_on_cpu in distributed.
+- Fix hang issue when using SOK and SmartStage simultaneously.
+- Fix bug: init global_step before saving variables.
+- Fix bug: reserve input nodes, clear saver devices on demand.
+- Fix memory leak when a graph node is invalid.
+
+### **ModelZoo**
+
+- Add examples and docs to demonstrate Collective Training.
+- Update documents and config files for modelzoo benchmark.
+- Update modelzoo README.
+
+### **Tool & Documents**
+
+- Update cases of configuring TF_CUDA_COMPUTE_CAPABILITIES for H100.
+- Update COMMITTERS.md.
+- Update device placement documents.
+- Update document for SmartStage.
+- Update session_group documents.
+- Update the download link of the library that Processor depends on.
+- Update sok to 1.20.
+
+More details of features: [https://deeprec.readthedocs.io/zh/latest/](https://deeprec.readthedocs.io/zh/latest/)
+
+## **Release Images**
+
+### **CPU Image**
+
+`alideeprec/deeprec-release:deeprec2306-cpu-py38-ubuntu20.04`
+
+### **GPU Image**
+
+`alideeprec/deeprec-release:deeprec2306-gpu-py38-cu116-ubuntu20.04`
+
 # Release r1.15.5-deeprec2304

 ## **Major Features and Improvements**

docs/docs_en/DeepRec-Compile-And-Install.md

Lines changed: 2 additions & 2 deletions
@@ -111,7 +111,7 @@ pip3 install /tmp/tensorflow_pkg/tensorflow-1.15.5+${version}-cp38-cp38m-linux_x

 x86_64:
 ```
-alideeprec/deeprec-release:deeprec2304-cpu-py38-ubuntu20.04
+alideeprec/deeprec-release:deeprec2306-cpu-py38-ubuntu20.04
 ```

 arm64:
@@ -122,5 +122,5 @@ alideeprec/deeprec-release:deeprec2302-cpu-py38-ubuntu22.04-arm64
 **GPU Image with CUDA 11.6**

 ```
-alideeprec/deeprec-release:deeprec2304-gpu-py38-cu116-ubuntu20.04
+alideeprec/deeprec-release:deeprec2306-gpu-py38-cu116-ubuntu20.04
 ```

docs/docs_en/Estimator-Compile-And-Install.md

Lines changed: 1 addition & 1 deletion
@@ -40,7 +40,7 @@ DeepRec provide new distributed protocols such as grpc++ and star_server, which

 Source Code: [https://github.com/DeepRec-AI/estimator](https://github.com/DeepRec-AI/estimator)

-Develop Branch:master, Latest Release Branch: deeprec2304
+Develop Branch:master, Latest Release Branch: deeprec2306

 ## Estimator Build

docs/docs_en/TFServing-Compile-And-Install.md

Lines changed: 1 addition & 1 deletion
@@ -39,7 +39,7 @@ We provide optimized TFServing which could highly improve performance in inferen

 Source Code: [https://github.com/DeepRec-AI/serving](https://github.com/DeepRec-AI/serving)

-Develop Branch: master, Latest Release Branch: deeprec2304
+Develop Branch: master, Latest Release Branch: deeprec2306

 ## TFServing Build

docs/docs_zh/DeepRec-Compile-And-Install.md

Lines changed: 2 additions & 2 deletions
@@ -108,7 +108,7 @@ pip3 install /tmp/tensorflow_pkg/tensorflow-1.15.5+${version}-cp38-cp38m-linux_x

 x86_64:
 ```
-alideeprec/deeprec-release:deeprec2304-cpu-py38-ubuntu20.04
+alideeprec/deeprec-release:deeprec2306-cpu-py38-ubuntu20.04
 ```

 arm64:
@@ -119,7 +119,7 @@ alideeprec/deeprec-release:deeprec2302-cpu-py38-ubuntu22.04-arm64
 **GPU Image with CUDA 11.6**

 ```
-alideeprec/deeprec-release:deeprec2304-gpu-py38-cu116-ubuntu20.04
+alideeprec/deeprec-release:deeprec2306-gpu-py38-cu116-ubuntu20.04
 ```

 ## DeepRec Processor Build and Packaging

docs/docs_zh/Estimator-Compile-And-Install.md

Lines changed: 1 addition & 1 deletion
@@ -40,7 +40,7 @@

 Source Code: [https://github.com/DeepRec-AI/estimator](https://github.com/DeepRec-AI/estimator)

-Develop Branch: master, Latest Release Branch: deeprec2304
+Develop Branch: master, Latest Release Branch: deeprec2306

 ## Estimator Build

docs/docs_zh/TFServing-Compile-And-Install.md

Lines changed: 1 addition & 1 deletion
@@ -39,7 +39,7 @@

 Source Code: [https://github.com/DeepRec-AI/serving](https://github.com/DeepRec-AI/serving)

-Develop Branch: master, Latest Release Branch: deeprec2304
+Develop Branch: master, Latest Release Branch: deeprec2306

 ## TFServing Build & Packaging
