
Commit c66f0e0

author
Jenkins
committed
Compute Library v21.08
1 parent dfc8109 commit c66f0e0

File tree

1,439 files changed

+201865
-70048
lines changed

Android.bp

Lines changed: 247 additions & 106 deletions
Large diffs are not rendered by default.

README.md

Lines changed: 81 additions & 24 deletions
@@ -1,34 +1,70 @@
1+
2+
<br>
13
<div align="center">
2-
<img src="https://raw.githubusercontent.com/ARM-software/ComputeLibrary/gh-pages/ACL_logo.png"><br><br>
4+
<img src="https://raw.githubusercontent.com/ARM-software/ComputeLibrary/gh-pages/ACL_logo.png"/><br><br>
35
</div>
46

5-
Release repository: https://github.com/arm-software/ComputeLibrary
7+
# Compute Library ![](https://img.shields.io/badge/latest_release-21.08-green)
68

7-
Development repository: https://review.mlplatform.org/#/admin/projects/ml/ComputeLibrary
89

9-
Please report issues here: https://github.com/ARM-software/ComputeLibrary/issues
10+
The Compute Library is a collection of low-level machine learning functions optimized for Arm® Cortex®-A CPU and Arm® Mali™ GPU architectures.<br>
1011

11-
**Make sure you are using the latest version of the library before opening an issue. Thanks**
12+
The library provides superior performance to other open source alternatives and immediate support for new Arm® technologies such as SVE2.
1213

13-
News:
14+
Key Features:
1415

15-
- [Gian Marco's talk on Performance Analysis for Optimizing Embedded Deep Learning Inference Software](https://www.embedded-vision.com/platinum-members/arm/embedded-vision-training/videos/pages/may-2019-embedded-vision-summit)
16-
- [Gian Marco's talk on optimizing CNNs with Winograd algorithms at the EVS](https://www.embedded-vision.com/platinum-members/arm/embedded-vision-training/videos/pages/may-2018-embedded-vision-summit-iodice)
17-
- [Gian Marco's talk on using SGEMM and FFTs to Accelerate Deep Learning](https://www.embedded-vision.com/platinum-members/arm/embedded-vision-training/videos/pages/may-2016-embedded-vision-summit-iodice)
16+
- Open source software available under a permissive MIT license
17+
- Over 100 machine learning functions for CPU and GPU
18+
- Multiple convolution algorithms (GeMM, Winograd, FFT, Direct and indirect-GeMM)
19+
- Support for multiple data types: FP32, FP16, INT8, UINT8, BFLOAT16
20+
- Micro-architecture optimization for key ML primitives
21+
- Highly configurable build options enabling lightweight binaries
22+
- Advanced optimization techniques such as kernel fusion, Fast math enablement and texture utilization
23+
- Device and workload specific tuning using OpenCL tuner and GeMM optimized heuristics
1824

19-
Related projects:
25+
<br>
2026

21-
- [Arm NN SDK](https://github.com/arm-software/armnn)
27+
| Repository | Link |
28+
| ----------- | ----------- |
29+
| Release | https://github.com/arm-software/ComputeLibrary |
30+
| Development | https://review.mlplatform.org/#/admin/projects/ml/ComputeLibrary |
2231

23-
Tutorials:
32+
<br>
2433

25-
- [Tutorial: Running AlexNet on Raspberry Pi with Compute Library](https://community.arm.com/processors/b/blog/posts/running-alexnet-on-raspberry-pi-with-compute-library)
34+
## Documentation
35+
[![Documentation](https://img.shields.io/badge/documentation-21.08-green)](https://arm-software.github.io/ComputeLibrary/latest)
36+
37+
> Note: The documentation includes the reference API, changelogs, build guide, contribution guide, errata, etc.
38+
39+
<br>
40+
41+
## Pre-built binaries
42+
All the binaries can be downloaded from [here](https://github.com/ARM-software/ComputeLibrary/releases) or from the tables below.
43+
44+
<br>
45+
46+
| Platform | Operating System | Release archive (Download) |
47+
| ----------- | ----------- | ----------- |
48+
| Raspberry Pi 4 | Linux 32bit | [![](https://img.shields.io/badge/build-neon-orange)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-linux-armv7a-neon.tar.gz) |
49+
| Raspberry Pi 4 | Linux 64bit | [![](https://img.shields.io/badge/build-neon-orange)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-linux-arm64-v8a-neon.tar.gz) |
50+
| Odroid N2 | Linux 64bit | [![](https://img.shields.io/badge/build-neon-orange)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-linux-arm64-v8a-neon.tar.gz) [![](https://img.shields.io/badge/build-opencl-blue)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-linux-arm64-v8a-cl.tar.gz) [![](https://img.shields.io/badge/build-neon+cl-yellowgreen)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-linux-arm64-v8a-neon-cl.tar.gz) |
51+
| HiKey960 | Linux 64bit | [![](https://img.shields.io/badge/build-neon-orange)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-linux-arm64-v8a-neon.tar.gz) [![](https://img.shields.io/badge/build-opencl-blue)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-linux-arm64-v8a-cl.tar.gz) [![](https://img.shields.io/badge/build-neon+cl-yellowgreen)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-linux-arm64-v8a-neon-cl.tar.gz) |
2652

27-
Documentation (API, changelogs, build guide, contribution guide, errata, etc.) available at https://github.com/ARM-software/ComputeLibrary/wiki/Documentation.
53+
<br>
2854

29-
Binaries available at https://github.com/ARM-software/ComputeLibrary/releases.
55+
| Architecture | Operating System | Release archive (Download) |
56+
| ----------- | ----------- | ----------- |
57+
| armv7 | Android | [![](https://img.shields.io/badge/build-neon-orange)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-android-armv7a-neon.tar.gz) [![](https://img.shields.io/badge/build-opencl-blue)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-android-armv7a-cl.tar.gz) [![](https://img.shields.io/badge/build-neon+cl-yellowgreen)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-android-armv7a-neon-cl.tar.gz) |
58+
| armv7 | Linux | [![](https://img.shields.io/badge/build-neon-orange)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-linux-armv7a-neon.tar.gz) [![](https://img.shields.io/badge/build-opencl-blue)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-linux-armv7a-cl.tar.gz) [![](https://img.shields.io/badge/build-neon+cl-yellowgreen)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-linux-armv7a-neon-cl.tar.gz) |
59+
| arm64-v8a | Android | [![](https://img.shields.io/badge/build-neon-orange)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-android-arm64-v8a-neon.tar.gz) [![](https://img.shields.io/badge/build-opencl-blue)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-android-arm64-v8a-cl.tar.gz) [![](https://img.shields.io/badge/build-neon+cl-yellowgreen)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-android-arm64-v8a-neon-cl.tar.gz) |
60+
| arm64-v8a | Linux | [![](https://img.shields.io/badge/build-neon-orange)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-linux-arm64-v8a-neon.tar.gz) [![](https://img.shields.io/badge/build-opencl-blue)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-linux-arm64-v8a-cl.tar.gz) [![](https://img.shields.io/badge/build-neon+cl-yellowgreen)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-linux-arm64-v8a-neon-cl.tar.gz) |
61+
| arm64-v8.2-a | Android | [![](https://img.shields.io/badge/build-neon-orange)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-android-arm64-v8.2-a-neon.tar.gz) [![](https://img.shields.io/badge/build-opencl-blue)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-android-arm64-v8.2-a-cl.tar.gz) [![](https://img.shields.io/badge/build-neon+cl-yellowgreen)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-android-arm64-v8.2-a-neon-cl.tar.gz) |
62+
| arm64-v8.2-a | Linux | [![](https://img.shields.io/badge/build-neon-orange)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-linux-arm64-v8.2-a-neon.tar.gz) [![](https://img.shields.io/badge/build-opencl-blue)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-linux-arm64-v8.2-a-cl.tar.gz) [![](https://img.shields.io/badge/build-neon+cl-yellowgreen)](https://arm-software.github.io/ComputeLibrary/releases/download/v21.08/arm_compute-v21.08-bin-linux-arm64-v8.2-a-neon-cl.tar.gz) |
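The download links in the tables above appear to follow a single naming pattern. As a rough sketch, assuming that pattern holds for every listed combination, the URL for a given archive could be assembled as below. The function name `release_archive_url` and the constant names are hypothetical and are not part of the Compute Library; the pattern is inferred only from the links shown here.

```python
# Sketch only: the URL pattern is inferred from the links in the tables above.
# All names here are illustrative and not part of the Compute Library.
BASE_URL = "https://arm-software.github.io/ComputeLibrary/releases/download"

# Values that appear in the archive names listed in the tables.
OPERATING_SYSTEMS = ("android", "linux")
ARCHITECTURES = ("armv7a", "arm64-v8a", "arm64-v8.2-a")
BACKENDS = ("neon", "cl", "neon-cl")


def release_archive_url(os_name, arch, backend, version="21.08"):
    """Build the download URL for one pre-built archive."""
    if os_name not in OPERATING_SYSTEMS:
        raise ValueError(f"unknown operating system: {os_name!r}")
    if arch not in ARCHITECTURES:
        raise ValueError(f"unknown architecture: {arch!r}")
    if backend not in BACKENDS:
        raise ValueError(f"unknown backend: {backend!r}")
    archive = f"arm_compute-v{version}-bin-{os_name}-{arch}-{backend}.tar.gz"
    return f"{BASE_URL}/v{version}/{archive}"


if __name__ == "__main__":
    # Example: the Neon + OpenCL build for 64-bit Linux.
    print(release_archive_url("linux", "arm64-v8a", "neon-cl"))
```

Validating the inputs against the values seen in the tables keeps a mistyped architecture from silently producing a URL that does not exist.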
3063

31-
### Supported Architectures/Technologies
64+
<br>
65+
66+
67+
## Supported Architectures/Technologies
3268

3369
- Arm® CPUs:
3470
- Arm® Cortex®-A processor family using Arm® Neon™ technology
@@ -41,32 +77,52 @@ Binaries available at https://github.com/ARM-software/ComputeLibrary/releases.
4177

4278
- x86
4379

44-
### Supported Systems
80+
<br>
81+
82+
## Supported Systems
4583

4684
- Android™
4785
- Bare Metal
4886
- Linux®
4987
- macOS®
5088
- Tizen™
5189

52-
## License and Contributions
90+
<br>
5391

54-
The software is provided under MIT license. Contributions to this project are accepted under the same license.
92+
## Resources
93+
- [Tutorial: Running AlexNet on Raspberry Pi with Compute Library](https://community.arm.com/processors/b/blog/posts/running-alexnet-on-raspberry-pi-with-compute-library)
94+
- [Gian Marco's talk on Performance Analysis for Optimizing Embedded Deep Learning Inference Software](https://www.embedded-vision.com/platinum-members/arm/embedded-vision-training/videos/pages/may-2019-embedded-vision-summit)
95+
- [Gian Marco's talk on optimizing CNNs with Winograd algorithms at the EVS](https://www.embedded-vision.com/platinum-members/arm/embedded-vision-training/videos/pages/may-2018-embedded-vision-summit-iodice)
96+
- [Gian Marco's talk on using SGEMM and FFTs to Accelerate Deep Learning](https://www.embedded-vision.com/platinum-members/arm/embedded-vision-training/videos/pages/may-2016-embedded-vision-summit-iodice)
5597

56-
### Public mailing list
57-
For technical discussion, the ComputeLibrary project has a public mailing list: [email protected]
58-
The list is open to anyone inside or outside of Arm to self subscribe. In order to subscribe, please visit the following website:
59-
https://lists.linaro.org/mailman/listinfo/acl-dev
98+
<br>
99+
100+
## How to contribute
101+
102+
Contributions to the Compute Library are more than welcome. If you are interested in contributing, please have a look at our [how to contribute guidelines](https://arm-software.github.io/ComputeLibrary/latest/contribution_guidelines.xhtml).
60103

61104
### Developer Certificate of Origin (DCO)
62-
Before the ComputeLibrary project accepts your contribution, you need to certify its origin and give us your permission. To manage this process we use the Developer Certificate of Origin (DCO) V1.1 (https://developercertificate.org/)
105+
Before the Compute Library accepts your contribution, you need to certify its origin and give us your permission. To manage this process we use the Developer Certificate of Origin (DCO) V1.1 (https://developercertificate.org/)
63106

64107
To indicate that you agree to the terms of the DCO, you "sign off" your contribution by adding a line with your name and e-mail address to every git commit message:
65108

66109
```Signed-off-by: John Doe <[email protected]>```
67110

68111
You must use your real name; pseudonyms and anonymous contributions are not accepted.
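In practice, `git commit -s` (or `--signoff`) appends this trailer using the name and e-mail address from your git configuration. The sketch below shows one way a commit message might be checked for a sign-off line before submission. The helper name `has_sign_off`, the regular expression, and the example name and address are illustrative assumptions, not the project's actual review tooling.

```python
import re

# Matches a line of the form "Signed-off-by: Full Name <user@host>".
# This pattern is an illustrative assumption, not the project's own check.
SIGN_OFF_PATTERN = re.compile(
    r"^Signed-off-by: .+ <[^<>\s]+@[^<>\s]+>$",
    re.MULTILINE,
)


def has_sign_off(commit_message):
    """Return True if any line of the message is a sign-off trailer."""
    return bool(SIGN_OFF_PATTERN.search(commit_message))


if __name__ == "__main__":
    # Placeholder name and address, used only for this example.
    message = "Fix typo in README\n\nSigned-off-by: Jane Roe <jane.roe@example.com>\n"
    print(has_sign_off(message))
```

A check like this only confirms that the line is present and well formed; it cannot verify that the name is real, which remains the contributor's responsibility.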
69112

113+
### Public mailing list
114+
For technical discussion, the ComputeLibrary project has a public mailing list: [email protected]
115+
The list is open to anyone inside or outside of Arm to self-subscribe. To subscribe, please visit the following website:
116+
https://lists.linaro.org/mailman/listinfo/acl-dev
117+
118+
<br>
119+
120+
## License and Contributions
121+
122+
The software is provided under the MIT license. Contributions to this project are accepted under the same license.
123+
124+
<br>
125+
70126
## Trademarks and Copyrights
71127

72128
Android is a trademark of Google LLC.
@@ -79,3 +135,4 @@ Mac and macOS are trademarks of Apple Inc., registered in the U.S. and other
79135
countries.
80136

81137
Tizen is a registered trademark of The Linux Foundation.
138+
