Skip to content

Commit 7e10758

Browse files
authored
Merge branch 'main' into lzf_24.05-01
2 parents e26ced0 + 808a503 commit 7e10758

File tree

154 files changed

+2314
-169
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

154 files changed

+2314
-169
lines changed

CONTRIBUTING.md

Lines changed: 55 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,55 @@
1+
# Contributing to this repository
2+
3+
We welcome your contributions! There are multiple ways to contribute.
4+
5+
## Opening issues
6+
7+
For bugs or enhancement requests, please file a GitHub issue unless it's
8+
security related. When filing a bug remember that the better written the bug is,
9+
the more likely it is to be fixed. If you think you've found a security
10+
vulnerability, do not raise a GitHub issue and follow the instructions in our
11+
[security policy](./SECURITY.md).
12+
13+
## Contributing code
14+
15+
We welcome your code contributions. Before submitting code via a pull request,
16+
you will need to have signed the [Oracle Contributor Agreement][OCA] (OCA) and
17+
your commits need to include the following line using the name and e-mail
18+
address you used to sign the OCA:
19+
20+
```text
21+
Signed-off-by: Your Name <[email protected]>
22+
```
23+
24+
This can be automatically added to pull requests by committing with `--sign-off`
25+
or `-s`, e.g.
26+
27+
```text
28+
git commit --signoff
29+
```
30+
31+
Only pull requests from committers that can be verified as having signed the OCA
32+
can be accepted.
33+
34+
## Pull request process
35+
36+
1. Ensure there is an issue created to track and discuss the fix or enhancement
37+
you intend to submit.
38+
1. Fork this repository.
39+
1. Create a branch in your fork to implement the changes. We recommend using
40+
the issue number as part of your branch name, e.g. `1234-fixes`.
41+
1. Ensure that any documentation is updated with the changes that are required
42+
by your change.
43+
1. Ensure that any samples are updated if the base image has been changed.
44+
1. Submit the pull request. *Do not leave the pull request blank*. Explain exactly
45+
what your changes are meant to do and provide simple steps on how to validate.
46+
your changes. Ensure that you reference the issue you created as well.
47+
1. We will assign the pull request to 2-3 people for review before it is merged.
48+
49+
## Code of conduct
50+
51+
Follow the [Golden Rule](https://en.wikipedia.org/wiki/Golden_Rule). If you'd
52+
like more specific guidelines, see the [Contributor Covenant Code of Conduct][COC].
53+
54+
[OCA]: https://oca.opensource.oracle.com
55+
[COC]: https://www.contributor-covenant.org/version/1/4/code-of-conduct/

SECURITY.md

Lines changed: 38 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,38 @@
1+
# Reporting security vulnerabilities
2+
3+
Oracle values the independent security research community and believes that
4+
responsible disclosure of security vulnerabilities helps us ensure the security
5+
and privacy of all our users.
6+
7+
Please do NOT raise a GitHub Issue to report a security vulnerability. If you
8+
believe you have found a security vulnerability, please submit a report to
9+
[[email protected]][1] preferably with a proof of concept. Please review
10+
some additional information on [how to report security vulnerabilities to Oracle][2].
11+
We encourage people who contact Oracle Security to use email encryption using
12+
[our encryption key][3].
13+
14+
We ask that you do not use other channels or contact the project maintainers
15+
directly.
16+
17+
Non-vulnerability related security issues including ideas for new or improved
18+
security features are welcome on GitHub Issues.
19+
20+
## Security updates, alerts and bulletins
21+
22+
Security updates will be released on a regular cadence. Many of our projects
23+
will typically release security fixes in conjunction with the
24+
Oracle Critical Patch Update program. Additional
25+
information, including past advisories, is available on our [security alerts][4]
26+
page.
27+
28+
## Security-related information
29+
30+
We will provide security related information such as a threat model, considerations
31+
for secure use, or any known security issues in our documentation. Please note
32+
that labs and sample code are intended to demonstrate a concept and may not be
33+
sufficiently hardened for production use.
34+
35+
[1]: mailto:[email protected]
36+
[2]: https://www.oracle.com/corporate/security-practices/assurance/vulnerability/reporting.html
37+
[3]: https://www.oracle.com/security-alerts/encryptionkey.html
38+
[4]: https://www.oracle.com/security-alerts/

app-dev/devops-and-containers/oke/README.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -37,6 +37,10 @@ Reviewed: 20.12.2023
3737
- [How to deploy Ingress Nginx using Helm in OCI](https://medium.com/@ivan-delic/deploy-ingress-nginx-using-helm-in-oci-c3ff4d9d5450)
3838
- [Version Control Your Database in a Microservices Architecture](https://www.youtube.com/watch?v=qh9jppe_yNI)
3939
- [Kubernetes Service Operator for OCI Streaming — First steps](https://blogs.oracle.com/developers/post/using-oci-with-k8s)
40+
- [Disaster Recovery — Notes on Velero and OKE, Part 1: Stateless Pods](https://medium.com/oracledevs/disaster-recovery-notes-on-velero-and-oke-part-1-stateless-pods-b4ba3e737386)
41+
- [Disaster Recovery — Notes on Velero and OKE, Part 2: Stateful Pods with Persistent Volumes and Block Volume](https://medium.com/oracledevs/disaster-recovery-notes-on-velero-and-oke-part-2-stateful-pods-with-persistent-volumes-and-80204b3ac6d7)
42+
- [Disaster Recovery: Notes on Velero and OKE — part 3: Stateful Pods with Persistent Volumes and File Storage](https://medium.com/oracledevs/oke-disaster-recovery-notes-on-velero-and-oke-part-3-stateful-pods-with-persistent-volumes-and-a6eacef7600b)
43+
- [Test S3 Compatibility - Preparing Backups and DR for OKE and Velero](https://github.com/fharris/oci-s3-compatibility)
4044

4145

4246
# Useful Links
Binary file not shown.
Binary file not shown.

cloud-infrastructure/ai-infra-gpu/Benchmarks/gds-benchmarks/README.md

Lines changed: 5 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,9 +1,10 @@
11
# DISCLAIMER
2-
This code is provided as a starting point for your own benchmarks or for adaptation to your specific needs. It is not production-ready, and may lack a testing strategy, requiring modifications to function properly.
32

3+
This code is provided as a starting point for your own benchmarks or for adaptation to your specific needs. It is not production-ready, and may lack a testing strategy, requiring modifications to function properly.
44

55
# gds-benchmakrs
6-
```
6+
7+
```bash
78
[root@machine gds_benchmarks]# tree -L 2
89
.
910
├── backups
@@ -34,12 +35,12 @@ The **'scripts'** directory contains all the bash scripts used for initializing
3435

3536
During operation, **'launch.sh'** will conduct multiple tests using a variety of block and file sizes, which are predefined (Block sizes include: 4k, 8k, 16k, 32k, 64k, 128k, 256k, 512k, 1M, 2M, 4M, 8M, 16M). Each test iteration is performed four times. The file size varies depending on the number of threads, and to maintain efficiency, file sizes are kept relatively small. The GDSIO benchmark scales the file size with the number of threads, which can lead to extensive data processing. For example, running eight threads with a 128G file size would result in a total of 1024 TB of data being written, reflecting the scale of operations handled by this setup.
3637

37-
3838
Each test will be logged under the corresponding directory. The naming of the log files has the following format: ``gdsio_s<block size>_m<mode number>_w<threads / workers>_r<repetition number>_d<timestamp>.log``
3939
gdsio_s1M_m2_w1_r1_d1671836744.log
4040
For example, this log refers to a test that used 1M as block size, Mode 2 (CPU_GPU), 1 thread, and its timestamp.
4141
Each log also contains the full command that was issued to run:
42-
```
42+
43+
```bash
4344
/usr/local/cuda/gds/tools/gdsio -D /mnt/nvme0/ -d 0 -w 1 -s 8G -x 2 -i 1M -I 1 -V >> /home/ubuntu/gds_benchmarks/scripts/../logs/CPU_GPU//gdsio_s1M_m2_w1_r1_d1671836744.log
4445
IoType: WRITE XferType: CPU_GPU Threads: 1 DataSetSize: 8388608/8388608(KiB) IOSize: 1024(KiB) Throughput: 4.179353 GiB/sec, Avg_Latency: 233.618286 usecs ops: 8192 total_time 1.914172 secs
4546
Verifying data

cloud-infrastructure/ai-infra-gpu/README.md

Lines changed: 17 additions & 14 deletions
Original file line numberDiff line numberDiff line change
@@ -1,28 +1,31 @@
11
# AI infrastructure and GPU
22

3-
We are a team of technical specialists for AI infrastructure, graphics, and HPC on NVIDIA GPUs at OCI for
4-
- AI infrastructure for large-scale AI (Generative AI Large language model training, NLP) and small-scale AI (predictive AI, recommender systems, speech recognition, computer vision)
5-
- Computational Fluid Dynamics, Finite Element Analysis, Molecular Dynamics, Monte Carlo Simulations
6-
- Parallel File Systems (Weka, GlusterFS, BeeGFS, Lustre, IBM Spectrum Scale/GPFS)
7-
- Graphics (streaming, gaming, VDI)
8-
who support customers across various industries and geographies in EMEA (local support in 10 languages incl. German, French, Dutch, Italian & Spanish) with
9-
- Presentations
10-
- Demonstrations
11-
- PoCs/benchmarks
12-
- Architecture designs & sizing
13-
14-
# Useful Links
3+
We are a team of technical specialists for AI infrastructure, graphics, and HPC on NVIDIA GPUs at OCI for:
4+
5+
- AI infrastructure for large-scale AI (Generative AI Large language model training, NLP) and small-scale AI (predictive AI, recommender systems, speech recognition, computer vision)
6+
- Computational Fluid Dynamics, Finite Element Analysis, Molecular Dynamics, Monte Carlo Simulations
7+
- Parallel File Systems (Weka, GlusterFS, BeeGFS, Lustre, IBM Spectrum Scale/GPFS)
8+
- Graphics (streaming, gaming, VDI)
9+
10+
Who support customers across various industries and geographies in EMEA (local support in 10 languages incl. German, French, Dutch, Italian & Spanish) with:
11+
12+
- Presentations
13+
- Demonstrations
14+
- PoCs/benchmarks
15+
- Architecture designs & sizing
16+
17+
## Useful Links
1518

1619
- [Oracle Cloud GPU Instances](https://www.oracle.com/uk/cloud/compute/gpu/)
1720
- High-level description of Oracle Cloud GPU offering
1821
- [Oracle Cloud AI Infrastructure](https://www.oracle.com/uk/ai-infrastructure/)
19-
- High-level description of Oracle Cloud Artificial Intelligence infrastructure offering
22+
- High-level description of Oracle Cloud Artificial Intelligence infrastructure offering
2023
- [GPU instances details](https://docs.oracle.com/en-us/iaas/Content/Compute/References/computeshapes.htm#bm-gpu)
2124
- Oracle Cloud Bare-Metal GPU specifications
2225
- [AI infrastructure - Best practices on OCI](https://www.oracle.com/cloud/oci-best-practices-guide/#ai-infra-on-oci)
2326
- Guides, architectures, videos, and automation scripts to plan, design, and deploy AI workloads on our GPU compute and OCI services.
2427

25-
# License
28+
## License
2629

2730
Copyright (c) 2024 Oracle and/or its affiliates.
2831

cloud-infrastructure/ai-infra-gpu/AI Infrastructure/README.md renamed to cloud-infrastructure/ai-infra-gpu/ai-infrastructure/README.md

Lines changed: 17 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -1,18 +1,18 @@
11
# GPU Compute Instances
2-
2+
33
This repository contains detailed information related to Oracle Cloud Infrastructure GPU compute instances.
4-
5-
Reviewed: 26.02.20224
6-
4+
5+
Reviewed: 26.02.2024
6+
77
# Table of Contents
8-
8+
99
1. [Team Publications](#team-publications)
1010
2. [Useful Links](#useful-links)
11-
11+
1212
# Team Publications
13-
13+
1414
## Blog Posts
15-
15+
1616
- [Oracle and UbiOps make your AI workloads land on OCI with NVIDIA GPUs](https://blogs.oracle.com/cloud-infrastructure/post/ubiops-ai-workloads-on-oci)
1717
- Deploy UbiOps on Oracle Cloud using Oracle Kubernetes Engines (OKE) and GPU instances
1818
- [A Deep Fake Example on Oracle Cloud Infrastructure](https://blogs.oracle.com/cloud-infrastructure/post/a-deepfake-example-on-oracle-cloud-infrastructure)
@@ -34,7 +34,7 @@ Reviewed: 26.02.20224
3434

3535
- [Deploying NVIDIA NIM on OKE with model repository stored on Object Storage](https://docs.oracle.com/en/learn/deploy-nvidia-nim-on-oke/)
3636
- Step-by-step tutorial to run inference at scale using NVIDIA NIM.
37-
37+
3838
## LiveLabs
3939

4040
- [Running Faceswap on OCI](https://apexapps.oracle.com/pls/apex/r/dbpm/livelabs/view-workshop?wid=798&session=4731494759204)
@@ -51,26 +51,25 @@ Reviewed: 26.02.20224
5151

5252
- [Running AI on NVIDIA A10 GPUs in Oracle Cloud](https://www.youtube.com/watch?v=auvNHWoblIA)
5353
- How to run AI on NVIDIA A10 GPUs in Oracle Cloud
54-
55-
## Re-useable Assets
54+
55+
## Re-useable Assets
5656

5757
- [Navigating the AI Revolution: Opportunities and Challenges - German - efcom Symposium 2023](https://www.youtube.com/watch?v=r_ZG2sUzhqo)
5858
- For all German speakers interested in hearing high-level about the foundations of GPU, what AI is (and how it is connected to Machine Learning, Data Science, LLM, GenAI), and how Oracle is approaching it.
5959

6060
# Useful Links
61-
61+
6262
- [NVIDIA H100 GPUs on Oracle Cloud](https://blogs.oracle.com/cloud-infrastructure/post/general-availability-oci-compute-nvidia-h100)
6363
- Announcing general availability for OCI Compute bare metal instances powered by NVIDIA H100 GPUs
6464
- [NVIDIA L40S GPUs on Oracle Cloud](https://blogs.oracle.com/cloud-infrastructure/post/announcing-oci-compute-nvidia-l40s-gpus)
65-
- Announcing OCI Compute with NVIDIA L40S GPUs
65+
- Announcing OCI Compute with NVIDIA L40S GPUs
6666
- [Superclusters with RDMA](https://blogs.oracle.com/cloud-infrastructure/post/superclusters-rdma-high-performance)
67-
- Deep dive explanation about Oracle Cloud-specific network for GPU workload at scale
67+
- Deep dive explanation about Oracle Cloud-specific network for GPU workload at scale
6868

69-
7069
# License
71-
70+
7271
Copyright (c) 2024 Oracle and/or its affiliates.
73-
72+
7473
Licensed under the Universal Permissive License (UPL), Version 1.0.
75-
74+
7675
See [LICENSE](https://github.com/oracle-devrel/technology-engineering/blob/main/LICENSE) for more details.

0 commit comments

Comments
 (0)