You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/genai/02_external_llms/01_llm_access.mdx
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -6,8 +6,8 @@
6
6
7
7
Broadly it facilitates:
8
8
-**Route requests** via configurations that can allow for fallbacks, load balance, etc
9
-
-**Observability** via the control pane that displays your usage statistics and logs. You can retrive your logs via an API.
10
-
-**Prompt management** via the prompt playground. You can compare the respone for a prompt across multiple LLMs, collaborate with your team and export your prompts.
9
+
-**Observability** via the control pane that displays your usage statistics and logs. You can retrieve your logs via an API.
10
+
-**Prompt management** via the prompt playground. You can compare the response for a prompt across multiple LLMs, collaborate with your team and export your prompts.
11
11
-**Guardrails** usage by allowing you to define them or integrate with third party guardrails.
12
12
-**Agentic workflows** by integrating with various agentic frameworks like langchain, llamaindex, etc.
13
13
-**Security & Governance** by allowing you to set budget and rate limits on the API keys created for your workspace
Copy file name to clipboardExpand all lines: docs/genai/05_how_to_guides/01_temperature.md
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,12 +1,12 @@
1
1
# Temperature
2
2
3
-
Generating text (or images) from LLMs is inherently probabilistic. However, as an end user you have many parameters at your disposal to tweak the behavior of LLMs. Of these, temperature is the most commonly used. Broadly, it controls the randonmess of the generated text. A lower temperature produces more deterministic outputs, while a higher temperature produces more random "creative" output. For a more comprehensive explanation on this topic, refer to the following:
3
+
Generating text (or images) from LLMs is inherently probabilistic. However, as an end user you have many parameters at your disposal to tweak the behavior of LLMs. Of these, temperature is the most commonly used. Broadly, it controls the randomness of the generated text. A lower temperature produces more deterministic outputs, while a higher temperature produces more random "creative" output. For a more comprehensive explanation on this topic, refer to the following:
4
4
-[How to generate text: using different decoding methods for language generation with Transformers](https://huggingface.co/blog/how-to-generate)
5
5
-[What is LLM Temperature?](https://www.ibm.com/think/topics/llm-temperature)
6
6
7
7
:::tip
8
8
9
-
The effect of temperature is probabalistic, so you might need to run the script repeatedly to obtain a representative sample of generated text from the LLM.
9
+
The effect of temperature is probabilistic, so you might need to run the script repeatedly to obtain a representative sample of generated text from the LLM.
Copy file name to clipboardExpand all lines: docs/hpc/01_getting_started/02_getting_and_renewing_an_account.md
+3-3Lines changed: 3 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -8,7 +8,7 @@
8
8
9
9
[hpc account request form link for courses]: https://docs.google.com/forms/d/e/1FAIpQLSdehngqL1xso-YV6MOhplKNwxXjASHYnDtM_5THB3P2vrDKcg/viewform?usp=sf_link
10
10
11
-
This section deals with the eligibility for getting HPC accounts and the process to create new ones, renew existing oaccountsThis section deals with the eligibility for getting HPC accounts, the process to create new accounts, renew existing ones and touches on access policies after graduation fom NYU and access for non-NYU researchers.
11
+
This section deals with the eligibility for getting HPC accounts and the process to create new ones, renew existing oaccountsThis section deals with the eligibility for getting HPC accounts, the process to create new accounts, renew existing ones and touches on access policies after graduation from NYU and access for non-NYU researchers.
12
12
13
13
:::info
14
14
@@ -61,7 +61,7 @@ HPC faculty sponsors are expected to:
61
61
62
62
- Respond promptly to account-related requests from HPC staff
63
63
64
-
Each year, your sponosred users must renew their account. You will need to approve the renewal by logging into the [NYU Identity Management service][nyu ims link]. We have a [walkthrogh of the approval process here](./03_walkthrough_approve_hpc_account_request.md)
64
+
Each year, your sponsored users must renew their account. You will need to approve the renewal by logging into the [NYU Identity Management service][nyu ims link]. We have a [walkthrogh of the approval process here](./03_walkthrough_approve_hpc_account_request.md)
65
65
66
66
## Bulk HPC Accounts for Courses
67
67
@@ -95,7 +95,7 @@ If you are not part of a collaboration, your access to cluster will end together
95
95
96
96
:::note
97
97
98
-
In order to request a new HPC account or renew an expired one, you need to be connected to the NYU VPN if you are working remotely, Please see [instructions on how to install and use the NYU VPN][nyu vpn link]. Linux clients are not officially supported, however we were able to successfully use openVPN client. Here are installation and connection instructions for a debian linux distribution with apt pacakge manager:
98
+
In order to request a new HPC account or renew an expired one, you need to be connected to the NYU VPN if you are working remotely, Please see [instructions on how to install and use the NYU VPN][nyu vpn link]. Linux clients are not officially supported, however we were able to successfully use openVPN client. Here are installation and connection instructions for a debian linux distribution with apt package manager:
After clicking "+Request Resource Allocation", you'll see a list of resources you can request for.
14
-
Basically, you'll see general "Univeristy HPC" which is default for all and school-wise resources (e.g., Tandon(Genric)) based on a school that your project belongs to.
15
-
Please select a resource and fill in justification to complete the allocation reqeust process.
Basically, you'll see general "University HPC" which is default for all and school-wise resources (e.g., Tandon(Generic)) based on a school that your project belongs to.
15
+
Please select a resource and fill in justification to complete the allocation request process.
Copy file name to clipboardExpand all lines: docs/hpc/03_storage/01_intro_and_data_management.mdx
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -77,7 +77,7 @@ There are **No Back ups of the scratch file system.** ***Files that were deleted
77
77
:::
78
78
79
79
#### HPC Vast
80
-
The HPC Vast all-flash file system is the HPC file system where users store research data needed during the analysis phase of their research projects, particuarly for high I/O data that can bottleneck on the scratch file system. The Vast file system provides ***temporary*** storage for datasets needed for running jobs.
80
+
The HPC Vast all-flash file system is the HPC file system where users store research data needed during the analysis phase of their research projects, particularly for high I/O data that can bottleneck on the scratch file system. The Vast file system provides ***temporary*** storage for datasets needed for running jobs.
81
81
82
82
Files stored in the HPC vast file system are subject to the <ins>***HPC Vast old file purging policy:*** Files on the `/vast` file system that have not been accessed for **60 or more days** will be purged.</ins>
Copy file name to clipboardExpand all lines: docs/hpc/03_storage/03_data_transfers.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -24,7 +24,7 @@ logout
24
24
### Linux & Mac Tools
25
25
#### scp and rsync
26
26
:::warning
27
-
Please use Data Transfer Nodes (DTNs) with these tools. While one can transfer data while on login nodes, it is considered a bad practice because it can degrate the node's performance.
27
+
Please use Data Transfer Nodes (DTNs) with these tools. While one can transfer data while on login nodes, it is considered a bad practice because it can degrade the node's performance.
28
28
:::
29
29
30
30
Sometimes these two tools are convenient for transferring small files. Using the DTNs does not require to set up an SSH tunnel; use the hostname `dtn.hpc.nyu.edu` for one-step copying. See below for examples of commands invoked on the command line on a laptop running a Unix-like operating system:
Copy file name to clipboardExpand all lines: docs/hpc/03_storage/08_transferring_cloud_storage_data_with_rclone.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -344,7 +344,7 @@ Please enter 'q' and we're done with configuration.
344
344
345
345
### Step 4: Transfer
346
346
:::warning
347
-
Please be sure to perform data transters on a data transfer node (DTN). It can degrade performacefor other users to perform transfers on other types of nodes. For more information please see [Data Transfers](./03_data_transfers.md)
347
+
Please be sure to perform data transters on a data transfer node (DTN). It can degrade performancefor other users to perform transfers on other types of nodes. For more information please see [Data Transfers](./03_data_transfers.md)
Copy file name to clipboardExpand all lines: docs/hpc/05_submitting_jobs/01_slurm_submitting_jobs.md
+3-3Lines changed: 3 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -312,7 +312,7 @@ or as a command-line option to sbatch when you submit the job:
312
312
### Options for running many similar jobs
313
313
314
314
-`-a, --array=indexes`
315
-
- Submit an array of jobs with array ids as specified. Array ids can be specified as a numerical range, a comma-seperated list of numbers, or as some combination of the two. Each job instance will have an environment variable `SLURM_ARRAY_JOB_ID` and `SLURM_ARRAY_TASK_ID`. For example:
315
+
- Submit an array of jobs with array ids as specified. Array ids can be specified as a numerical range, a comma-separated list of numbers, or as some combination of the two. Each job instance will have an environment variable `SLURM_ARRAY_JOB_ID` and `SLURM_ARRAY_TASK_ID`. For example:
316
316
-`--array=1-11`, to start an array job with index from 1 to 11
317
317
-`--array=1-7:2`, to submit an array job with index step size 2
318
318
-`--array=1-9%4`, to submit an array job with simultaneously running job elements set to 4
@@ -489,7 +489,7 @@ cd /scratch/$USER/myambertest
489
489
pmemd.cuda -O
490
490
```
491
491
492
-
The demo Amber job should take ~2 minutes to finish once it starts runnning. When the job is done, several output files are generated. Check the one named "mdout", which has a section most relevant here:
492
+
The demo Amber job should take ~2 minutes to finish once it starts running. When the job is done, several output files are generated. Check the one named "mdout", which has a section most relevant here:
493
493
494
494
```sh
495
495
|--------------------- INFORMATION ----------------------
@@ -599,7 +599,7 @@ If you do not request resources you will get the default settings. If after some
599
599
- Enable X forwarding, so programs using a GUI can be used during the session (provided you have X forwarding to your workstation set up)
600
600
- To leave an interactive batch session, type `exit` at the command prompt
601
601
602
-
Certain tasks need user iteraction - such as debugging and some GUI-based applications. However the HPC clusters rely on batch job scheduling to efficiently allocate resources. Interactive batch jobs allow these apparently conflicting requirements to be met.
602
+
Certain tasks need user interaction - such as debugging and some GUI-based applications. However the HPC clusters rely on batch job scheduling to efficiently allocate resources. Interactive batch jobs allow these apparently conflicting requirements to be met.
0 commit comments