Skip to content

Commit 64f84c9

Browse files
committed
update documentation
1 parent 4f8e266 commit 64f84c9

File tree

2 files changed

+29
-0
lines changed

2 files changed

+29
-0
lines changed

README.md

Lines changed: 25 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -3,6 +3,31 @@
33
This repository is a place to contain the tools developed over the course of the DS4CG 2025 summer
44
internship project with Unity.
55

6+
## DS4CG Job Analytics
7+
8+
9+
DS4CG Job Analytics is a data analytics and reporting platform developed during the DS4CG 2025 summer internship with Unity. It provides tools for analyzing HPC job data, generating interactive reports, and visualizing resource usage and efficiency.
10+
11+
## Motivation
12+
High-performance GPUs are a critical resource on shared clusters, but they are often underutilized due to inefficient job scheduling, over-allocation, or lack of user awareness. Many jobs request more GPU memory or compute than they actually use, leading to wasted resources and longer queue times for others. This project aims to address these issues by providing analytics and reporting tools that help users and administrators understand GPU usage patterns, identify inefficiencies, and make data-driven decisions to improve overall cluster utilization.
13+
14+
## Project Overview
15+
This project includes:
16+
- Python scripts and modules for data preprocessing, analysis, and report generation
17+
- Jupyter notebooks for interactive exploration and visualization
18+
- Automated report generation scripts (see the `feature/reports` branch for the latest versions)
19+
- Documentation built with MkDocs and Quarto
20+
21+
## Example Notebooks
22+
The following notebooks demonstrate key analyses and visualizations:
23+
24+
- `notebooks/Basic Visualization.ipynb`: Basic plots and metrics
25+
- `notebooks/Efficiency Analysis.ipynb`: Efficiency metrics and user comparisons
26+
- `notebooks/Resource Hoarding.ipynb`: Analysis of resource hoarding
27+
- `notebooks/SlurmGPU.ipynb`: GPU job analysis
28+
29+
See the `notebooks/` directory for more examples.
30+
631
## Contributing to this repository
732

833
The following guidelines may prove helpful in maximizing the utility of this repository:

docs/about.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,9 +1,13 @@
11
# About DS4CG Unity Job Analytics
22

3+
34
## Project Background
45

56
The DS4CG Unity Job Analytics project was initiated as part of the DS4CG 2025 summer internship program in collaboration with the Unity HPC cluster at UMass. The goal is to provide robust tools and documentation for analyzing job data, improving resource utilization, and supporting research and operations on the Unity cluster.
67

8+
### Motivation
9+
High-performance GPUs are a critical resource on shared clusters, but they are often underutilized due to inefficient job scheduling, over-allocation, or lack of user awareness. Many jobs request more GPU memory or compute than they actually use, leading to wasted resources and longer queue times for others. This project aims to address these issues by providing analytics and reporting tools that help users and administrators understand GPU usage patterns, identify inefficiencies, and make data-driven decisions to improve overall cluster utilization.
10+
711
## Team & Contributors
812
- Project Lead: Benjamin Pachev
913
- Contributors: DS4CG Summer 2025 Internship Team

0 commit comments

Comments
 (0)