Merge pull request #386 from cleophass/AvoidNonPinnedMemoryForDataloaders

dedece35 · web-flow · commit 72376b606e81 · 2025-07-18T23:34:58.000+02:00
GCI102 AvoidNonPinnedMemoryForDataloaders #Python #DLG #RulesSpecifications
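
For reviewers, here is a minimal sketch of how a checker might detect the pattern this rule describes. It is not the creedengo implementation: the function name `find_unpinned_dataloaders` and the `SAMPLE` snippet are hypothetical, and it uses only Python's standard `ast` module. It assumes that omitting `pin_memory` counts as non-compliant (PyTorch defaults it to `False`), which the specification itself does not state. Positional `pin_memory` arguments are not handled.

```python
import ast
from typing import List


def find_unpinned_dataloaders(source: str) -> List[int]:
    """Return line numbers of DataLoader(...) calls lacking pin_memory=True.

    Hypothetical helper for illustration only.
    """
    findings = []
    for node in ast.walk(ast.parse(source)):
        if not isinstance(node, ast.Call):
            continue
        func = node.func
        # Match both `DataLoader(...)` and `torch.utils.data.DataLoader(...)`.
        name = func.attr if isinstance(func, ast.Attribute) else getattr(func, "id", None)
        if name != "DataLoader":
            continue
        # `**kwargs` may carry pin_memory; skip to avoid false positives.
        if any(kw.arg is None for kw in node.keywords):
            continue
        pin = next((kw.value for kw in node.keywords if kw.arg == "pin_memory"), None)
        # A non-literal value (e.g. a variable) cannot be decided statically.
        if pin is not None and not isinstance(pin, ast.Constant):
            continue
        # Compliant only when the literal True is passed.
        if isinstance(pin, ast.Constant) and pin.value is True:
            continue
        findings.append(node.lineno)
    return sorted(findings)


SAMPLE = """\
from torch.utils.data import DataLoader

a = DataLoader(dataset, batch_size=64, pin_memory=False)
b = DataLoader(dataset, batch_size=64)
c = DataLoader(dataset, batch_size=64, pin_memory=True)
"""

print(find_unpinned_dataloaders(SAMPLE))  # prints [3, 4]
```

The sketch only parses source text, so it runs without PyTorch installed. A real rule would probably also need to resolve import aliases.
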
diff --git a/CHANGELOG.md b/CHANGELOG.md
@@ -9,6 +9,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
 
 ### Added
 
+- [#386](https://github.com/green-code-initiative/creedengo-rules-specifications/pull/386) Add rule GCI102, recommending the use of pinned memory for the dataloader when transferring data from the CPU to the GPU.
 - [#385](https://github.com/green-code-initiative/creedengo-rules-specifications/pull/385) Added documentation for the rule : disables bias in convolutional layers preceding Batch Normalization.
 - [#384](https://github.com/green-code-initiative/creedengo-rules-specifications/pull/384) Add specifications for rule GCI100, this rule is specific to Python because it's based on the `PyTorch` library, a library used for Deep Learning.
 - [#379](https://github.com/green-code-initiative/creedengo-rules-specifications/pull/379) Add rule GCI99 Avoid CSV Format, this rule is designed for Python but it can be implemented in other languages. The rule suggests using more efficient formats like Feather or Parquet instead of CSV.
diff --git a/RULES.md b/RULES.md
@@ -78,6 +78,7 @@ Some are applicable for different technologies.
 | GCI99    | Data: Avoid CSV Format                                            | The Parquet format is faster to write to, lighter in weight and faster to read data from. It is suitable for use cases where there would be a lot of data I/O, especially with Cloud storage.                                                                                                                                                                                                                                 |                                                                                                                                                                                                                                               | 🚀   | 🚀  | 🚀 | 🚀     | 🚀   | 🚀 | 🚀   |
 | GCI100   | Wrap PyTorch Inference in `torch.no_grad()`                       | Using a PyTorch model in evaluation mode without wrapping inference in `torch.no_grad()` leads to unnecessary gradient tracking                                                                                                                                                                                                                                                                                               |                                                                                                                                                                                                                                               | 🚫   | 🚫  | 🚫 | 🚀     | 🚫   | 🚫 | 🚫   |
 | GCI101   | AI: Avoid Bias in Conv Layers Before Batch Norm                   | Disable bias in convolutional layers when it's followed by a batch norm layer                                                                                                                                                                                                                                                                                                                                                 |                                                                                                                                                                                                                                               | 🚫   | 🚫  | 🚫 | 🚀     | 🚫   | 🚫 | 🚫   |
+| GCI102   | Use pinned memory on DataLoader when using GPU                               | This rule applies to PyTorch data loading, where the use of pinned memory can significantly optimize data transfer between CPU and GPU.                                                                                                                                                                                                                                                                                                                                                          |                                                                                                                                                                         | 🚫 | 🚫     | 🚫   | ✅ | 🚫     | 🚫   | 🚫    |
 | GCI203   | Detect unoptimized file formats                                   | When it is possible, to use svg format image over other image format                                                                                                                                                                                                                                                                                                                                                          |                                                                                                                                                                                                                                               | 🚧   | 🚀  | 🚀 | ✅      | 🚀   | 🚀 | 🚫   |
 | GCI404   | Avoid list comprehension in iterations                            | Use generator comprehension instead of list comprehension in for loop declaration                                                                                                                                                                                                                                                                                                                                             |                                                                                                                                                                                                                                               | 🚫   | 🚫  | 🚫 | ✅      | 🚫   | 🚫 | 🚫   |
 | GCI522   | Sobriety: Brightness Override                                     | To avoid draining the battery, iOS and Android devices adapt the brightness of the screen depending on the environment light.                                                                                                                                                                                                                                                                                                 |                                                                                                                                                                                                                                               | 🚫   | 🚫  | ✅  | 🚫     | 🚫   | 🚫 | 🚫   |
diff --git a/src/main/rules/GCI102/GCI102.json b/src/main/rules/GCI102/GCI102.json
@@ -0,0 +1,18 @@
+{
+  "title": "AI Use pinned memory on DataLoader when using GPU",
+  "type": "CODE_SMELL",
+  "status": "ready",
+  "remediation": {
+    "func": "Constant/Issue",
+    "constantCost": "10min"
+  },
+  "tags": [
+    "creedengo",
+    "eco-design",
+    "performance",
+    "memory",
+    "ai",
+    "pytorch"
+  ],
+  "defaultSeverity": "Minor"
+}
diff --git a/src/main/rules/GCI102/python/GCI102.asciidoc b/src/main/rules/GCI102/python/GCI102.asciidoc
@@ -0,0 +1,78 @@
+This rule applies to PyTorch `DataLoader` instances that feed a GPU: enabling pinned (page-locked) host memory with `pin_memory=True` can speed up data transfer from CPU to GPU.
+
+== Non Compliant Code Example
+
+[source,python]
+----
+import torch
+
+# `dataset` is assumed to be an existing torch.utils.data.Dataset
+train_loader = torch.utils.data.DataLoader(
+    dataset,
+    batch_size=64,
+    shuffle=True,
+    pin_memory=False  # Not using pinned memory
+)
+----
+
+In this example, the DataLoader does not use pinned memory, which leads to slower host-to-device data transfers.
+
+== Compliant Solution
+
+[source,python]
+----
+import torch
+
+# `dataset` is assumed to be an existing torch.utils.data.Dataset
+train_loader = torch.utils.data.DataLoader(
+    dataset,
+    batch_size=64,
+    shuffle=True,
+    pin_memory=True  # Enables faster transfer to GPU
+)
+----
+
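+When `pin_memory=True`, the DataLoader places each batch in page-locked (pinned) host memory, which the GPU can read directly via DMA (Direct Memory Access) instead of going through an intermediate staging copy. This makes host-to-device transfers faster.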
+
+== Relevance Analysis
+
+Experiments were conducted to evaluate the performance and environmental impact of using pinned memory in DataLoaders.
+
+=== Configuration
+
+* Processor: Intel(R) Xeon(R) CPU 3.80GHz
+* RAM: 64GB
+* GPU: NVIDIA Quadro RTX 6000
+* CO₂ Emissions Measurement: https://mlco2.github.io/codecarbon/[CodeCarbon]
+* Framework: PyTorch
+
+=== Context
+
+Two training configurations were compared:
+
+- One using standard memory allocation (`pin_memory=False`)
+- One using pinned memory (`pin_memory=True`)
+
+Metrics assessed:
+
+- Average batch processing time
+- Total training time
+- CO₂ emissions
+
+=== Impact Analysis
+
+image::image.png[]
+
+image::results.png[]
+
+- **Batch Processing Time:** Reduced from 0.0472s to 0.0378s (~20% improvement).
+- **Training Time:** Decreased by 9.82% when using pinned memory.
+- **Carbon Emissions:** Lowered by 7.56%, indicating a measurable environmental benefit.
+
+The improvements observed are particularly significant in large-scale or long-running training tasks, where data transfer becomes a bottleneck.
+
+== Conclusion
+
+Enabling pinned memory in PyTorch DataLoaders:
+
+- Reduces average batch processing time (about 20% in this experiment)
+- Shortens total training duration (9.82% in this experiment)
+- Lowers CO₂ emissions (7.56% in this experiment)
+- Is a recommended best practice for GPU-accelerated training
+
+== References
+
+Credit: https://github.com/AghilesAzzoug/GreenPyData
+
+- PyTorch `torch.utils.data` documentation: https://pytorch.org/docs/stable/data.html
+- NVIDIA CUDA Documentation on Pinned Memory: https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#page-locked-host-memory
diff --git a/src/main/rules/GCI102/python/image.png b/src/main/rules/GCI102/python/image.png
diff --git a/src/main/rules/GCI102/python/results.png b/src/main/rules/GCI102/python/results.png
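
As a quick sanity check on the batch-time figure quoted in `GCI102.asciidoc`, the relative reduction can be recomputed from the two timings the document reports (0.0472s and 0.0378s). The helper name `relative_reduction` is hypothetical and used only for this illustration.

```python
def relative_reduction(before: float, after: float) -> float:
    """Percentage reduction when going from `before` to `after`."""
    return (before - after) / before * 100.0


# Average batch processing times reported in the rule's Impact Analysis.
batch_reduction = relative_reduction(0.0472, 0.0378)
print(f"{batch_reduction:.1f}%")  # prints 19.9%
```

The result agrees with the "~20% improvement" stated in the specification.
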