green-code-initiative
diff --git a/‎CHANGELOG.md‎
Lines changed: 9 additions & 3 deletions b/‎CHANGELOG.md‎
diff --git a/‎RULES.md‎
Lines changed: 13 additions & 12 deletions b/‎RULES.md‎
diff --git a/‎src/main/rules/GCI100/GCI100.json‎
Lines changed: 1 addition & 1 deletion b/‎src/main/rules/GCI100/GCI100.json‎
diff --git a/‎src/main/rules/GCI104/GCI104.json‎
Lines changed: 1 addition & 1 deletion b/‎src/main/rules/GCI104/GCI104.json‎
diff --git a/‎src/main/rules/GCI107/GCI107.json‎
Lines changed: 21 additions & 0 deletions b/‎src/main/rules/GCI107/GCI107.json‎
diff --git a/‎src/main/rules/GCI107/python/GCI107.asciidoc‎
Lines changed: 119 additions & 0 deletions b/‎src/main/rules/GCI107/python/GCI107.asciidoc‎
diff --git a/‎src/main/rules/GCI107/python/dot.png‎
31.5 KB b/‎src/main/rules/GCI107/python/dot.png‎
diff --git a/‎src/main/rules/GCI107/python/matrix.png‎
31 KB b/‎src/main/rules/GCI107/python/matrix.png‎
diff --git a/‎src/main/rules/GCI107/python/outer.png‎
26.7 KB b/‎src/main/rules/GCI107/python/outer.png‎
diff --git a/‎src/main/rules/GCI96/python/GCI96.asciidoc‎
Lines changed: 2 additions & 1 deletion b/‎src/main/rules/GCI96/python/GCI96.asciidoc‎
@@ -10,16 +10,21 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
 ### Added
 
 - [#381](https://github.com/green-code-initiative/creedengo-rules-specifications/pull/381) Add rule GCI 108 Prefer Append Left
-- [#400](https://github.com/green-code-initiative/creedengo-rules-specifications/pull/400) Add rule GCI535 - Prefer usage of Intl.NumberFormat
+- [#380](https://github.com/green-code-initiative/creedengo-rules-specifications/pull/380) Added rule GCI107 : DATA - Avoid Iterative Matrix Operations
 
 ### Changed
 
 ### Deleted
 
+## [2.4.1] - 2025-07-24
+
+### Changed
+
+- [#417](https://github.com/green-code-initiative/creedengo-rules-specifications/pull/417) Fixed a typo in the tags of rules GCI100 and GCI104.
+
 ## [2.4.0] - 2025-07-20
 
 ### Added
-
 - [#390](https://github.com/green-code-initiative/creedengo-rules-specifications/pull/390) Added rule GCI106 : Detect scalar sqrt usage in loops and suggest vectorized alternatives
 - [#389](https://github.com/green-code-initiative/creedengo-rules-specifications/pull/389) Add rule GCI105, Add a rule on Python String Concatenation
 - [#388](https://github.com/green-code-initiative/creedengo-rules-specifications/pull/388) Added rule GCI104 on Torch Tensor types
@@ -444,7 +449,8 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
 
 ## Comparison List
 
-[unreleased](https://github.com/green-code-initiative/creedengo-rules-specifications/compare/2.4.0...HEAD)
+[unreleased](https://github.com/green-code-initiative/creedengo-rules-specifications/compare/2.4.1...HEAD)
+[2.4.1](https://github.com/green-code-initiative/creedengo-rules-specifications/compare/2.4.0...2.4.1)
 [2.4.0](https://github.com/green-code-initiative/creedengo-rules-specifications/compare/2.3.0...2.4.0)
 [2.3.0](https://github.com/green-code-initiative/creedengo-rules-specifications/compare/2.2.3...2.3.0)
 [2.2.3](https://github.com/green-code-initiative/creedengo-rules-specifications/compare/2.2.2...2.2.3)
 
@@ -12,7 +12,7 @@
     "performance",
     "memory",
     "ai",
-    "PyTorch"
+    "pytorch"
   ],
   "defaultSeverity": "Minor"
 }
@@ -11,7 +11,7 @@
     "eco-design",
     "performance",
     "ai",
-    "PyTorch"
+    "pytorch"
   ],
   "defaultSeverity": "Minor"
 }
@@ -0,0 +1,21 @@
+{
+    "title": "DATA : Avoid Iterative Matrix Operations",
+    "type": "CODE_SMELL",
+    "status": "ready",
+    "remediation": {
+      "func": "Constant\/Issue",
+      "constantCost": "10min"
+    },
+    "tags": [
+      "creedengo",
+      "eco-design",
+      "performance",
+      "data",
+      "ai",
+      "vector",
+      "pandas",
+      "numpy"
+    ],
+    "defaultSeverity": "Minor"
+}
+  
@@ -0,0 +1,119 @@
+Before going into more detail, it's important to understand how vectorization works in Python. When performing a calculation on an array or matrix, there are two main approaches:
+
+* The first is to go through the list and perform the calculation element by element, known as an iterative approach.
+* The second consists of applying the calculation to the entire array/matrix at once, which is known as vectorization.
+
+Strictly speaking, applying a calculation to every element at once is not always feasible without real parallelism (using a GPU, for example). In practice, we still speak of vectorization when we use the built-in functions of TensorFlow, NumPy or pandas.
+
+These functions still loop over the elements, but the loop is executed in compiled lower-level code (C) rather than in the Python interpreter. As with built-in functions in general, this low-level code is optimized, so execution is much faster and therefore emits less CO2.
+
+== Non compliant Code Example
+
+[source,python]
+----
+results = [[0 for _ in range(len(B[0]))] for _ in range(len(A))]
+
+
+for i in range(len(A)):
+    for j in range(len(B[0])):
+        for k in range(len(B)):
+            results[i][j] += A[i][k] * B[k][j]
+----
+
+== Compliant Solution
+
+[source,python]
+----
+import numpy as np  # NumPy, the Python library used here for array computing
+results = np.dot(A, B)
+----
+
+== Relevance Analysis
+
+The following results were obtained through local experiments.
+
+=== Configuration
+
+* Processor: Intel(R) Core(TM) Ultra 5 135U, 2100 MHz, 12 cores, 14 logical processors
+* RAM: 16 GB
+* CO2 Emissions Measurement: Using CodeCarbon
+
+=== Context
+
+This study is divided into 3 parts, each comparing a vectorized and an iterative method:
+
+* measuring the impact on a dot product between two vectors,
+* measuring the impact on an outer product between two vectors,
+* measuring the impact on a matrix product.
+
+=== Impact Analysis
+
+*1. Dot product:*
+
+*Non compliant*
+[source,python]
+----
+def iterative_dot_product(x,y):
+    total = 0
+    for i in range(len(x)):
+        total += x[i] * y[i]
+    return total
+----
+*Compliant* 
+[source,python]
+----
+def vectorized_dot_product(x,y):
+    return np.dot(x,y)
+----
+image::dot.png[]
+
+*2. Outer product:*
+
+*Non compliant*
+[source,python]
+----
+def iterative_outer_product(x, y):
+    o = np.zeros((len(x), len(y)))
+    for i in range(len(x)):
+        for j in range(len(y)):
+            o[i][j] = x[i] * y[j]
+    return o
+----
+*Compliant* 
+[source,python]
+----
+def vectorized_outer_product(x, y):
+    return np.outer(x, y)
+----
+image::outer.png[]
+
+*3. Matrix product:*
+
+*Non compliant*
+[source,python]
+----
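+def iterative_matrix_product(A, B):
+    # Start from a zero matrix: one row per row of A, one column per column of B
+    results = [[0 for _ in range(len(B[0]))] for _ in range(len(A))]
+    for i in range(len(A)):
+        for j in range(len(B[0])):
+            for k in range(len(B)):
+                results[i][j] += A[i][k] * B[k][j]
+    return results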
+----
+*Compliant* 
+[source,python]
+----
+def vectorized_matrix_product(A, B):
+    return np.dot(A, B)
+----
+image::matrix.png[]
+
+=== Conclusion
+
+In all three experiments, the results show that the vectorized method is significantly faster than the iterative method and emits less CO2. This is a clear example of how using built-in functions can lead to more efficient code, both in terms of performance and environmental impact.
+
+=== References
+
+https://sciresol.s3.us-east-2.amazonaws.com/IJST/Articles/2024/Issue-24/IJST-2024-914.pdf
+
+https://arxiv.org/pdf/2308.01269
+
+https://www.db-thueringen.de/servlets/MCRFileNodeServlet/dbt_derivate_00062165/ilm1-2024200012.pdf
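The matrix-product comparison described in the GCI107 rule above can be reproduced locally with a minimal sketch like the one below. It is not the benchmark used for the rule: it checks that the two approaches agree, then uses `timeit` wall-clock time as a rough stand-in for the CodeCarbon measurements. The 30x30 input size, the seed and the repetition count are assumptions chosen only to keep the run short.

```python
import timeit

import numpy as np


def iterative_matrix_product(A, B):
    """Triple-loop product, mirroring the non-compliant example."""
    results = [[0 for _ in range(len(B[0]))] for _ in range(len(A))]
    for i in range(len(A)):
        for j in range(len(B[0])):
            for k in range(len(B)):
                results[i][j] += A[i][k] * B[k][j]
    return results


def vectorized_matrix_product(A, B):
    """Single NumPy call, mirroring the compliant example."""
    return np.dot(A, B)


# Hypothetical inputs: small random matrices (size and seed are assumptions).
rng = np.random.default_rng(0)
A = rng.random((30, 30))
B = rng.random((30, 30))

# Both approaches must give the same result before their cost is compared.
assert np.allclose(iterative_matrix_product(A, B), vectorized_matrix_product(A, B))

# Wall-clock time over 5 runs each; a proxy only, not a CO2 measurement.
iterative_s = timeit.timeit(lambda: iterative_matrix_product(A, B), number=5)
vectorized_s = timeit.timeit(lambda: vectorized_matrix_product(A, B), number=5)
print(f"iterative: {iterative_s:.4f} s, vectorized: {vectorized_s:.4f} s")
```

Absolute timings depend on the machine, so no expected values are given here.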
@@ -37,12 +37,14 @@ Local experiments were conducted to assess the environmental impact of reading C
 === Context
 
 We generated CSV files with the following row sizes:
+
 * 1,000
 * 10,000
 * 100,000
 * 1,000,000
 
 Each file contains 5 columns (`A`, `B`, `C`, `D`, `E`). We measured the carbon emissions required to read:
+
 * 1 column
 * 2 columns
 * 3 columns
@@ -71,4 +73,3 @@ This is especially critical when working with large datasets or in environments
 == References
 https://pandas.pydata.org/docs/reference/api/pandas.read_csv.html
 https://medium.com/@amit25173/what-is-usecols-in-pandas-7a6a43885f4b
-
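For context on the GCI96 experiment above, reading only some of the five columns is typically done with the `usecols` parameter of `pandas.read_csv`. The sketch below is illustrative only: the in-memory CSV and its values are hypothetical stand-ins for the generated files, and it does not measure emissions.

```python
import io

import pandas as pd

# Hypothetical two-row CSV with the same column names as the generated files.
csv_text = "A,B,C,D,E\n1,2,3,4,5\n6,7,8,9,10\n"

# Read every column, then only the two that are actually needed.
full = pd.read_csv(io.StringIO(csv_text))
subset = pd.read_csv(io.StringIO(csv_text), usecols=["A", "C"])

print(list(full.columns))    # ['A', 'B', 'C', 'D', 'E']
print(list(subset.columns))  # ['A', 'C']
```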