You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -915,6 +922,8 @@ From [K8s cluster components](https://kubernetes.io/docs/concepts/architecture/)
915
922
916
923
</details>
917
924
925
+
926
+
918
927
### Energy & Sustainability Milestones (1992-present)
919
928
920
929
> Energy efficiency evolved from component-level to system-level to facility-level approaches:
@@ -927,72 +936,102 @@ From [K8s cluster components](https://kubernetes.io/docs/concepts/architecture/)
927
936
<details>
928
937
<summary><b>ENERGY STAR Program (1992)</b></summary>
929
938
930
-
-**Administrator**: U.S. Environmental Protection Agency
939
+
- Administrator: U.S. EPA (Environmental Protection Agency) with U.S. DOE (Department of Energy) participation.
931
940
-**Technical focus**: Energy consumption standards for computers, monitors
932
941
-**Measurement methodology**: Standardized power consumption testing
933
942
-**Impact**: Created baseline efficiency metrics for IT equipment
943
+
- Scope evolution:
944
+
- Started with monitors/computers; later added servers, storage, networking gear, and data centers.
945
+
- Labeling indicates products meeting energy efficiency criteria under standardized tests.
946
+
- Measurement methodology
947
+
- TEC (Typical Energy Consumption) models: integrates active/idle/sleep/off over a duty cycle to yield kWh/year.
948
+
- Server testing leverages SPEC SERT (Standard Performance Evaluation Corporation—Server Efficiency Rating Tool) to couple performance with power.
949
+
- Power states and sleep: wake latency and user experience constraints included in eligibility criteria.
950
+
- Technical levers: Display power management (DPMS—Display Power Management Signaling), low‑power SoCs (Systems on Chip), aggressive idle states (C‑states) and frequency scaling (DVFS—Dynamic Voltage and Frequency Scaling).
951
+
- Impact: Baselines for procurement; nudged component vendors toward better idle efficiency and automatic sleep policies.
934
952
935
953
</details>
936
954
937
955
938
956
<details>
939
957
<summary><b>80 PLUS (2004)</b></summary>
940
958
941
-
-**Focus**: Power Supply Unit efficiency certification
942
-
-**Technical standards**: Efficiency targets at different load levels (20%, 50%, 100%)
-**Significance**: Reduced energy waste in power conversion
959
+
- Focus: PSU (Power Supply Unit) conversion efficiency and PF (Power Factor) at defined loads.
960
+
- Technical standards:
961
+
- Tests at 115 V/230 V AC; load points typically 20%/50%/100% (Titanium adds 10%).
962
+
- Targets include minimum efficiency and PF ≥ 0.9 at 100% load (varies by tier/voltage).
963
+
- Tiers: Standard → Bronze → Silver → Gold → Platinum → Titanium (tightening efficiency, especially at low load).
964
+
- Design implications:
965
+
- Topologies: active PFC (Power Factor Correction), LLC (Inductor‑Inductor‑Capacitor) resonant converters, synchronous rectification to cut switching/conduction losses.
966
+
- Better efficiency curves reduce waste heat, fan speeds, and upstream cooling demand.
967
+
- Ops note: Right‑sizing PSUs and running near the 30–60% `efficiency knee` avoids low‑load penalties; DC bus (e.g., 48 V) at rack scale reduces conversion stages.
-**Technical focus**: Environmental specifications for datacenters
974
+
- Technical focus: inlet environmental specs for IT gear; psychrometric (temperature/humidity) operating envelopes.
953
975
-**Key innovation**: Standardized temperature and humidity ranges
954
-
-**Classes**: A1-A4 with different allowable ranges
976
+
-**Classes**: Recommended vs Allowable ranges; IT classes A1–A4 define maximum inlet temperatures and humidity/dew‑point limits (A3/A4 allow warmer/drier air).
- Enables economization (using outdoor air) and higher supply‑air setpoints, cutting chiller/CRAC (Computer Room Air Conditioner)/CRAH (Computer Room Air Handler) run time.
979
+
- Emphasizes airflow management (containment, blanking panels, leakage control) to keep inlets within spec.
956
980
957
981
</details>
958
982
959
983
<details>
960
984
<summary><b>PUE Defined by The Green Grid (2007)</b></summary>
961
985
962
986
-**Formula**: Total Facility Energy ÷ IT Equipment Energy
963
-
-**Ideal value**: 1.0 (all energy goes to computing)
964
-
-**Industry evolution**: Average PUE improved from ~2.0 to ~1.2 in hyperscale facilities
965
-
-**Limitations**: Doesn't measure computational efficiency, only facility overhead
966
987
988
+
PUE = Total Facility Energy ÷ IT Equipment Energy; DCiE (Data Center infrastructure Efficiency) = 1 ÷ PUE
989
+
990
+
-**Ideal value**: 1.0 (all energy goes to computing):
991
+
- Clarifies measurement boundaries and intervals; pPUE (partial PUE) for modules/containers.
992
+
- Related metrics: WUE (Water Usage Effectiveness), ERE (Energy Reuse Effectiveness) for heat reuse credit.
993
+
-**Industry evolution**: Hyperscalers drove PUE from ~2.0 toward ~1.1–1.2 via economization, UPS (Uninterruptible Power Supply) optimization, and distribution efficiency.
994
+
-**Limitations**:PUE captures facility overhead, not computational efficiency (e.g., work per joule at the server/application layer).
-**Key contributions**: Simplified chassis, higher efficiency power systems, rack-scale designs
975
-
-**Impact**: Standardized efficient designs across industry
1006
+
-**Impact**: Supply‑chain standardization; higher PSU and VRM (Voltage Regulator Module) efficiency; faster service times and better airflow containment.
1007
+
976
1008
977
1009
</details>
978
1010
979
1011
<details>
980
1012
<summary><b>ASHRAE Widened Temperature Ranges (2015)</b></summary>
981
1013
982
-
-**Technical change**: Expanded recommended and allowable temperature ranges
983
-
-**Impact**: Reduced cooling requirements, enabled more free cooling hours
1014
+
-**Technical change**: Expanded Recommended/Allowable envelopes (A1–A4) enabling higher server inlet temperatures; guidance on humidity/dew point to manage corrosion/ESD (Electrostatic Discharge).
1015
+
-**Impact**: Reduced cooling requirements, enabled more free cooling hours:
1016
+
- More free‑cooling hours; reduced chiller hours; variable‑speed fans and setpoint optimization.
1017
+
- Typical rule‑of‑thumb: a ~1 °C increase in supply air can save a few percent of cooling energy (facility‑dependent).
984
1018
-**Classes**: New A1-A4 classes with wider ranges for different equipment types
985
1019
-**Energy savings**: Up to 4% energy reduction per 1°C increase in setpoint
1020
+
- Risk management: Component derating and reliability modeling vs temperature (e.g., capacitor life, HDD: Hard Disk Drive error rates) to set safe upper limits.
986
1021
987
1022
</details>
988
1023
989
1024
<details>
990
1025
<summary><b>NVMe-oF 1.0 (2016)</b></summary>
991
1026
992
-
-**Technical innovation**: Extended NVMe over network fabrics (RDMA, FC, TCP)
1027
+
-**Technical innovation**: Extends NVMe queue model over networks using transports: RDMA (Remote Direct Memory Access: RoCE/iWARP), FC (Fibre Channel), and TCP (Transmission Control Protocol).
1028
+
- Preserves submission/completion queues, doorbells, and MSI‑X (Message Signaled Interrupts eXtended) semantics for low‑latency I/O (Input/Output).
993
1029
-**Energy efficiency**: Reduced CPU overhead for storage operations
994
-
-**Performance**: Lower latency and higher IOPS per watt
995
-
-**Impact**: Enabled disaggregation of storage resources
1030
+
-**Performance**: Lower latency and higher IOPS per watt.
1031
+
- Lower CPU cycles per I/O and microsecond‑level latencies; higher IOPS (Input/Output Operations per Second) per watt vs legacy stacks.
1032
+
-**Impact**: Enabled disaggregation of storage resources.
1033
+
- JBOF (Just a Bunch Of Flash) and disaggregated storage pools
1034
+
- Flexible scaling and better media utilization across hosts.
996
1035
997
1036
</details>
998
1037
@@ -1004,6 +1043,27 @@ From [K8s cluster components](https://kubernetes.io/docs/concepts/architecture/)
1004
1043
-**Implementation**: APIs for carbon intensity, scheduler plugins, policy engines
1005
1044
-**Impact**: Shifting flexible workloads to times of abundant renewable energy
1006
1045
1046
+
- Signals and metrics:
1047
+
- Carbon intensity (gCO₂/kWh: grams CO₂ per kilowatt-hour): average vs marginal; forecasts (24–72h) and real-time feeds by region/zone.
1048
+
- Combine with PUE (Power Usage Effectiveness), WUE (Water Usage Effectiveness), and energy price to form a multi-objective score.
1049
+
- Workload classes:
1050
+
- Deferrable (batch/ETL/ML training), geo-movable (stateless jobs), and latency-critical (pin, optimize rather than move).
1051
+
- SLOs (Service Level Objectives) define what can be delayed or relocated without breaching latency/availability.
1052
+
- Scheduling patterns:
1053
+
- Time shifting: cron windows aligned to low-carbon forecast bands; backoff and catch-up windows.
1054
+
- Geo shifting: weighted queues or schedulers prefer cleaner regions; fallbacks when capacity or data-residency blocks moves.
1055
+
- Admission control: policy engines gate non-urgent starts when carbon score is high.
1056
+
- Platform integration:
1057
+
- Kubernetes: custom scheduler plugins, HPA (Horizontal Pod Autoscaler) with external carbon metrics, KEDA (Kubernetes-based Event-Driven Autoscaling) pause/resume.
1058
+
- Queues: priority tiers (critical/standard/background) with carbon-aware dispatchers.
1059
+
- CI/CD: carbon windows for non-urgent build/test; artifact promotion decoupled from deploy.
1060
+
- Data sources and governance:
1061
+
- Grid APIs (forecast + realtime), internal telemetry, and cloud region signals; cache and smooth noisy signals.
1062
+
- Budgets: kgCO₂e (kilograms CO₂-equivalent) per team/service; dashboards and alerts; exception process for critical incidents.
1063
+
- Risks and safeguards:
1064
+
- Don’t violate data residency/compliance; validate capacity and failover paths.
1065
+
- Avoid “rebound” effects (extra retries); measure net CO₂e delta versus baseline.
1066
+
1007
1067
</details>
1008
1068
1009
1069
<details>
@@ -1014,6 +1074,26 @@ From [K8s cluster components](https://kubernetes.io/docs/concepts/architecture/)
1014
1074
-**Benefits**: Higher efficiency, enables >100kW per rack densities
1015
1075
-**Adoption**: From niche HPC to mainstream in hyperscale facilities
1016
1076
1077
+
- Terminology:
1078
+
- TDP (Thermal Design Power): sustained heat the cooling solution must remove.
1079
+
- AI (Artificial Intelligence), HPC (High-Performance Computing).
- Supply/return loops (secondary fluid) with 30–45°C supply enable chiller-less or elevated setpoints; plate heat exchangers to primary loop.
1083
+
- Immersion cooling:
1084
+
- Single-phase: dielectric oils; heat removed via circulation through heat exchangers.
1085
+
- Two-phase: fluorinated fluids boil on hotspots; vapor condenses on a coil: very high heat flux handling; manage fluid loss and GWP (Global Warming Potential).
1086
+
- Assisted air/liquid hybrids: RDHx (Rear-Door Heat Exchanger) for retrofit; reduces hot-aisle temps and fan power without full DLC/immersion conversion.
1087
+
- Facility integration and operations
1088
+
- Leak detection, fluid monitoring (particulates, moisture), material compatibility (seals, plastics), and service workflows (lift/tilt fixtures).
1089
+
- Heat reuse: recover low-grade heat for district heating; improves site ERE (Energy Reuse Effectiveness).
1090
+
- Performance and efficiency:
1091
+
- Lower fan power, improved component temps, better turbo headroom; PUE (Power Usage Effectiveness) and WUE (Water Usage Effectiveness) improvements possible.
0 commit comments