|
| 1 | +--- |
| 2 | +title: Functional Safety for automotive software development |
| 3 | +weight: 2 |
| 4 | + |
| 5 | +### FIXED, DO NOT MODIFY |
| 6 | +layout: learningpathall |
| 7 | +--- |
| 8 | + |
| 9 | +## Why Functional Safety Matters in Automotive Software |
| 10 | + |
| 11 | +[Functional Safety](https://en.wikipedia.org/wiki/Functional_safety) refers to a system's ability to detect potential faults and respond appropriately to ensure that the system remains in a safe state, preventing harm to individuals or damage to equipment. |
| 12 | + |
| 13 | +This is particularly important in **automotive, autonomous driving, medical devices, industrial control, robotics and aerospace** applications, where system failures can lead to severe consequences. |
| 14 | + |
| 15 | +In software development, Functional Safety focuses on minimizing risks through **software design, testing, and validation** to ensure that critical systems operate in a predictable, reliable, and verifiable manner. This means developers must consider: |
| 16 | +- **Error detection mechanisms** |
| 17 | +- **Exception handling** |
| 18 | +- **Redundancy design** |
| 19 | +- **Development processes compliant with safety standards** |
| 20 | + |
| 21 | +### Definition and Importance of Functional Safety |
| 22 | + |
| 23 | +The core of Functional Safety lies in **risk management**, which aims to reduce the impact of system failures. |
| 24 | + |
| 25 | +In autonomous vehicles, Functional Safety ensures that if sensor data is incorrect, the system can enter a **safe state**, preventing incorrect driving decisions. |
| 26 | + |
| 27 | +The three core objectives of Functional Safety are: |
| 28 | +1. **Prevention** |
| 29 | + - Reducing the likelihood of errors through rigorous software development processes and testing. In the electric vehicle, the battery systems monitor temperature to prevent overheating. |
| 30 | +2. **Detection** |
| 31 | + - Quickly identifying errors using built-in diagnostic mechanisms (e.g., Built-in Self-Test, BIST). |
| 32 | +3. **Mitigation** |
| 33 | + - Controlling the impact of failures to ensure the overall safety of the system. |
| 34 | + |
| 35 | +This approach is critical in applications such as **autonomous driving, flight control, and medical implants**, where failures can result in **severe consequences**. |
| 36 | + |
| 37 | +### ISO 26262: Automotive Functional Safety Standard |
| 38 | + |
| 39 | +[ISO 26262](https://www.iso.org/standard/68383.html) is a functional safety standard specifically for **automotive electronics and software systems**. It defines a comprehensive [V-model](https://en.wikipedia.org/wiki/V-model) aligned safety lifecycle, covering all phases from **requirement analysis, design, development, testing, to maintenance**. |
| 40 | + |
| 41 | +Key Concepts of ISO 26262: |
| 42 | +- **ASIL (Automotive Safety Integrity Level)** |
| 43 | + - Evaluates the risk level of different system components (A, B, C, D, where **D represents the highest safety requirement**). |
| 44 | + - For example: ASIL A can be Dashboard light failure (low risk) and ASIL D is Brake system failure (high risk). |
| 45 | + https://en.wikipedia.org/wiki/Automotive_Safety_Integrity_Level |
| 46 | +- **HARA (Hazard Analysis and Risk Assessment)** |
| 47 | + - Analyzes hazards and assesses risks to determine necessary safety measures. |
| 48 | +- **Safety Mechanisms** |
| 49 | + - Includes real-time error detection, system-level fault tolerance, and defined fail-safe or fail-operational fallback states. |
| 50 | + |
| 51 | +Typical Application Scenarios: |
| 52 | +- **Autonomous Driving Systems**: |
| 53 | + - Ensures that even if sensors (e.g., LiDAR, radar, cameras) provide faulty data, the vehicle will not make dangerous decisions. |
| 54 | +- **Powertrain Control**: |
| 55 | + - Prevents braking system failures that could lead to loss of control. |
| 56 | +- **Battery Management System (BMS)**: |
| 57 | + - Prevents battery overheating or excessive discharge in electric vehicles. |
| 58 | + |
| 59 | +For more details, you can check this video: [What is Functional Safety?](https://www.youtube.com/watch?v=R0CPzfYHdpQ) |
| 60 | + |
| 61 | + |
| 62 | +### Common Use Cases of Functional Safety in Automotive |
| 63 | +- **Autonomous Driving**: |
| 64 | + - Ensures the vehicle can operate safely or enter a fail-safe state when sensors like LiDAR, radar, or cameras malfunction. |
| 65 | + - Functional Safety enables real-time fault detection and fallback logic to prevent unsafe driving decisions. |
| 66 | + |
| 67 | +- **Powertrain Control**: |
| 68 | + - Monitors throttle and brake signals to prevent unintended acceleration or braking loss. |
| 69 | + - Includes redundancy, plausibility checks, and emergency overrides to maintain control under failure conditions. |
| 70 | + |
| 71 | +- **Battery Management Systems (BMS)**: |
| 72 | + - Protects EV batteries from overheating, overcharging, or deep discharge. |
| 73 | + - Safety functions include temperature monitoring, voltage balancing, and relay cut-off mechanisms to prevent thermal runaway. |
| 74 | + |
| 75 | +These use cases highlight the need for a dedicated architectural layer that can enforce Functional Safety principles with real-time guarantees. |
| 76 | +A widely adopted approach in modern automotive platforms is the Safety Island—an isolated compute domain designed to execute critical control logic independently of the main system. |
| 77 | + |
| 78 | +### Safety Island: Enabling Functional Safety in Autonomous Systems |
| 79 | + |
| 80 | +In automotive systems, a **General ECU (Electronic Control Unit)** typically runs non-critical tasks such as infotainment or navigation, whereas a **Safety Island** is dedicated to executing safety-critical control logic (e.g., braking, steering) with strong isolation, redundancy, and determinism. |
| 81 | + |
| 82 | +The table below compares the characteristics of a General ECU and a Safety Island in terms of their role in supporting Functional Safety. |
| 83 | + |
| 84 | +| Feature | General ECU | Safety Island | |
| 85 | +|------------------------|----------------------------|--------------------------------------| |
| 86 | +| Purpose | Comfort / non-safety logic | Safety-critical decision making | |
| 87 | +| OS/Runtime | Linux, Android | RTOS, Hypervisor, or bare-metal | |
| 88 | +| Isolation | Soft partitioning | Hard isolation (hardware-enforced) | |
| 89 | +| Functional Safety Req | None to moderate | ISO 26262 ASIL-B to ASIL-D compliant | |
| 90 | +| Fault Handling | Best-effort recovery | Deterministic safe-state response | |
| 91 | + |
| 92 | +This contrast highlights why safety-focused software needs a dedicated hardware domain with certified execution behavior. |
| 93 | + |
| 94 | +**Safety Island** is an independent safety subsystem separate from the main processor. It is responsible for monitoring and managing system safety. If the main processor fails or becomes inoperable, Safety Island can take over critical safety functions such as **deceleration, stopping, and fault handling** to prevent catastrophic system failures. |
| 95 | + |
| 96 | +Key Capabilities of Safety Island |
| 97 | +- **System Health Monitoring** |
| 98 | + - Continuously monitors the operational status of the main processor (e.g., ADAS control unit, ECU) and detects potential errors or anomalies. |
| 99 | +- **Fault Detection and Isolation** |
| 100 | + - Independently evaluates and initiates emergency handling if the main processing unit encounters errors, overheating, computational failures, or unresponsiveness. |
| 101 | +- **Providing Essential Safety Functions** |
| 102 | + - Even if the main system crashes, Safety Island can still execute minimal safety operations, such as: |
| 103 | + - Autonomous Vehicles → Safe stopping (Fail-Safe Mode) |
| 104 | + - Industrial Equipment → Emergency power cutoff or speed reduction |
| 105 | + |
| 106 | + |
| 107 | +### Why Safety Island Matters for Functional Safety |
| 108 | + |
| 109 | +Safety Island plays a critical role in Functional Safety by ensuring that the system can handle high-risk scenarios and minimize catastrophic failures. |
| 110 | + |
| 111 | +How Safety Island Enhances Functional Safety |
| 112 | +1. **Acts as an Independent Redundant Safety Layer** |
| 113 | + - Even if the main system fails, it can still operate independently. |
| 114 | +2. **Supports ASIL-D Safety Level** |
| 115 | + - Monitors ECU health status and executes emergency safety strategies (e.g., emergency braking). |
| 116 | +3. **Provides Independent Fault Detection and Recovery Mechanisms** |
| 117 | + - **Fail-Safe**: Activates a **safe mode**, such as limiting vehicle speed or switching to manual control. |
| 118 | + - **Fail-Operational**: Ensures that high-safety applications (e.g., aerospace systems) can continue operating under certain conditions. |
| 119 | + |
| 120 | +For more insights on **Arm's Functional Safety solutions**, you can refer to: [Arm Functional Safety Compute Blog](https://community.arm.com/arm-community-blogs/b/automotive-blog/posts/functional-safety-compute) |
| 121 | + |
| 122 | + |
| 123 | +### Functional Safety in the Software Development Lifecycle |
| 124 | + |
| 125 | +Functional Safety impacts **both hardware and software development**, particularly in areas such as requirement changes, version management, and testing validation. |
| 126 | +For example, in ASIL-D level applications, every code modification requires a complete impact analysis and regression testing to ensure that new changes do not introduce additional risks. |
| 127 | + |
| 128 | +### Functional Safety Requirements in Software Development |
| 129 | +These practices ensure the software development process meets industry safety standards and can withstand system-level failures: |
| 130 | +- **Requirement Specification** |
| 131 | + - Clearly defining **safety-critical requirements** and conducting risk assessments. |
| 132 | +- **Safety-Oriented Programming** |
| 133 | + - Following **MISRA C, CERT C/C++ standards** and using static analysis tools to detect errors. |
| 134 | +- **Fault Handling Mechanisms** |
| 135 | + - Implementing **redundancy design and health monitoring** to handle anomalies. |
| 136 | +- **Testing and Verification** |
| 137 | + - Using **Hardware-in-the-Loop (HIL)** testing to ensure software safety in real hardware environments. |
| 138 | +- **Version Management and Change Control** |
| 139 | + - Using **Git, JIRA, Polarion** to track changes for safety audits. |
| 140 | + |
| 141 | +This learning path builds upon the previous containerized [learning path](https://learn.arm.com/learning-paths/automotive/openadkit1_container) guide and introduces Functional Safety design practices from the earliest development stages. |
| 142 | + |
| 143 | +By establishing an ASIL Partitioning software development environment and leveraging [**SOAFEE**](https://www.soafee.io/) technologies, developers can enhance software consistency and maintainability in Functional Safety applications. |
0 commit comments