|
| 1 | +--- |
| 2 | +title: HLS4ML - AI SoC Design |
| 3 | +description: Explore using HLS4ML, an open-source project developed at organisations including CERN, Fermilab, and MIT, to create hardware for AI, compatible with Arm IP to form an SoC capable of tape-out. |
| 4 | +subjects: |
| 5 | +- ML |
| 6 | +- Performance and Architecture |
| 7 | +requires-team: |
| 8 | +- No |
| 9 | +platform: |
| 10 | +- AI |
| 11 | +sw-hw: |
| 12 | +- Software |
| 13 | +- Hardware |
| 14 | +support-level: |
| 15 | +- Self-Service |
| 16 | +- Arm Ambassador Support |
| 17 | +publication-date: 2026-01-16 |
| 18 | +license: |
| 19 | +status: |
| 20 | +- Hidden |
| 21 | +layout: article |
| 22 | +sidebar: |
| 23 | + nav: projects |
| 24 | +full_description: |- |
| 25 | + <img class="image image--xl" src="/Arm-Developer-Labs/images/Learn_on_Arm_banner.png" loading="lazy" decoding="async" /> |
| 26 | +
|
| 27 | + # SoC Labs Logo |
| 28 | +
|
| 29 | + ## Description |
| 30 | +
|
| 31 | + ### Why is this important? |
| 32 | +
|
| 33 | + Tools such as HLS4ML have proven extremely successful for rapid FPGA-based AI acceleration, particularly in research and edge-AI contexts. However, there is currently a significant gap between FPGA-targeted HLS workflows and reproducible, tape-out-ready ASIC design flows. |
| 34 | +
|
| 35 | + Many AI accelerator projects focus on a single model, dataset, or hardware target. This project instead focuses on defining a repeatable and auditable methodology for translating AI models into hardware IP suitable for integration into an Arm-based SoC and potential fabrication. |
| 36 | +
|
| 37 | + We are looking for a clear, well-documented path from AI model → HLS → RTL → Arm-based SoC → tape-out. This reduces the cost, risk, and expertise barrier for startups, academics, and independent developers to innovate on silicon. |
| 38 | +
|
| 39 | + ### Project Summary |
| 40 | +
|
| 41 | + This project explores the use of HLS4ML (and comparable HLS-based AI tools) to develop a repeatable, tape-out-capable workflow for creating AI accelerator IP that can be integrated with Arm IP into a full SoC design. |
| 42 | +
|
| 43 | + The focus is not on a single accelerator or model, but on defining a methodology that includes: |
| 44 | +
|
| 45 | + AI and software layer |
| 46 | + - Selection of AI models (e.g. CNNs, transformers, or classical ML models). |
| 47 | + - Quantisation, pruning, and model optimisation strategies compatible with hardware implementation. |
| 48 | + - Clear mapping between software frameworks (e.g. TensorFlow / PyTorch) and hardware-friendly representations. |
| 49 | +
|
| 50 | + Hardware generation layer |
| 51 | + - Use of HLS4ML (or alternatives where appropriate) to generate synthesizable RTL. |
| 52 | + - Identification of constraints and modifications required to move from FPGA-oriented HLS outputs to ASIC-suitable RTL. |
| 53 | + - Definition of design rules, interfaces, and verification steps that ensure reproducibility. |
| 54 | +
|
| 55 | + SoC Design |
| 56 | + - Integration of the generated accelerator as an IP block within an Arm-based SoC. |
| 57 | + - Documentation of the additional steps required beyond FPGA validation: synthesis targets, timing closure considerations, verification, and power/area trade-offs. |
| 58 | + - Clear identification of what changes are needed to move from “works on FPGA” to “ready for fabrication”. |
| 59 | +
|
| 60 | + The output of this project is a reference flow, example designs, and documentation that others can follow to reproducibly generate AI accelerator IP suitable for integration into real Arm-based silicon. |
| 61 | +
|
| 62 | + ## What will you use? |
| 63 | + You should be familiar with, or willing to learn about: |
| 64 | + - High-level Synthesis (HLS), including HLS4ML or similar tools |
| 65 | + - Machine learning frameworks (e.g. TensorFlow, PyTorch) |
| 66 | + - AI model optimisation for hardware (quantisation, pruning) |
| 67 | + - RTL and digital design basics |
| 68 | + - Arm-based SoC and accelerator integration concepts |
| 69 | + - Software–hardware co-design |
| 70 | + - Reproducible design workflows |
| 71 | +
|
| 72 | + ## Resources from Arm and our partners |
| 73 | +
|
| 74 | + # TODO |
| 75 | +
|
| 76 | + ## Support Level |
| 77 | +
|
| 78 | + This project is designed to be self-serve but comes with opportunity of some community support from Arm Ambassadors, who are part of the Arm Developer program. If you are not already part of our program, [click here to join](https://www.arm.com/resources/developer-program?#register). |
| 79 | +
|
| 80 | + This project can also be supported by SoC Labs in Southampton. |
| 81 | + # TODO |
| 82 | +
|
| 83 | + ## Benefits |
| 84 | +
|
| 85 | + Standout project contributions will result in digital badges for CV building, recognised by Arm Talent Acquisition. We are currently discussing with national agencies the potential for funding streams for Arm Developer Labs projects, which would flow to you, not us. |
| 86 | +
|
| 87 | +
|
| 88 | + To receive the benefits, you must show us your project through our [online form](https://forms.office.com/e/VZnJQLeRhD). Please do not include any confidential information in your contribution. Additionally if you are affiliated with an academic institution, please ensure you have the right to share your material. |
| 89 | +--- |
| 90 | + |
| 91 | +<img class="image image--xl" src="/Arm-Developer-Labs/images/Learn_on_Arm_banner.png" loading="lazy" decoding="async" /> |
| 92 | + |
| 93 | +# SoC Labs Logo |
| 94 | + |
| 95 | +## Description |
| 96 | + |
| 97 | +### Why is this important? |
| 98 | + |
| 99 | +Tools such as HLS4ML have proven extremely successful for rapid FPGA-based AI acceleration, particularly in research and edge-AI contexts. However, there is currently a significant gap between FPGA-targeted HLS workflows and reproducible, tape-out-ready ASIC design flows. |
| 100 | + |
| 101 | +Many AI accelerator projects focus on a single model, dataset, or hardware target. This project instead focuses on defining a repeatable and auditable methodology for translating AI models into hardware IP suitable for integration into an Arm-based SoC and potential fabrication. |
| 102 | + |
| 103 | +We are looking for a clear, well-documented path from AI model → HLS → RTL → Arm-based SoC → tape-out. This reduces the cost, risk, and expertise barrier for startups, academics, and independent developers to innovate on silicon. |
| 104 | + |
| 105 | +### Project Summary |
| 106 | + |
| 107 | +This project explores the use of HLS4ML (and comparable HLS-based AI tools) to develop a repeatable, tape-out-capable workflow for creating AI accelerator IP that can be integrated with Arm IP into a full SoC design. |
| 108 | + |
| 109 | +The focus is not on a single accelerator or model, but on defining a methodology that includes: |
| 110 | + |
| 111 | +AI and software layer |
| 112 | +- Selection of AI models (e.g. CNNs, transformers, or classical ML models). |
| 113 | +- Quantisation, pruning, and model optimisation strategies compatible with hardware implementation. |
| 114 | +- Clear mapping between software frameworks (e.g. TensorFlow / PyTorch) and hardware-friendly representations. |
| 115 | + |
| 116 | +Hardware generation layer |
| 117 | +- Use of HLS4ML (or alternatives where appropriate) to generate synthesizable RTL. |
| 118 | +- Identification of constraints and modifications required to move from FPGA-oriented HLS outputs to ASIC-suitable RTL. |
| 119 | +- Definition of design rules, interfaces, and verification steps that ensure reproducibility. |
| 120 | + |
| 121 | +SoC Design |
| 122 | +- Integration of the generated accelerator as an IP block within an Arm-based SoC. |
| 123 | +- Documentation of the additional steps required beyond FPGA validation: synthesis targets, timing closure considerations, verification, and power/area trade-offs. |
| 124 | +- Clear identification of what changes are needed to move from “works on FPGA” to “ready for fabrication”. |
| 125 | + |
| 126 | +The output of this project is a reference flow, example designs, and documentation that others can follow to reproducibly generate AI accelerator IP suitable for integration into real Arm-based silicon. |
| 127 | + |
| 128 | +## What will you use? |
| 129 | +You should be familiar with, or willing to learn about: |
| 130 | +- High-level Synthesis (HLS), including HLS4ML or similar tools |
| 131 | +- Machine learning frameworks (e.g. TensorFlow, PyTorch) |
| 132 | +- AI model optimisation for hardware (quantisation, pruning) |
| 133 | +- RTL and digital design basics |
| 134 | +- Arm-based SoC and accelerator integration concepts |
| 135 | +- Software–hardware co-design |
| 136 | +- Reproducible design workflows |
| 137 | + |
| 138 | +## Resources from Arm and our partners |
| 139 | + |
| 140 | +# TODO |
| 141 | + |
| 142 | +## Support Level |
| 143 | + |
| 144 | +This project is designed to be self-serve but comes with opportunity of some community support from Arm Ambassadors, who are part of the Arm Developer program. If you are not already part of our program, [click here to join](https://www.arm.com/resources/developer-program?#register). |
| 145 | + |
| 146 | +This project can also be supported by SoC Labs in Southampton. |
| 147 | +# TODO |
| 148 | + |
| 149 | +## Benefits |
| 150 | + |
| 151 | +Standout project contributions will result in digital badges for CV building, recognised by Arm Talent Acquisition. We are currently discussing with national agencies the potential for funding streams for Arm Developer Labs projects, which would flow to you, not us. |
| 152 | + |
| 153 | + |
| 154 | +To receive the benefits, you must show us your project through our [online form](https://forms.office.com/e/VZnJQLeRhD). Please do not include any confidential information in your contribution. Additionally if you are affiliated with an academic institution, please ensure you have the right to share your material. |
0 commit comments