What's Changed
- Use explicit type for dims in ParallelReduce by @PhilipFackler in #292
- Fix backend bits by @PhilipFackler in #293
- Fix type instability causing crash in 2D parallel_for on AMDGPU by @PhilipFackler in #299
- Add to_device and create_stream by @PhilipFackler in #300
- Versions of
array()for allocating uninitialized arrays by @PhilipFackler in #301 - Add Apple GPU CI on ExCL by @williamfgc in #305
- Fix runners in CI by @williamfgc in #307
- Metal backend by @williamfgc in #306
- Use correct arch label in CI by @williamfgc in #308
- Update README by @williamfgc in #310
- Correct dimensions for
JACC.sharedby @PhilipFackler in #309 - Use explicit type for workspace member to avoid type instability by @PhilipFackler in #313
- Add basic macro syntax by @PhilipFackler in #312
- Refactored
ParallelReduceouter constructors into JACC.reducer by @PhilipFackler in #314 - Update AMDGPU perf-test kernel by @luraess in #315
- Custom ranges for parallel_for and parallel_reduce by @PhilipFackler in #316
- Repo GPU CI JuliaORNL to JuliaGPU by @williamfgc in #318
- Fix ReadMe by @williamfgc in #317
- Fix documentation link in badge by @williamfgc in #319
- Update deploydocs org to JuliaGPU by @williamfgc in #321
- Add API documentation by @williamfgc in #320
- Update api_usage.md by @PhilipFackler in #322
New Contributors
Full Changelog: v0.6.0...v1.0.0