Skip to content

Releases: openucx/ucx

v1.9.0-rc3

30 Aug 02:18
cac4887

Choose a tag to compare

v1.9.0-rc3 Pre-release
Pre-release

1.9.0-rc3 (August 29, 2020)

Features:

UCX Core

  • Added a new class of communication APIs '*_nbx' that enable API extendability while
    preserving ABI backward compatibility
  • Added asynchronous event support to UCT/IB/DEVX
  • Added support for latest CUDA library version
  • Added NAK-based reliability protocol for UCT/IB/UD to optimize resends
  • Added new tests for ROCm
  • Added new configuration parameters for protocol selection
  • Added performance optimization for Fujitsu A64FX with InfiniBand
  • Added performance optimization for clear cache code aarch64
  • Added support for relaxed-order PCIe access in IB RDMA transports
  • Added new TCP connection manager
  • Added support for UCT/IB PKey with partial membership in IB transports
  • Added support for RoCE LAG
  • Added flow control for RDMA read operations
  • Improved endpoint flush implementation for UCT/IB
  • Improved UD timer to avoid interrupting the main thread when not in use
  • Improved latency estimation for network path with CUDA
  • Improved error reporting messages
  • Improved performance in active message flow (removed malloc call)
  • Improved performance in ptr_array flow
  • Improved performance in UCT/SM progress engine flow
  • Improved I/O demo code
  • Updated examples code

UCX Java (API Preview)

  • Added support for UCX shared library loading from both classpath and LD_LIBRARY_PATH
  • Added configuration map to ucp_params to be able to set UCX properties programmatically

Bugfixes:

  • Fixes for most resent versions of GCC, CLANG, ARMCLANG, PGI
  • Fixes in UCT/IB for strict order keys
  • Fixes in memory barrier code for aarch64
  • Fixes in UCT/IB/DEVX for fork system call
  • Fixes in UCT/IB for rand() call in rdma-core
  • Fixed in group rescheduling for UCT/IB/DC
  • Fixes in UCT/CUDA bandwidth reporting
  • Fixes in rkey_ptr protocol
  • Fixes in lane selection for rendezvous protocol based on get-zero-copy flow
  • Fixes for ROCm build
  • Fixes for XPMEM transport
  • Fixes in closing endpoint code
  • Fixes in RDMACM code
  • Fixes in memcpy selection for AMD
  • Fixed in UCT/UD endpoint flush functionality
  • Fixes in XPMEM detection
  • Multiple fixes in RPM spec file
  • Multiple fixes in UCP documentation
  • Multiple fixes in socket connection manager
  • Multiple fixes in gtest
  • Multiple fixes in JAVA API implementation

v1.9.0-rc2

27 Aug 23:14
8f9e291

Choose a tag to compare

v1.9.0-rc2 Pre-release
Pre-release

1.9.0-rc2 (August 27, 2020)

Features:

UCX Core

  • Added a new class of communication APIs '*_nbx' that enable API extendability while
    preserving ABI backward compatibility
  • Added asynchronous event support to UCT/IB/DEVX
  • Added support for latest CUDA library version
  • Added NAK-based reliability protocol for UCT/IB/UD to optimize resends
  • Added new tests for ROCm
  • Added new configuration parameters for protocol selection
  • Added performance optimization for Fujitsu A64FX with InfiniBand
  • Added performance optimization for clear cache code aarch64
  • Added support for relaxed-order PCIe access in IB RDMA transports
  • Added new TCP connection manager
  • Added support for UCT/IB PKey with partial membership in IB transports
  • Added support for RoCE LAG
  • Added flow control for RDMA read operations
  • Improved endpoint flush implementation for UCT/IB
  • Improved UD timer to avoid interrupting the main thread when not in use
  • Improved latency estimation for network path with CUDA
  • Improved error reporting messages
  • Improved performance in active message flow (removed malloc call)
  • Improved performance in ptr_array flow
  • Improved performance in UCT/SM progress engine flow
  • Improved I/O demo code
  • Updated examples code

UCX Java (API Preview)

  • Added support for UCX shared library loading from both classpath and LD_LIBRARY_PATH
  • Added configuration map to ucp_params to be able to set UCX properties programmatically

Bugfixes:

  • Fixes for most resent versions of GCC, CLANG, ARMCLANG, PGI
  • Fixes in UCT/IB for strict order keys
  • Fixes in memory barrier code for aarch64
  • Fixes in UCT/IB/DEVX for fork system call
  • Fixes in UCT/IB for rand() call in rdma-core
  • Fixed in group rescheduling for UCT/IB/DC
  • Fixes in UCT/CUDA bandwidth reporting
  • Fixes in rkey_ptr protocol
  • Fixes in lane selection for rendezvous protocol based on get-zero-copy flow
  • Fixes for ROCm build
  • Fixes for XPMEM transport
  • Fixes in closing endpoint code
  • Fixes in RDMACM code
  • Fixes in memcpy selection for AMD
  • Fixed in UCT/UD endpoint flush functionality
  • Fixes in XPMEM detection
  • Multiple fixes in RPM spec file
  • Multiple fixes in UCP documentation
  • Multiple fixes in socket connection manager
  • Multiple fixes in gtest
  • Multiple fixes in JAVA API implementation

v1.9.0-rc1

08 Aug 00:00
7b189c9

Choose a tag to compare

v1.9.0-rc1 Pre-release
Pre-release

1.9.0-rc1 (August 7, 2020)

v1.8.1

11 Jul 05:35
6b29558

Choose a tag to compare

1.8.1 (July 10, 2020)

Features:

  • Added binary release pipeline in Azure CI

Bugfixes:

  • Multiple fixes in testing environment
  • Fixes in InfiniBand DEVX transport
  • Fixes in memory management for CUDA IPC transport
  • Fixes for binutils 2.34+
  • Fixes in RPM SPEC file and package generation
  • Fixes for AMD ROCM build environment

v1.8.1-rc4

07 Jul 23:31
7a25cf2

Choose a tag to compare

v1.8.1-rc4 Pre-release
Pre-release

1.8.1-rc4 (July 7, 2020)

Features:

  • Added binary release pipeline in Azure CI

Bugfixes:

  • Multiple fixes in testing environment
  • Fixes in InfiniBand DEVX transport
  • Fixes in memory management for CUDA IPC transport
  • Fixes for binutils 2.34+
  • Fixes in RPM SPEC file and package generation
  • Fixes for AMD ROCM build environment

v1.8.1-rc2

06 Jul 06:28
6ec60d3

Choose a tag to compare

v1.8.1-rc2 Pre-release
Pre-release

1.8.1-RC2 (July 4, 2020)

Features:

  • Added binary release pipeline in Azure CI

Bugfixes:

  • Multiple fixes in testing environment
  • Fixes in InfiniBand DEVX transport
  • Fixes in memory management for CUDA IPC transport
  • Fixes for binutils 2.34+
  • Fixes in RPM SPEC file and package generation
  • Fixes for AMD ROCM build environment

v1.8.1-rc1

30 Jun 07:50
13ff111

Choose a tag to compare

v1.8.1-rc1 Pre-release
Pre-release

1.8.1-RC1 (June 23, 2020)

Features:

  • Added binary release pipeline in Azure CI

Bugfixes:

  • Multiple fixes in testing environment
  • Fixes in InfiniBand DEVX transport
  • Fixes in memory management for CUDA IPC transport
  • Fixes for binutils 2.34+
  • Fixes in RPM SPEC file and package generation
  • Fixes for AMD ROCM build environment

v1.8.0

04 Apr 00:03
c30b7da

Choose a tag to compare

1.8.0 (April 3, 2020)

Features:

UCX Core

  • Improved detection for DEVX support
  • Improved TCP scalability
  • Added support for ROCM to perftest
  • Added support for different source and target memory types to perftest
  • Added optimized memcpy for ROCM devices
  • Added hardware tag-matching for CUDA buffers
  • Added support for CUDA and ROCM managed memories
  • Added support for client/server disconnect protocol over rdma connection manager
  • Added support for striding receive queue for hardware tag-matching
  • Added XPMEM-based rendezvous protocol for shared memory
  • Added support shared memory communication between containers on same machine
  • Added support for multi-threaded RDMA memory registration for large regions
  • Added new test cases to Azure CI

UCX Java (API Preview)

  • Added APIs for stream send/recv, tag probe, and connect request handle
  • Added Java package (automatically published) to Maven central

Bugfixes:

  • Multiple fixes in JUCX
  • Fixes in UCP thread safety
  • Fixes for most recent versions GCC, PGI, and ICC
  • Fixes for CPU affinity on Azure instances
  • Fixes in XPMEM support on PPC64
  • Performance fixes in CUDA IPC
  • Fixes in RDMA CM flows
  • Multiple fixes in TCP transport
  • Multiple fixes in documentation
  • Fixes in transport lane selection logic
  • Fixes in Java jar build
  • Fixes in socket connection manager for Nvidia DGX-2 platform

v1.8.0-rc2

01 Apr 18:04
465e1fb

Choose a tag to compare

v1.8.0-rc2 Pre-release
Pre-release

1.8.0-rc2 (TBD)

Features:

UCX Core

  • Improved detection for DEVX support
  • Improved TCP scalability
  • Added support for ROCM to perftest
  • Added support for different source and target memory types to perftest
  • Added optimized memcpy for ROCM devices
  • Added hardware tag-matching for CUDA buffers
  • Added support for CUDA and ROCM managed memories
  • Added support for client/server disconnect protocol over rdma connection manager
  • Added support for striding receive queue for hardware tag-matching
  • Added XPMEM-based rendezvous protocol for shared memory
  • Added support shared memory communication between containers on same machine
  • Added support for multi-threaded RDMA memory registration for large regions
  • Added new test cases to Azure CI

UCX Java (API Preview)

  • Added APIs for stream send/recv, tag probe, and connect request handle
  • Added Java package (automatically published) to Maven central

Bugfixes:

  • Multiple fixes in JUCX
  • Fixes in UCP thread safety
  • Fixes for most recent versions GCC, PGI, and ICC
  • Fixes for CPU affinity on Azure instances
  • Fixes in XPMEM support on PPC64
  • Performance fixes in CUDA IPC
  • Fixes in RDMA CM flows
  • Multiple fixes in TCP transport
  • Multiple fixes in documentation
  • Fixes in transport lane selection logic
  • Fixes in Java jar build
  • Fixes in socket connection manager for Nvidia DGX-2 platform

v1.8.0-rc1

15 Mar 09:14
fdb84ae

Choose a tag to compare

v1.8.0-rc1 Pre-release
Pre-release

1.8.0-rc1

Features:

UCX Core

  • Improved detection for DEVX support
  • Improved TCP scalability
  • Added support for ROCM to perftest
  • Added support for different source and target memory types to perftest
  • Added optimized memcpy for ROCM devices
  • Added hardware tag-matching for CUDA buffers
  • Added support for CUDA and ROCM managed memories
  • Added support for client/server disconnect protocol over rdma connection manager
  • Added support for striding receive queue for hardware tag-matching
  • Added XPMEM-based rendezvous protocol for shared memory
  • Added support shared memory communication between containers on same machine
  • Added support for multi-threaded RDMA memory registration for large regions

UCX Java (API Preview)

  • Added APIs for stream send/recv, tag probe, and connect request handle
  • Added Java package (automatically published) to Maven central

Bugfixes:

  • Multiple fixes in JUCX
  • Fixes in UCP thread safety
  • Fixes for most recent versions GCC, PGI, and ICC
  • Fixes for CPU affinity on Azure instances
  • Fixes in XPMEM support on PPC64
  • Performance fixes in CUDA IPC
  • Fixes in RDMA CM flows
  • Multiple fixes in TCP transport
  • Multiple fixes in documentation