
Conversation

@rgsl888prabhu (Collaborator) commented Nov 7, 2025

Checklist

  • I am familiar with the Contributing Guidelines.
  • Testing
    • New or existing tests cover these changes
    • Added tests
    • Created an issue to follow-up
    • NA
  • Documentation
    • The documentation is up to date with these changes
    • Added new documentation
    • NA

Summary by CodeRabbit

  • Refactor

    • Improved solver result handling and host-side data extraction for LP and MIP solves, simplifying data transfer and conversion for Python users.
  • Documentation

    • Added contributor guidelines, code of conduct, security reporting, and agent guidance files; updated README badges and templates.
  • Chores

    • Minor cleanup (imports, headers, templates) and copyright/year updates.


@rgsl888prabhu rgsl888prabhu requested review from a team as code owners November 7, 2025 18:22
coderabbitai bot commented Nov 7, 2025

📝 Walkthrough

Replace device-resident solution buffers with host-side std::vector in LP/MIP solver return structures and update data-transfer/extraction logic in C++ and Cython bindings; add several repository documentation and GitHub meta files and minor CI/README updates.

Changes

| Cohort / File(s) | Summary |
| --- | --- |
| C++ Solver API/Header: cpp/include/cuopt/linear_programming/utilities/cython_solve.hpp | Change public fields on linear_programming_ret_t and mip_ret_t: replace many std::unique_ptr<rmm::device_buffer> members with std::vector<double> and add new scalar/vector status fields (initial_primal_weight_, initial_step_size_, total_pdlp_iterations_, total_pdhg_iterations_, last_candidate_kkt_score_, last_restart_kkt_score_, sum_solution_weight_, iterations_since_last_restart_). (Public API signature changes) |
| C++ Solver Implementation: cpp/src/linear_programming/utilities/cython_solve.cu | Replace device_buffer-based host extractions with host-copy transfers into std::vector for LP and MILP paths; comments mark device→host conversions; no control-flow or error-handling changes. |
| Cython Declarations & Wrapper: python/cuopt/cuopt/linear_programming/solver/solver.pxd, python/cuopt/cuopt/linear_programming/solver/solver_wrapper.pyx | Update Cython struct declarations to vector[double]; replace DeviceBuffer/CuDF/cupy reconstruction with direct NumPy array creation from C++ host vectors; add empty-vector guards. (Bindings adjusted to match header signature changes) |
| Python minor wrapper: python/cuopt/cuopt/distance_engine/waypoint_matrix_wrapper.pyx | Trivial formatting change: a duplicate import pyarrow as pa added and a header year bump. |
| Repository docs & meta: .github/AGENTS.md, .github/CODE_OF_CONDUCT.md, .github/SECURITY.md, .github/ISSUE_TEMPLATE/* | New and updated GitHub docs and issue templates (agent guidelines, code of conduct, security, and emoji-prefixed issue template names). Documentation only; no runtime changes. |
| README & CI release script: README.md, ci/release/update-version.sh | README badges updated/added and Resources subsection removed; release script updated to bump the year and update the README badge and nightly tag substitution. |
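
To make the header change concrete, here is a minimal before/after sketch of linear_programming_ret_t. The vector field names come from the review comments below; the scalar field types are inferred from their names, not confirmed against the actual header, and the surrounding members are illustrative only:

// Before: solution data returned as device-resident buffers (sketch).
struct linear_programming_ret_t {
  std::unique_ptr<rmm::device_buffer> primal_solution_;
  std::unique_ptr<rmm::device_buffer> dual_solution_;
  std::unique_ptr<rmm::device_buffer> reduced_cost_;
  // ... PDLP warm-start buffers, status fields ...
};

// After: solution data copied into host-side std::vector, plus new
// scalar status fields (types guessed from the names).
struct linear_programming_ret_t {
  std::vector<double> primal_solution_;
  std::vector<double> dual_solution_;
  std::vector<double> reduced_cost_;
  double initial_primal_weight_;
  double initial_step_size_;
  int total_pdlp_iterations_;
  int total_pdhg_iterations_;
  double last_candidate_kkt_score_;
  double last_restart_kkt_score_;
  double sum_solution_weight_;
  int iterations_since_last_restart_;
  // ... PDLP warm-start vectors, status fields ...
};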

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Pre-merge checks and finishing touches

❌ Failed checks (1 warning)

| Check name | Status | Explanation | Resolution |
| --- | --- | --- | --- |
| Docstring Coverage | ⚠️ Warning | Docstring coverage is 75.00%, which is insufficient; the required threshold is 80.00%. | You can run @coderabbitai generate docstrings to improve docstring coverage. |

✅ Passed checks (2 passed)

| Check name | Status | Explanation |
| --- | --- | --- |
| Description Check | ✅ Passed | Check skipped: CodeRabbit's high-level summary is enabled. |
| Title check | ✅ Passed | The PR title accurately describes the main objective: changing solution objects to return host memory instead of GPU memory for LP and MILP from libcuopt. |

@rgsl888prabhu rgsl888prabhu self-assigned this Nov 7, 2025
@rgsl888prabhu rgsl888prabhu added non-breaking Introduces a non-breaking change improvement Improves an existing functionality labels Nov 7, 2025

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 1

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)
python/cuopt/cuopt/linear_programming/solver/solver_wrapper.pyx (1)

18-21: Fix std::vector → NumPy conversion before this ships.

vector[double] cannot be implicitly assigned to a double[:] memoryview – Cython raises a compile-time error (“Cannot convert ‘vector[double]’ to memoryview”). That impacts every assignment here (solution_view, primal_view, warm‑start vectors, …), so the extension module won’t even build. Please expose the underlying pointer/size and cast that to a memoryview (or wrap it in a helper) before calling np.asarray. Example fix:

@@
-from libc.stdint cimport uintptr_t
-from libc.stdlib cimport free, malloc
-from libc.string cimport memcpy, strcpy, strlen
+from libc.stddef cimport size_t
+from libc.stdint cimport uintptr_t
+from libc.stdlib cimport free, malloc
+from libc.string cimport memcpy, strcpy, strlen
@@
 def type_cast(cudf_obj, np_type, name):
     ...
     return cudf_obj
+
+
+cdef np.ndarray _copy_vector_to_numpy(vector[double]& vec):
+    cdef size_t n = vec.size()
+    if n == 0:
+        return np.empty(0, dtype=np.float64)
+    cdef double[:] view = <double[:n]> vec.data()
+    return np.asarray(view).copy()
@@
-        cdef double[:] solution_view = sol_ret.mip_ret.solution_
-        solution = np.asarray(solution_view).copy()
+        solution = _copy_vector_to_numpy(sol_ret.mip_ret.solution_)
@@
-        cdef double[:] primal_view = sol_ret.lp_ret.primal_solution_
-        cdef double[:] dual_view = sol_ret.lp_ret.dual_solution_
-        cdef double[:] reduced_view = sol_ret.lp_ret.reduced_cost_
-        primal_solution = np.asarray(primal_view).copy()
-        dual_solution = np.asarray(dual_view).copy()
-        reduced_cost = np.asarray(reduced_view).copy()
+        primal_solution = _copy_vector_to_numpy(sol_ret.lp_ret.primal_solution_)
+        dual_solution = _copy_vector_to_numpy(sol_ret.lp_ret.dual_solution_)
+        reduced_cost = _copy_vector_to_numpy(sol_ret.lp_ret.reduced_cost_)
@@
-            cdef double[:] curr_primal_view = sol_ret.lp_ret.current_primal_solution_
-            ...
-            current_primal_solution = np.asarray(curr_primal_view).copy()
-            ...
-            last_restart_duality_gap_dual_solution = np.asarray(last_restart_dual_view).copy()
+            current_primal_solution = _copy_vector_to_numpy(sol_ret.lp_ret.current_primal_solution_)
+            current_dual_solution = _copy_vector_to_numpy(sol_ret.lp_ret.current_dual_solution_)
+            initial_primal_average = _copy_vector_to_numpy(sol_ret.lp_ret.initial_primal_average_)
+            initial_dual_average = _copy_vector_to_numpy(sol_ret.lp_ret.initial_dual_average_)
+            current_ATY = _copy_vector_to_numpy(sol_ret.lp_ret.current_ATY_)
+            sum_primal_solutions = _copy_vector_to_numpy(sol_ret.lp_ret.sum_primal_solutions_)
+            sum_dual_solutions = _copy_vector_to_numpy(sol_ret.lp_ret.sum_dual_solutions_)
+            last_restart_duality_gap_primal_solution = _copy_vector_to_numpy(
+                sol_ret.lp_ret.last_restart_duality_gap_primal_solution_)
+            last_restart_duality_gap_dual_solution = _copy_vector_to_numpy(
+                sol_ret.lp_ret.last_restart_duality_gap_dual_solution_)

(Apply the helper wherever you currently rely on the implicit cast.) This restores buildability while still giving you the defensive copy you want on the Python side.

Also applies to: 119-142, 300-379

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 48400f9 and 1ab34ab.

📒 Files selected for processing (4)
  • cpp/include/cuopt/linear_programming/utilities/cython_solve.hpp (2 hunks)
  • cpp/src/linear_programming/utilities/cython_solve.cu (2 hunks)
  • python/cuopt/cuopt/linear_programming/solver/solver.pxd (2 hunks)
  • python/cuopt/cuopt/linear_programming/solver/solver_wrapper.pyx (5 hunks)
🧰 Additional context used
🧠 Learnings (1)
📚 Learning: 2025-10-22T14:25:22.899Z
Learnt from: aliceb-nv
Repo: NVIDIA/cuopt PR: 527
File: cpp/src/mip/diversity/lns/rins.cu:167-175
Timestamp: 2025-10-22T14:25:22.899Z
Learning: In MIP (Mixed Integer Programming) problems in the cuOPT codebase, `n_integer_vars == 0` is impossible by definition—MIP problems must have at least one integer variable. If there are no integer variables, it would be a pure Linear Programming (LP) problem, not a MIP problem.

Applied to files:

  • cpp/src/linear_programming/utilities/cython_solve.cu
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (11)
  • GitHub Check: wheel-build-cuopt-sh-client / 12.9.1, 3.13, amd64, rockylinux8
  • GitHub Check: wheel-build-cuopt-sh-client / 13.0.2, 3.13, amd64, rockylinux8
  • GitHub Check: wheel-build-cuopt-mps-parser / 13.0.2, 3.13, arm64, rockylinux8
  • GitHub Check: wheel-build-cuopt-mps-parser / 13.0.2, 3.13, amd64, rockylinux8
  • GitHub Check: wheel-build-cuopt-mps-parser / 13.0.2, 3.11, arm64, rockylinux8
  • GitHub Check: wheel-build-cuopt-mps-parser / 13.0.2, 3.11, amd64, rockylinux8
  • GitHub Check: wheel-build-cuopt-mps-parser / 13.0.2, 3.12, arm64, rockylinux8
  • GitHub Check: wheel-build-cuopt-mps-parser / 13.0.2, 3.10, arm64, rockylinux8
  • GitHub Check: wheel-build-cuopt-mps-parser / 13.0.2, 3.12, amd64, rockylinux8
  • GitHub Check: wheel-build-cuopt-mps-parser / 13.0.2, 3.10, amd64, rockylinux8
  • GitHub Check: checks / check-style

Comment on lines +140 to 149
cuopt::host_copy(solution.get_pdlp_warm_start_data().current_primal_solution_),
cuopt::host_copy(solution.get_pdlp_warm_start_data().current_dual_solution_),
cuopt::host_copy(solution.get_pdlp_warm_start_data().initial_primal_average_),
cuopt::host_copy(solution.get_pdlp_warm_start_data().initial_dual_average_),
cuopt::host_copy(solution.get_pdlp_warm_start_data().current_ATY_),
cuopt::host_copy(solution.get_pdlp_warm_start_data().sum_primal_solutions_),
cuopt::host_copy(solution.get_pdlp_warm_start_data().sum_dual_solutions_),
cuopt::host_copy(solution.get_pdlp_warm_start_data().last_restart_duality_gap_primal_solution_),
cuopt::host_copy(solution.get_pdlp_warm_start_data().last_restart_duality_gap_dual_solution_),
solution.get_pdlp_warm_start_data().initial_primal_weight_,

⚠️ Potential issue | 🟠 Major

Skip warm-start host copies when we’re in batch mode.

With this change every PDLP warm-start vector is eagerly host_copy’d, even when is_batch_mode is true. Batch solves never consume those fields (see create_solution(..., is_batch=True)), so we now pay multiple device→host transfers per instance for no benefit. For large LPs that’s a significant regression compared to the old device_buffer path, where batch executions left the data on device.

Wrap these initializers so the copies only occur when !is_batch_mode, e.g.:

@@
-    cuopt::host_copy(solution.get_pdlp_warm_start_data().current_primal_solution_),
-    cuopt::host_copy(solution.get_pdlp_warm_start_data().current_dual_solution_),
-    cuopt::host_copy(solution.get_pdlp_warm_start_data().initial_primal_average_),
-    cuopt::host_copy(solution.get_pdlp_warm_start_data().initial_dual_average_),
-    cuopt::host_copy(solution.get_pdlp_warm_start_data().current_ATY_),
-    cuopt::host_copy(solution.get_pdlp_warm_start_data().sum_primal_solutions_),
-    cuopt::host_copy(solution.get_pdlp_warm_start_data().sum_dual_solutions_),
-    cuopt::host_copy(solution.get_pdlp_warm_start_data().last_restart_duality_gap_primal_solution_),
-    cuopt::host_copy(solution.get_pdlp_warm_start_data().last_restart_duality_gap_dual_solution_),
+    is_batch_mode ? std::vector<double>{}
+                  : cuopt::host_copy(solution.get_pdlp_warm_start_data().current_primal_solution_),
+    is_batch_mode ? std::vector<double>{}
+                  : cuopt::host_copy(solution.get_pdlp_warm_start_data().current_dual_solution_),
+    is_batch_mode ? std::vector<double>{}
+                  : cuopt::host_copy(solution.get_pdlp_warm_start_data().initial_primal_average_),
+    is_batch_mode ? std::vector<double>{}
+                  : cuopt::host_copy(solution.get_pdlp_warm_start_data().initial_dual_average_),
+    is_batch_mode ? std::vector<double>{}
+                  : cuopt::host_copy(solution.get_pdlp_warm_start_data().current_ATY_),
+    is_batch_mode ? std::vector<double>{}
+                  : cuopt::host_copy(solution.get_pdlp_warm_start_data().sum_primal_solutions_),
+    is_batch_mode ? std::vector<double>{}
+                  : cuopt::host_copy(solution.get_pdlp_warm_start_data().sum_dual_solutions_),
+    is_batch_mode ? std::vector<double>{}
+                  : cuopt::host_copy(solution.get_pdlp_warm_start_data().last_restart_duality_gap_primal_solution_),
+    is_batch_mode ? std::vector<double>{}
+                  : cuopt::host_copy(solution.get_pdlp_warm_start_data().last_restart_duality_gap_dual_solution_),

(or compute the vectors above the initializer and reuse them). That preserves existing semantics while avoiding unnecessary transfers in the batch path.

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
-    cuopt::host_copy(solution.get_pdlp_warm_start_data().current_primal_solution_),
-    cuopt::host_copy(solution.get_pdlp_warm_start_data().current_dual_solution_),
-    cuopt::host_copy(solution.get_pdlp_warm_start_data().initial_primal_average_),
-    cuopt::host_copy(solution.get_pdlp_warm_start_data().initial_dual_average_),
-    cuopt::host_copy(solution.get_pdlp_warm_start_data().current_ATY_),
-    cuopt::host_copy(solution.get_pdlp_warm_start_data().sum_primal_solutions_),
-    cuopt::host_copy(solution.get_pdlp_warm_start_data().sum_dual_solutions_),
-    cuopt::host_copy(solution.get_pdlp_warm_start_data().last_restart_duality_gap_primal_solution_),
-    cuopt::host_copy(solution.get_pdlp_warm_start_data().last_restart_duality_gap_dual_solution_),
-    solution.get_pdlp_warm_start_data().initial_primal_weight_,
+    is_batch_mode ? std::vector<double>{}
+                  : cuopt::host_copy(solution.get_pdlp_warm_start_data().current_primal_solution_),
+    is_batch_mode ? std::vector<double>{}
+                  : cuopt::host_copy(solution.get_pdlp_warm_start_data().current_dual_solution_),
+    is_batch_mode ? std::vector<double>{}
+                  : cuopt::host_copy(solution.get_pdlp_warm_start_data().initial_primal_average_),
+    is_batch_mode ? std::vector<double>{}
+                  : cuopt::host_copy(solution.get_pdlp_warm_start_data().initial_dual_average_),
+    is_batch_mode ? std::vector<double>{}
+                  : cuopt::host_copy(solution.get_pdlp_warm_start_data().current_ATY_),
+    is_batch_mode ? std::vector<double>{}
+                  : cuopt::host_copy(solution.get_pdlp_warm_start_data().sum_primal_solutions_),
+    is_batch_mode ? std::vector<double>{}
+                  : cuopt::host_copy(solution.get_pdlp_warm_start_data().sum_dual_solutions_),
+    is_batch_mode ? std::vector<double>{}
+                  : cuopt::host_copy(solution.get_pdlp_warm_start_data().last_restart_duality_gap_primal_solution_),
+    is_batch_mode ? std::vector<double>{}
+                  : cuopt::host_copy(solution.get_pdlp_warm_start_data().last_restart_duality_gap_dual_solution_),
+    solution.get_pdlp_warm_start_data().initial_primal_weight_,
🤖 Prompt for AI Agents
In cpp/src/linear_programming/utilities/cython_solve.cu around lines 140 to 149,
the PDLP warm-start vectors are unconditionally copied to host causing
unnecessary device→host transfers in batch mode; change the initializer to only
perform cuopt::host_copy calls when !is_batch_mode (or compute the host_copy
results into local variables above the initializer and reuse them), so that when
is_batch_mode is true the original device_buffer path/data is preserved and no
host copies are performed; ensure all referenced warm-start fields are
conditionally set so semantics for non-batch solves remain unchanged.

@anandhkb anandhkb added this to the 25.12 milestone Nov 8, 2025
@rgsl888prabhu rgsl888prabhu requested a review from a team as a code owner November 10, 2025 19:18

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 3

🧹 Nitpick comments (1)
python/cuopt/cuopt/linear_programming/solver/solver_wrapper.pyx (1)

384-471: Consider extracting the vector copying pattern into a helper function.

The warm start data extraction contains significant code duplication—nine nearly identical blocks that each extract a vector from C++. While functionally correct and safe, this could be refactored for maintainability.

Consider a helper function like:

cdef np.ndarray _copy_cpp_vector_to_numpy(const double* data_ptr, size_t size):
    """Safely copy C++ vector data to NumPy array, handling empty vectors."""
    if size > 0:
        return np.asarray(<double[:size]>data_ptr).copy()
    else:
        return np.array([], dtype=np.float64)

Then use it as:

current_primal_solution = _copy_cpp_vector_to_numpy(
    sol_ret.lp_ret.current_primal_solution_.data(),
    sol_ret.lp_ret.current_primal_solution_.size()
)
current_dual_solution = _copy_cpp_vector_to_numpy(
    sol_ret.lp_ret.current_dual_solution_.data(),
    sol_ret.lp_ret.current_dual_solution_.size()
)
# ... and so on for the other 7 vectors

This would reduce ~88 lines to ~20 lines while maintaining the same safety guarantees.

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 2160bef and 1a5edbf.

📒 Files selected for processing (7)
  • ci/test_wheel_cuopt.sh (0 hunks)
  • python/cuopt/cuopt/distance_engine/waypoint_matrix_wrapper.pyx (2 hunks)
  • python/cuopt/cuopt/linear_programming/solver/solver_wrapper.pyx (5 hunks)
  • python/cuopt/cuopt/routing/utils_wrapper.pyx (4 hunks)
  • python/cuopt/cuopt/routing/vehicle_routing_wrapper.pyx (2 hunks)
  • python/cuopt/cuopt/utilities/utils.py (2 hunks)
  • python/cuopt_server/cuopt_server/tests/test_lp.py (0 hunks)
💤 Files with no reviewable changes (2)
  • python/cuopt_server/cuopt_server/tests/test_lp.py
  • ci/test_wheel_cuopt.sh
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (9)
  • GitHub Check: wheel-build-cuopt-sh-client / 13.0.2, 3.10, amd64, rockylinux8
  • GitHub Check: wheel-build-cuopt-mps-parser / 13.0.2, 3.13, arm64, rockylinux8
  • GitHub Check: wheel-build-cuopt-mps-parser / 13.0.2, 3.12, arm64, rockylinux8
  • GitHub Check: wheel-build-cuopt-mps-parser / 13.0.2, 3.11, amd64, rockylinux8
  • GitHub Check: wheel-build-cuopt-mps-parser / 13.0.2, 3.11, arm64, rockylinux8
  • GitHub Check: wheel-build-cuopt-mps-parser / 13.0.2, 3.10, arm64, rockylinux8
  • GitHub Check: wheel-build-cuopt-mps-parser / 13.0.2, 3.12, amd64, rockylinux8
  • GitHub Check: wheel-build-cuopt-mps-parser / 13.0.2, 3.10, amd64, rockylinux8
  • GitHub Check: wheel-build-cuopt-mps-parser / 13.0.2, 3.13, amd64, rockylinux8
🔇 Additional comments (14)
python/cuopt/cuopt/linear_programming/solver/solver_wrapper.pyx (2)

300-307: LGTM! Memory safety looks correct.

The host memory extraction pattern is implemented correctly:

  • Uses .copy() to create an owned NumPy array from the C++ vector data
  • Properly handles empty vectors
  • The C++ vector lifetime is safe since sol_ret remains in scope

342-368: LGTM! Consistent and safe extraction pattern.

The LP solution vectors (primal, dual, reduced cost) are extracted using the same safe pattern as the MIP solution, with proper empty vector handling for each.

python/cuopt/cuopt/routing/utils_wrapper.pyx (5)

30-30: LGTM: PyArrow import properly added.

The import is correctly placed and utilized for type specifications throughout the file.


133-140: LGTM: Vehicle column types consistently updated.

All vehicle-related columns (time windows and boolean flags) have been updated to use PyArrow type constructors consistently.


175-176: LGTM: Order time columns consistently updated.

The order time columns have been updated to use PyArrow type constructors, consistent with the other changes in this file.


30-176: Clarify the relationship between PR objectives and these changes.

The PR title and description focus on LP/MILP solution objects returning host memory instead of GPU memory, but the changes in this file are:

  • Adding PyArrow import
  • Changing type specifications from NumPy dtypes to PyArrow type constructors in routing dataset generation code

This file (utils_wrapper.pyx) handles routing dataset generation utilities, not LP/MILP solution objects. The changes appear to be part of a broader refactoring to use PyArrow types throughout the codebase rather than addressing the stated PR objective.

Please clarify:

  1. Are there other files in this PR that address the LP/MILP solution host memory changes?
  2. Is this file's refactoring an incidental cleanup, or is it related to the main objective?

106-107: Change from NumPy to PyArrow types is correct and compatible.

PyArrow DataType objects have a byte_width attribute for fixed-width types, and all types used in this codebase (pa.float32(), pa.int32(), pa.bool_(), pa.float64()) are fixed-width. The col_from_buf function correctly calls DataType.from_arrow(dtype) which accepts PyArrow types, and accesses dtype.byte_width which is available on all used types. The change is applied consistently across the codebase with no compatibility issues.

python/cuopt/cuopt/distance_engine/waypoint_matrix_wrapper.pyx (3)

27-27: LGTM! PyArrow import added for Arrow-backed column creation.

This import is used to specify Arrow types in the col_from_buf calls below (lines 99 and 106), aligning with the PR's goal of replacing buffer-as-buffer paths with PyArrow/pylibcudf-based column creation.


106-106: Consistent use of Arrow types for path buffer.

The Arrow type pa.int32() is appropriate for waypoint sequence indices, maintaining consistency with the sequence_offset buffer handling above.


98-100: Arrow type specification is correct.

The Cython wrapper explicitly instantiates waypoint_matrix_t[int, float] (line 32, 47), which means the C++ template parameter i_t = int (int32). The buffer allocation in C++ uses rmm::device_uvector<i_t>, resulting in an int32 buffer. The hardcoded pa.int32() at lines 98–100 correctly matches the underlying buffer dtype. No type mismatch will occur.

python/cuopt/cuopt/routing/vehicle_routing_wrapper.pyx (3)

23-23: LGTM! PyArrow import added for Arrow-backed column creation.

This import enables the use of Arrow types in the col_from_buf calls for routing solution data (lines 795-805), replacing the previous buffer-as-buffer approach.


801-806: Consistent Arrow type usage for unserviced and accepted data.

Both unserviced_nodes and accepted use pa.int32(), which is appropriate for node indices and boolean/integer acceptance flags. The pattern of wrapping col_from_buf results in cudf.Series._from_column is correct.


795-799: I need to find where these C++ device buffers are actually created and populated to determine their element types.

Let me search the C++ implementation for where these buffers are created and populated. Based on my verification efforts, I can now provide my assessment:

No changes needed—the type selections are appropriate and verified.

The Python code correctly uses pa.float64() for arrival_stamp and pa.int32() for route, truck_id, location, and type. Official cuOpt documentation shows arrival_stamp values as floating-point numbers (e.g., 0.0, 2.0, 4.0, 6.0), confirming that arrival_stamp is a floating-point type in the C++ layer. The other fields (route indices and IDs) are appropriately represented as 32-bit integers, matching the Arrow selections.

python/cuopt/cuopt/utilities/utils.py (1)

7-7: LGTM!

The pylibcudf import is necessary for the new buffer-to-column conversion implementation.

Comment on lines 56 to 58
from cuopt.utilities import InputValidationError

import pyarrow as pa

⚠️ Potential issue | 🔴 Critical

🧩 Analysis chain

Verify if the new imports are actually used.

The InputValidationError and pyarrow imports appear to be unused in this file. Consider removing them if they're not needed, or if they're intended for future use, add a comment explaining why they're present.

Run the following script to verify usage:


🏁 Script executed:

#!/bin/bash
# Check if InputValidationError and pyarrow (pa) are used in this file
echo "=== Checking InputValidationError usage ==="
rg -n 'InputValidationError' python/cuopt/cuopt/linear_programming/solver/solver_wrapper.pyx | grep -v '^56:'

echo "=== Checking pyarrow/pa usage ==="
rg -n '\bpa\.' python/cuopt/cuopt/linear_programming/solver/solver_wrapper.pyx

Length of output: 77



---

🏁 Script executed:

#!/bin/bash
# Secondary verification with broader patterns
echo "=== Checking for bare 'pa' reference (not just pa.something) ==="
rg -n '\bpa\b' python/cuopt/cuopt/linear_programming/solver/solver_wrapper.pyx | head -20

echo "=== Checking for any InputValidationError reference beyond import ==="
rg -n 'InputValidationError' python/cuopt/cuopt/linear_programming/solver/solver_wrapper.pyx

echo "=== Checking for pyarrow reference in any form ==="
rg -ni 'pyarrow' python/cuopt/cuopt/linear_programming/solver/solver_wrapper.pyx

Length of output: 285


Remove unused imports on lines 56 and 58.

The InputValidationError and pyarrow imports are not used anywhere in this file. Remove them to keep imports clean and reduce unnecessary dependencies.

🤖 Prompt for AI Agents
In python/cuopt/cuopt/linear_programming/solver/solver_wrapper.pyx around lines
56 to 58, the imports for InputValidationError and pyarrow are unused; remove
the two import lines (from cuopt.utilities import InputValidationError and
import pyarrow as pa) so they no longer appear in the file and run a quick
lint/flake8 check to ensure no other references remain.

Returns
-------
cudf.core.column.Column

⚠️ Potential issue | 🟠 Major

Update the docstring to reflect the new return type.

The docstring documents the return type as cudf.core.column.Column, but the function now returns cudf.Series (line 38).

Apply this diff to update the docstring:

-    cudf.core.column.Column
-        A cudf column built from the buffer
+    cudf.Series
+        A cudf Series built from the buffer
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
-    cudf.core.column.Column
+    cudf.Series
+        A cudf Series built from the buffer
🤖 Prompt for AI Agents
In python/cuopt/cuopt/utilities/utils.py around line 28, the function docstring
still lists the return type as `cudf.core.column.Column` but the function
actually returns a `cudf.Series`; update the docstring return type annotation to
`cudf.Series` (and adjust any related type references in the docstring text) so
the documentation matches the implementation.

Comment on lines 31 to 38
 col = plc.column.Column.from_rmm_buffer(
     buf,
-    dtype=dt,
-    size=buf.size // dt.itemsize,
     mask=None,
     offset=0,
     null_count=0,
-    children=(),
+    dtype=plc.types.DataType.from_arrow(dtype),
+    size=buf.size // dtype.byte_width,
+    children=[],
 )

 return cudf.Series.from_pylibcudf(col)

⚠️ Potential issue | 🔴 Critical

🧩 Analysis chain

Verify that all callers handle the Series return type and that dtype.byte_width is always available.

The implementation changed from returning a Column to returning a Series, which may impact callers. Additionally, the code assumes dtype.byte_width exists, which should be verified for all input types (numpy.dtype or type).

Run the following script to identify all callers of col_from_buf and verify they handle Series correctly:


🏁 Script executed:

#!/bin/bash
# Description: Find all usages of col_from_buf to verify compatibility with Series return type

# Search for col_from_buf calls
rg -n -C5 'col_from_buf\(' --type=py

Length of output: 685





🏁 Script executed:

#!/bin/bash
# Search for actual calls to col_from_buf (excluding the definition at line 16)
rg -n 'col_from_buf\s*\(' --type=py | grep -v 'def col_from_buf'

Length of output: 0


🏁 Script executed:

#!/bin/bash
# Search for any import or usage of col_from_buf to find callers
rg -n -B3 -A3 'col_from_buf' --type=py | grep -v 'def col_from_buf' | head -100

Length of output: 647


🏁 Script executed:

#!/bin/bash
# Check if dtype.byte_width is documented or used elsewhere
rg -n 'byte_width' --type=py

Length of output: 83


🏁 Script executed:

#!/bin/bash
# Look at the complete function implementation to understand the context better
cat python/cuopt/cuopt/utilities/utils.py | head -50

Length of output: 1350


🏁 Script executed:

#!/bin/bash
# Search for Series usage patterns and imports related to cudf
rg -n 'from_pylibcudf|Series' python/cuopt/cuopt/utilities/utils.py

Length of output: 45





🏁 Script executed:

#!/bin/bash
# Search for test files or test imports of col_from_buf
fd -e py -path '*/test*' | head -20

Length of output: 168


🏁 Script executed:

#!/bin/bash
# Search across entire repo for col_from_buf (not just Python)
rg 'col_from_buf' -g '!.git'

Length of output: 2349


🏁 Script executed:

#!/bin/bash
# Check what types have byte_width attribute - test with Python
python3 << 'EOF'
import numpy as np

# Test numpy.dtype
dtype_int32 = np.dtype('int32')
print(f"numpy.dtype('int32').byte_width: {hasattr(dtype_int32, 'byte_width')}")
print(f"numpy.dtype('int32').itemsize: {dtype_int32.itemsize}")

# Test other common types
dtype_float = np.dtype('float64')
print(f"numpy.dtype('float64').byte_width: {hasattr(dtype_float, 'byte_width')}")

# Test if arrow types have it
try:
    import pyarrow as pa
    arrow_type = pa.int32()
    print(f"pyarrow type has byte_width: {hasattr(arrow_type, 'byte_width')}")
except:
    print("pyarrow not available")
EOF

Length of output: 117


🏁 Script executed:

#!/bin/bash
# Look for docstring issues - check if Series vs Column return is consistent
rg -n -A10 -B2 'cudf.Series.from_pylibcudf' python/cuopt/ --type=py

Length of output: 840


🏁 Script executed:

#!/bin/bash
# Check cudf/pylibcudf usage patterns to understand intended return type
rg -n 'from_pylibcudf' python/cuopt/ --type=py -B2 -A2

Length of output: 256



---

🌐 Web query:

PyArrow DataType byte_width attribute

💡 Result:

  • pyarrow.DataType.byte_width is a documented attribute for fixed-width Arrow types and returns the number of bytes per value (e.g., pa.int64().byte_width == 8). [1]

  • It is meaningful for fixed-size types (integers, floats, timestamps, FixedSizeBinary, FixedSizeList, many extension types). For variable-width types (string, binary variable, list with variable-length items) byte width is not applicable. Example for FixedSizeBinary: pa.binary(3).byte_width == 3. [2]

Sources:
[1] pyarrow.DataType documentation (shows byte_width and int64 example).
[2] pyarrow.FixedSizeBinaryType documentation (shows byte_width example).


🌐 Web query:

numpy dtype byte_width vs itemsize

💡 Result:

Short answer: NumPy uses dtype.itemsize to mean the number of bytes per element (element size). "byte_width" is not a NumPy dtype attribute — it's used in other libraries (e.g., Apache Arrow / pyarrow) to describe the fixed byte size of a type. So use dtype.itemsize for NumPy; byte_width refers to Arrow-style type metadata. [1][2]

Sources:

  • NumPy dtype.itemsize (element size in bytes). [1]
  • pyarrow FixedSizeBinaryType / other Arrow types (have byte_width/bit_width attributes). [2]

Update the docstring to reflect actual return type and address dtype.byte_width limitation.

The function returns cudf.Series, not cudf.core.column.Column as documented. Additionally, dtype.byte_width does not exist for numpy.dtype (NumPy uses itemsize), but the docstring claims the function accepts numpy.dtype or type. All 16 callers in the codebase use PyArrow types (where byte_width exists), so there's no current breakage, but the documentation is misleading.

Fixes needed:

  1. Update docstring return type from "cudf.core.column.Column" to "cudf.Series"
  2. Clarify that dtype should be a PyArrow type, or handle numpy.dtype by using itemsize instead of byte_width
🤖 Prompt for AI Agents
In python/cuopt/cuopt/utilities/utils.py around lines 31 to 38, update the
docstring to state the function returns a cudf.Series (not
cudf.core.column.Column) and clarify dtype expectations; then fix the
dtype.byte_width assumption by detecting the dtype type: if dtype has attribute
byte_width (PyArrow type) use that, otherwise if dtype is a numpy.dtype use
dtype.itemsize, and compute size = buf.size // byte_width accordingly so both
PyArrow and numpy dtypes are supported (or explicitly document that only PyArrow
types are accepted if you prefer restricting rather than adding detection).

@rgsl888prabhu rgsl888prabhu removed request for a team and msarahan November 12, 2025 16:19
@rgsl888prabhu rgsl888prabhu changed the base branch from main to release/25.12 November 17, 2025 21:34
@rgsl888prabhu (Collaborator, Author)

@coderabbitai review

coderabbitai bot commented Nov 21, 2025

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

@tmckayus (Contributor)

I may have superseding changes for this, pending validation ....

tmckayus (Contributor) commented Nov 25, 2025

Propose bumping to the next milestone. This could even be closed and re-opened, given the superseding changes.

@chris-maes chris-maes modified the milestones: 25.12, 26.02 Nov 25, 2025
github-actions bot commented Dec 3, 2025

🔔 Hi @anandhkb @rgsl888prabhu, this pull request has had no activity for 7 days. Please update or let us know if it can be closed. Thank you!

If this is an "epic" issue, then please add the "epic" label to this issue.
If it is a PR and not ready for review, then please convert this to draft.
If you just want to switch off this notification, then use the "skip inactivity reminder" label.

4 similar comments

@rgsl888prabhu rgsl888prabhu changed the base branch from release/25.12 to main January 5, 2026 16:10
@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 1

Fix all issues with AI Agents 🤖
In @cpp/src/linear_programming/utilities/cython_solve.cu:
- Around line 148-159: Multiple sequential calls to cuopt::host_copy (which
invokes stream_view.synchronize() per call) cause repeated synchronization
barriers; instead, aggregate the device->host transfers for the PDLP warm start
fields and other solution arrays by issuing raft::copy (or equivalent
device->host memcpy) into host buffers without synchronizing each time, then
call a single stream/synchronization once after all copies; update the call
sites around the list of cuopt::host_copy(...) for
solution.get_primal_solution(), get_dual_solution(), get_reduced_cost(), and all
solution.get_pdlp_warm_start_data() members to use batched copies and one final
synchronize (referencing host_copy, cuopt::host_copy, raft::copy, and
get_pdlp_warm_start_data() to locate the code).
♻️ Duplicate comments (1)
cpp/src/linear_programming/utilities/cython_solve.cu (1)

146-159: Skip host copies in batch mode to avoid performance regression.

These unconditional host_copy operations for all PDLP warm-start vectors cause unnecessary device-to-host transfers in batch mode, where the data is never consumed. This is a significant performance regression for large LP problems solved in batch.

Wrap these copies so they only execute when !is_batch_mode.
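
One compact way to do that, sketched under the assumption (supported by the suggested diff above) that cuopt::host_copy on these fields yields a std::vector<double>, is a small helper defined above the initializer list:

// Hypothetical helper: skip the device->host transfer entirely in batch
// mode, where the warm-start fields are never consumed downstream.
auto maybe_host_copy = [&](auto const& device_vec) {
  return is_batch_mode ? std::vector<double>{} : cuopt::host_copy(device_vec);
};

// Each warm-start initializer then becomes, e.g.:
//   maybe_host_copy(solution.get_pdlp_warm_start_data().current_primal_solution_),

This avoids repeating the ternary nine times while leaving non-batch semantics unchanged.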

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between e735ee1 and 2880a7c.

📒 Files selected for processing (1)
  • cpp/src/linear_programming/utilities/cython_solve.cu
🧰 Additional context used
📓 Path-based instructions (4)
**/*.{cu,cuh}

📄 CodeRabbit inference engine (.github/.coderabbit_review_guide.md)

**/*.{cu,cuh}: Every CUDA kernel launch and memory operation must have error checking with CUDA_CHECK or equivalent verification
Avoid reinventing functionality already available in Thrust, CCCL, or RMM libraries; prefer standard library utilities over custom implementations

Files:

  • cpp/src/linear_programming/utilities/cython_solve.cu
**/*.cu

📄 CodeRabbit inference engine (.github/.coderabbit_review_guide.md)

**/*.cu: Verify race conditions and correctness of GPU kernel shared memory, atomics, and warp-level operations
Detect inefficient GPU kernel launches with low occupancy or poor memory access patterns; optimize for coalesced memory access and minimize warp divergence in hot paths

Files:

  • cpp/src/linear_programming/utilities/cython_solve.cu
**/*.{cu,cuh,cpp,hpp,h}

📄 CodeRabbit inference engine (.github/.coderabbit_review_guide.md)

**/*.{cu,cuh,cpp,hpp,h}: Track GPU device memory allocations and deallocations to prevent memory leaks; ensure cudaMalloc/cudaFree balance and cleanup of streams/events
Validate algorithm correctness in optimization logic: simplex pivots, branch-and-bound decisions, routing heuristics, and constraint/objective handling must produce correct results
Check numerical stability: prevent overflow/underflow, precision loss, division by zero/near-zero, and use epsilon comparisons for floating-point equality checks
Validate correct initialization of variable bounds, constraint coefficients, and algorithm state before solving; ensure reset when transitioning between algorithm phases (presolve, simplex, diving, crossover)
Ensure variables and constraints are accessed from the correct problem context (original vs presolve vs folded vs postsolve); verify index mapping consistency across problem transformations
For concurrent CUDA operations (barriers, async operations), explicitly create and manage dedicated streams instead of reusing the default stream; document stream lifecycle
Eliminate unnecessary host-device synchronization (cudaDeviceSynchronize) in hot paths that blocks GPU pipeline; use streams and events for async execution
Assess algorithmic complexity for large-scale problems (millions of variables/constraints); ensure O(n log n) or better complexity, not O(n²) or worse
Verify correct problem size checks before expensive GPU/CPU operations; prevent resource exhaustion on oversized problems
Identify assertions with overly strict numerical tolerances that fail on legitimate degenerate/edge cases (near-zero pivots, singular matrices, empty problems)
Ensure race conditions are absent in multi-GPU code and multi-threaded server implementations; verify proper synchronization of shared state
Refactor code duplication in solver components (3+ occurrences) into shared utilities; for GPU kernels, use templated device functions to avoid duplication
Check that hard-coded GPU de...

Files:

  • cpp/src/linear_programming/utilities/cython_solve.cu
**/*.{cu,cpp,hpp,h}

📄 CodeRabbit inference engine (.github/.coderabbit_review_guide.md)

Avoid inappropriate use of exceptions in performance-critical GPU operation paths; prefer error codes or CUDA error checking for latency-sensitive code

Files:

  • cpp/src/linear_programming/utilities/cython_solve.cu
🧠 Learnings (11)
📓 Common learnings
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.{cu,cuh,cpp,hpp,h} : Refactor code duplication in solver components (3+ occurrences) into shared utilities; for GPU kernels, use templated device functions to avoid duplication
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.{cu,cuh,cpp,hpp,h} : Eliminate unnecessary host-device synchronization (cudaDeviceSynchronize) in hot paths that blocks GPU pipeline; use streams and events for async execution
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.{cu,cuh,cpp,hpp,h} : Check that hard-coded GPU device IDs and resource limits are made configurable; abstract multi-backend support for different CUDA versions
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Reduce tight coupling between solver components (presolve, simplex, basis, barrier); increase modularity and reusability of optimization algorithms
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.{cu,cuh,cpp,hpp,h} : Ensure variables and constraints are accessed from the correct problem context (original vs presolve vs folded vs postsolve); verify index mapping consistency across problem transformations
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.{cu,cuh,cpp,hpp,h} : Refactor code duplication in solver components (3+ occurrences) into shared utilities; for GPU kernels, use templated device functions to avoid duplication

Applied to files:

  • cpp/src/linear_programming/utilities/cython_solve.cu
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.{cu,cuh,cpp,hpp,h} : Ensure variables and constraints are accessed from the correct problem context (original vs presolve vs folded vs postsolve); verify index mapping consistency across problem transformations

Applied to files:

  • cpp/src/linear_programming/utilities/cython_solve.cu
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.{cu,cuh,cpp,hpp,h} : Eliminate unnecessary host-device synchronization (cudaDeviceSynchronize) in hot paths that blocks GPU pipeline; use streams and events for async execution

Applied to files:

  • cpp/src/linear_programming/utilities/cython_solve.cu
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.{cu,cuh,cpp,hpp,h} : Validate algorithm correctness in optimization logic: simplex pivots, branch-and-bound decisions, routing heuristics, and constraint/objective handling must produce correct results

Applied to files:

  • cpp/src/linear_programming/utilities/cython_solve.cu
📚 Learning: 2025-12-04T20:09:09.264Z
Learnt from: chris-maes
Repo: NVIDIA/cuopt PR: 602
File: cpp/src/linear_programming/solve.cu:732-742
Timestamp: 2025-12-04T20:09:09.264Z
Learning: In cpp/src/linear_programming/solve.cu, the barrier solver does not currently return INFEASIBLE or UNBOUNDED status. It only returns OPTIMAL, TIME_LIMIT, NUMERICAL_ISSUES, or CONCURRENT_LIMIT.

Applied to files:

  • cpp/src/linear_programming/utilities/cython_solve.cu
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Reduce tight coupling between solver components (presolve, simplex, basis, barrier); increase modularity and reusability of optimization algorithms

Applied to files:

  • cpp/src/linear_programming/utilities/cython_solve.cu
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.{cu,cuh,cpp,hpp,h} : Check that hard-coded GPU device IDs and resource limits are made configurable; abstract multi-backend support for different CUDA versions

Applied to files:

  • cpp/src/linear_programming/utilities/cython_solve.cu
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.{cu,cuh,cpp,hpp,h} : Validate correct initialization of variable bounds, constraint coefficients, and algorithm state before solving; ensure reset when transitioning between algorithm phases (presolve, simplex, diving, crossover)

Applied to files:

  • cpp/src/linear_programming/utilities/cython_solve.cu
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.cu : Detect inefficient GPU kernel launches with low occupancy or poor memory access patterns; optimize for coalesced memory access and minimize warp divergence in hot paths

Applied to files:

  • cpp/src/linear_programming/utilities/cython_solve.cu
📚 Learning: 2025-10-22T14:25:22.899Z
Learnt from: aliceb-nv
Repo: NVIDIA/cuopt PR: 527
File: cpp/src/mip/diversity/lns/rins.cu:167-175
Timestamp: 2025-10-22T14:25:22.899Z
Learning: In MIP (Mixed Integer Programming) problems in the cuOPT codebase, `n_integer_vars == 0` is impossible by definition—MIP problems must have at least one integer variable. If there are no integer variables, it would be a pure Linear Programming (LP) problem, not a MIP problem.

Applied to files:

  • cpp/src/linear_programming/utilities/cython_solve.cu
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (10)
  • GitHub Check: wheel-build-cuopt-mps-parser / 13.0.2, 3.12, arm64, rockylinux8
  • GitHub Check: wheel-build-cuopt-mps-parser / 13.0.2, 3.10, arm64, rockylinux8
  • GitHub Check: wheel-build-cuopt-mps-parser / 13.0.2, 3.13, arm64, rockylinux8
  • GitHub Check: wheel-build-cuopt-mps-parser / 13.0.2, 3.13, amd64, rockylinux8
  • GitHub Check: wheel-build-cuopt-mps-parser / 13.0.2, 3.11, arm64, rockylinux8
  • GitHub Check: wheel-build-cuopt-mps-parser / 13.0.2, 3.11, amd64, rockylinux8
  • GitHub Check: wheel-build-cuopt-mps-parser / 13.0.2, 3.12, amd64, rockylinux8
  • GitHub Check: wheel-build-cuopt-mps-parser / 13.0.2, 3.10, amd64, rockylinux8
  • GitHub Check: wheel-build-cuopt-sh-client / 13.0.2, 3.10, amd64, rockylinux8
  • GitHub Check: checks / check-style
🔇 Additional comments (1)
cpp/src/linear_programming/utilities/cython_solve.cu (1)

201-203: The review comment is based on an inaccurate assumption. Both call_solve_lp and call_solve_mip unconditionally perform host_copy() on their solutions (lines 147-180 and line 203, respectively). The is_batch_mode parameter in call_solve_lp is passed to the underlying solve_lp() solver but does not control host copy behavior in the wrapper. There is no deferred host copy optimization for LP batch mode to contrast with MIP, making the suggested optimization unsupported by the actual code behavior.

Likely an incorrect or invalid review comment.

Comment on lines +148 to +159
cuopt::host_copy(solution.get_primal_solution()),
cuopt::host_copy(solution.get_dual_solution()),
cuopt::host_copy(solution.get_reduced_cost()),
cuopt::host_copy(solution.get_pdlp_warm_start_data().current_primal_solution_),
cuopt::host_copy(solution.get_pdlp_warm_start_data().current_dual_solution_),
cuopt::host_copy(solution.get_pdlp_warm_start_data().initial_primal_average_),
cuopt::host_copy(solution.get_pdlp_warm_start_data().initial_dual_average_),
cuopt::host_copy(solution.get_pdlp_warm_start_data().current_ATY_),
cuopt::host_copy(solution.get_pdlp_warm_start_data().sum_primal_solutions_),
cuopt::host_copy(solution.get_pdlp_warm_start_data().sum_dual_solutions_),
cuopt::host_copy(solution.get_pdlp_warm_start_data().last_restart_duality_gap_primal_solution_),
cuopt::host_copy(solution.get_pdlp_warm_start_data().last_restart_duality_gap_dual_solution_),

⚠️ Potential issue | 🟡 Minor

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash
# Find host_copy implementation
find . -name "*.h" -o -name "*.hpp" -o -name "*.cu" -o -name "*.cpp" | head -20
echo "---"
# Search for host_copy definition
rg -nP 'inline.*host_copy|^[[:space:]]*host_copy.*{' --type=cpp --type=h -A 10 | head -50

Repository: NVIDIA/cuopt

Length of output: 2174


🏁 Script executed:

#!/bin/bash
# Get full context of the file around lines 148-159
wc -l cpp/src/linear_programming/utilities/cython_solve.cu

Repository: NVIDIA/cuopt

Length of output: 112


🏁 Script executed:

#!/bin/bash
# Show context around lines 148-159
sed -n '130,175p' cpp/src/linear_programming/utilities/cython_solve.cu

Repository: NVIDIA/cuopt

Length of output: 2788


🏁 Script executed:

#!/bin/bash
# Search for host_copy in include files
find cpp/include -name "*.hpp" -o -name "*.h" | xargs grep -l "host_copy"

Repository: NVIDIA/cuopt

Length of output: 40


🏁 Script executed:

#!/bin/bash
# Check for CUDA error checking patterns used in the file
rg -n 'CUDA_CHECK|RAFT_CUDA_TRY|cudaGetLastError|RAFT_CHECK_CUDA|host_copy' cpp/src/linear_programming/utilities/cython_solve.cu | head -30

Repository: NVIDIA/cuopt

Length of output: 1252


🏁 Script executed:

#!/bin/bash
# Get full host_copy implementation
cat -n cpp/src/utilities/copy_helpers.hpp | head -200

Repository: NVIDIA/cuopt

Length of output: 6185


🏁 Script executed:

#!/bin/bash
# Check if host_copy uses error checking internally or is wrapped with error checks
rg -n 'host_copy' cpp/src/utilities/copy_helpers.hpp -B 5 -A 15 | head -100

Repository: NVIDIA/cuopt

Length of output: 3206


🏁 Script executed:

#!/bin/bash
# Look for the default host_copy overload (non-bool version)
rg -n 'template.*host_copy|^[[:space:]]*host_copy.*\{' cpp/src/utilities/copy_helpers.hpp -A 15

Repository: NVIDIA/cuopt

Length of output: 38


🏁 Script executed:

#!/bin/bash
# Check if there's error handling around the solve call
sed -n '140,180p' cpp/src/linear_programming/utilities/cython_solve.cu

Repository: NVIDIA/cuopt

Length of output: 2594


🏁 Script executed:

#!/bin/bash
# Check for async/batched copy patterns in similar code
rg -n 'cudaMemcpyAsync|rmm::device_uvector|thrust::copy' cpp/src/linear_programming/utilities/ --type=cu --type=cpp -C 3 | head -60

Repository: NVIDIA/cuopt

Length of output: 82


🏁 Script executed:

#!/bin/bash
# Check if raft::copy has error checking
rg -n 'raft::copy' cpp/include/cuopt/ --type=cpp --type=h | head -5

Repository: NVIDIA/cuopt

Length of output: 38


🏁 Script executed:

#!/bin/bash
# Verify if this is end-of-solve (not hot path) - check what comes after line 180
sed -n '160,180p' cpp/src/linear_programming/utilities/cython_solve.cu

Repository: NVIDIA/cuopt

Length of output: 1281


🏁 Script executed:

#!/bin/bash
# Check if raft library provides CUDA error checking
find . -type f -name "*.hpp" -o -name "*.h" | xargs grep -l "raft::copy" | head -3

Repository: NVIDIA/cuopt

Length of output: 166


🏁 Script executed:

#!/bin/bash
# Check if there are any CUDA_CHECK or error handlers wrapping these calls
sed -n '130,185p' cpp/src/linear_programming/utilities/cython_solve.cu | grep -i 'cuda\|check\|try\|error'

Repository: NVIDIA/cuopt

Length of output: 311


🏁 Script executed:

#!/bin/bash
# Verify the exact context - is this function expected to return immediately after these copies?
sed -n '175,200p' cpp/src/linear_programming/utilities/cython_solve.cu

Repository: NVIDIA/cuopt

Length of output: 1177


Consider batching device-to-host transfers to reduce redundant synchronizations.

The host_copy function includes an implicit stream_view.synchronize() (line 123 of copy_helpers.hpp). With 12 sequential host_copy calls here, this creates 12 synchronization barriers—one per call—instead of a single barrier. While raft::copy provides built-in CUDA error checking, consolidating the copies into a batched operation with one synchronization at the end would be more efficient:

// Instead of 12 separate host_copy calls with 12 syncs each,
// batch the raft::copy calls first, then synchronize once
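
A sketch of that batched pattern, assuming these device vectors expose data() and size(), that raft::copy has the (dst, src, len, stream) signature from RAFT's cudart utilities, and that stream_view is the solver's stream (names beyond those in the surrounding code are illustrative):

// Sketch: enqueue all device->host copies on one stream, then block once.
auto const& ws = solution.get_pdlp_warm_start_data();

std::vector<double> current_primal(ws.current_primal_solution_.size());
std::vector<double> current_dual(ws.current_dual_solution_.size());
// ... one pre-sized host vector per remaining warm-start field ...

raft::copy(current_primal.data(), ws.current_primal_solution_.data(),
           ws.current_primal_solution_.size(), stream_view);
raft::copy(current_dual.data(), ws.current_dual_solution_.data(),
           ws.current_dual_solution_.size(), stream_view);
// ... remaining fields ...

stream_view.synchronize();  // one barrier instead of one per host_copy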
🤖 Prompt for AI Agents
In @cpp/src/linear_programming/utilities/cython_solve.cu around lines 148-159,
Multiple sequential calls to cuopt::host_copy (which invokes
stream_view.synchronize() per call) cause repeated synchronization barriers;
instead, aggregate the device->host transfers for the PDLP warm start fields and
other solution arrays by issuing raft::copy (or equivalent device->host memcpy)
into host buffers without synchronizing each time, then call a single
stream/synchronization once after all copies; update the call sites around the
list of cuopt::host_copy(...) for solution.get_primal_solution(),
get_dual_solution(), get_reduced_cost(), and all
solution.get_pdlp_warm_start_data() members to use batched copies and one final
synchronize (referencing host_copy, cuopt::host_copy, raft::copy, and
get_pdlp_warm_start_data() to locate the code).

rgsl888prabhu and others added 3 commits January 5, 2026 16:24
Adding best practices and elements to help users interact with the repo.

## Issue
closes NVIDIA#690

Authors:
  - Ramakrishnap (https://github.com/rgsl888prabhu)

Approvers:
  - Gil Forsyth (https://github.com/gforsyth)
  - Trevor McKay (https://github.com/tmckayus)

URL: NVIDIA#692
@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 1

Fix all issues with AI Agents 🤖
In @.github/SECURITY.md:
- Around lines 14-15: The nested list item "- Product/Driver name and
version/branch that contains the vulnerability" is over-indented; change its
indentation from four spaces to two spaces so it is a proper sub-item under
"Please include the following information:" (i.e., align the hyphen two spaces
in from the parent list line) to fix the Markdown list nesting.
♻️ Duplicate comments (1)
python/cuopt/cuopt/linear_programming/solver/solver_wrapper.pyx (1)

56-58: Remove unused imports.

These imports are not used anywhere in the file and should be removed.

🔎 Proposed fix
-from cuopt.utilities import InputValidationError
-
-import pyarrow as pa
📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 2880a7c and 0e66899.

📒 Files selected for processing (14)
  • .github/AGENTS.md
  • .github/CODE_OF_CONDUCT.md
  • .github/ISSUE_TEMPLATE/bug_report.md
  • .github/ISSUE_TEMPLATE/documentation-request.md
  • .github/ISSUE_TEMPLATE/feature_request.md
  • .github/ISSUE_TEMPLATE/submit-question.md
  • .github/SECURITY.md
  • README.md
  • ci/release/update-version.sh
  • cpp/include/cuopt/linear_programming/utilities/cython_solve.hpp
  • cpp/src/linear_programming/utilities/cython_solve.cu
  • python/cuopt/cuopt/distance_engine/waypoint_matrix_wrapper.pyx
  • python/cuopt/cuopt/linear_programming/solver/solver.pxd
  • python/cuopt/cuopt/linear_programming/solver/solver_wrapper.pyx
✅ Files skipped from review due to trivial changes (2)
  • .github/CODE_OF_CONDUCT.md
  • .github/ISSUE_TEMPLATE/documentation-request.md
🚧 Files skipped from review as they are similar to previous changes (2)
  • cpp/src/linear_programming/utilities/cython_solve.cu
  • python/cuopt/cuopt/distance_engine/waypoint_matrix_wrapper.pyx
🧰 Additional context used
📓 Path-based instructions (5)
**/*.{cu,cuh,cpp,hpp,h}

📄 CodeRabbit inference engine (.github/.coderabbit_review_guide.md)

**/*.{cu,cuh,cpp,hpp,h}: Track GPU device memory allocations and deallocations to prevent memory leaks; ensure cudaMalloc/cudaFree balance and cleanup of streams/events
Validate algorithm correctness in optimization logic: simplex pivots, branch-and-bound decisions, routing heuristics, and constraint/objective handling must produce correct results
Check numerical stability: prevent overflow/underflow, precision loss, division by zero/near-zero, and use epsilon comparisons for floating-point equality checks
Validate correct initialization of variable bounds, constraint coefficients, and algorithm state before solving; ensure reset when transitioning between algorithm phases (presolve, simplex, diving, crossover)
Ensure variables and constraints are accessed from the correct problem context (original vs presolve vs folded vs postsolve); verify index mapping consistency across problem transformations
For concurrent CUDA operations (barriers, async operations), explicitly create and manage dedicated streams instead of reusing the default stream; document stream lifecycle
Eliminate unnecessary host-device synchronization (cudaDeviceSynchronize) in hot paths that blocks GPU pipeline; use streams and events for async execution
Assess algorithmic complexity for large-scale problems (millions of variables/constraints); ensure O(n log n) or better complexity, not O(n²) or worse
Verify correct problem size checks before expensive GPU/CPU operations; prevent resource exhaustion on oversized problems
Identify assertions with overly strict numerical tolerances that fail on legitimate degenerate/edge cases (near-zero pivots, singular matrices, empty problems)
Ensure race conditions are absent in multi-GPU code and multi-threaded server implementations; verify proper synchronization of shared state
Refactor code duplication in solver components (3+ occurrences) into shared utilities; for GPU kernels, use templated device functions to avoid duplication
Check that hard-coded GPU de...

Files:

  • cpp/include/cuopt/linear_programming/utilities/cython_solve.hpp
**/*.{h,hpp,py}

📄 CodeRabbit inference engine (.github/.coderabbit_review_guide.md)

Verify C API does not break ABI stability (no struct layout changes, field reordering); maintain backward compatibility in Python and server APIs with deprecation warnings

Files:

  • cpp/include/cuopt/linear_programming/utilities/cython_solve.hpp
**/*.{cpp,hpp,h}

📄 CodeRabbit inference engine (.github/.coderabbit_review_guide.md)

**/*.{cpp,hpp,h}: Check for unclosed file handles when reading MPS/QPS problem files; ensure RAII patterns or proper cleanup in exception paths
Validate input sanitization to prevent buffer overflows and resource exhaustion attacks; avoid unsafe deserialization of problem files
Prevent thread-unsafe use of global and static variables; use proper mutex/synchronization in server code accessing shared solver state

Files:

  • cpp/include/cuopt/linear_programming/utilities/cython_solve.hpp
**/*.{cu,cpp,hpp,h}

📄 CodeRabbit inference engine (.github/.coderabbit_review_guide.md)

Avoid inappropriate use of exceptions in performance-critical GPU operation paths; prefer error codes or CUDA error checking for latency-sensitive code

Files:

  • cpp/include/cuopt/linear_programming/utilities/cython_solve.hpp
cpp/include/cuopt/**/*

⚙️ CodeRabbit configuration file

cpp/include/cuopt/**/*: For public header files (C++ API):

  • Check if new public functions/classes have documentation comments (Doxygen format)
  • Flag API changes that may need corresponding docs/ updates
  • Verify parameter descriptions match actual types/behavior
  • Suggest documenting thread-safety, GPU requirements, and numerical behavior
  • For breaking changes, recommend updating docs and migration guides

Files:

  • cpp/include/cuopt/linear_programming/utilities/cython_solve.hpp
🧠 Learnings (21)
📓 Common learnings
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Reduce tight coupling between solver components (presolve, simplex, basis, barrier); increase modularity and reusability of optimization algorithms
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.{cu,cuh,cpp,hpp,h} : Refactor code duplication in solver components (3+ occurrences) into shared utilities; for GPU kernels, use templated device functions to avoid duplication
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Reduce tight coupling between solver components (presolve, simplex, basis, barrier); increase modularity and reusability of optimization algorithms

Applied to files:

  • python/cuopt/cuopt/linear_programming/solver/solver_wrapper.pyx
  • python/cuopt/cuopt/linear_programming/solver/solver.pxd
  • cpp/include/cuopt/linear_programming/utilities/cython_solve.hpp
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.{cu,cuh,cpp,hpp,h} : Refactor code duplication in solver components (3+ occurrences) into shared utilities; for GPU kernels, use templated device functions to avoid duplication

Applied to files:

  • python/cuopt/cuopt/linear_programming/solver/solver_wrapper.pyx
  • python/cuopt/cuopt/linear_programming/solver/solver.pxd
  • cpp/include/cuopt/linear_programming/utilities/cython_solve.hpp
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.{cu,cuh,cpp,hpp,h} : Validate algorithm correctness in optimization logic: simplex pivots, branch-and-bound decisions, routing heuristics, and constraint/objective handling must produce correct results

Applied to files:

  • .github/AGENTS.md
  • cpp/include/cuopt/linear_programming/utilities/cython_solve.hpp
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.cu : Detect inefficient GPU kernel launches with low occupancy or poor memory access patterns; optimize for coalesced memory access and minimize warp divergence in hot paths

Applied to files:

  • .github/AGENTS.md
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.{cu,cuh,cpp,hpp,h} : Assess algorithmic complexity for large-scale problems (millions of variables/constraints); ensure O(n log n) or better complexity, not O(n²) or worse

Applied to files:

  • .github/AGENTS.md
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.{cu,cuh} : Avoid reinventing functionality already available in Thrust, CCCL, or RMM libraries; prefer standard library utilities over custom implementations

Applied to files:

  • .github/AGENTS.md
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.{cu,cpp,hpp,h} : Avoid inappropriate use of exceptions in performance-critical GPU operation paths; prefer error codes or CUDA error checking for latency-sensitive code

Applied to files:

  • .github/AGENTS.md
  • cpp/include/cuopt/linear_programming/utilities/cython_solve.hpp
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.{cu,cuh,cpp,hpp,h} : Check that hard-coded GPU device IDs and resource limits are made configurable; abstract multi-backend support for different CUDA versions

Applied to files:

  • .github/AGENTS.md
  • cpp/include/cuopt/linear_programming/utilities/cython_solve.hpp
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.{cu,cuh,cpp,hpp,h} : Verify error propagation from CUDA to user-facing APIs is complete; ensure CUDA errors are caught and mapped to meaningful user error codes

Applied to files:

  • .github/AGENTS.md
  • cpp/include/cuopt/linear_programming/utilities/cython_solve.hpp
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.{cu,cuh,cpp,hpp,h} : Validate correct initialization of variable bounds, constraint coefficients, and algorithm state before solving; ensure reset when transitioning between algorithm phases (presolve, simplex, diving, crossover)

Applied to files:

  • .github/AGENTS.md
  • python/cuopt/cuopt/linear_programming/solver/solver.pxd
  • cpp/include/cuopt/linear_programming/utilities/cython_solve.hpp
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*benchmark*.{cpp,cu,py} : Include performance benchmarks and regression detection for GPU operations; verify near real-time performance on million-variable problems

Applied to files:

  • .github/AGENTS.md
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.{cu,cuh,cpp,hpp,h} : Verify correct problem size checks before expensive GPU/CPU operations; prevent resource exhaustion on oversized problems

Applied to files:

  • .github/AGENTS.md
  • cpp/include/cuopt/linear_programming/utilities/cython_solve.hpp
📚 Learning: 2025-10-22T14:25:22.899Z
Learnt from: aliceb-nv
Repo: NVIDIA/cuopt PR: 527
File: cpp/src/mip/diversity/lns/rins.cu:167-175
Timestamp: 2025-10-22T14:25:22.899Z
Learning: In MIP (Mixed Integer Programming) problems in the cuOPT codebase, `n_integer_vars == 0` is impossible by definition—MIP problems must have at least one integer variable. If there are no integer variables, it would be a pure Linear Programming (LP) problem, not a MIP problem.

Applied to files:

  • README.md
📚 Learning: 2025-12-04T20:09:09.264Z
Learnt from: chris-maes
Repo: NVIDIA/cuopt PR: 602
File: cpp/src/linear_programming/solve.cu:732-742
Timestamp: 2025-12-04T20:09:09.264Z
Learning: In cpp/src/linear_programming/solve.cu, the barrier solver does not currently return INFEASIBLE or UNBOUNDED status. It only returns OPTIMAL, TIME_LIMIT, NUMERICAL_ISSUES, or CONCURRENT_LIMIT.

Applied to files:

  • python/cuopt/cuopt/linear_programming/solver/solver.pxd
  • cpp/include/cuopt/linear_programming/utilities/cython_solve.hpp
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.{cu,cuh,cpp,hpp,h} : Ensure variables and constraints are accessed from the correct problem context (original vs presolve vs folded vs postsolve); verify index mapping consistency across problem transformations

Applied to files:

  • cpp/include/cuopt/linear_programming/utilities/cython_solve.hpp
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.{cu,cuh,cpp,hpp,h} : Track GPU device memory allocations and deallocations to prevent memory leaks; ensure cudaMalloc/cudaFree balance and cleanup of streams/events

Applied to files:

  • cpp/include/cuopt/linear_programming/utilities/cython_solve.hpp
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.{cu,cuh,cpp,hpp,h} : Ensure race conditions are absent in multi-GPU code and multi-threaded server implementations; verify proper synchronization of shared state

Applied to files:

  • cpp/include/cuopt/linear_programming/utilities/cython_solve.hpp
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.cu : Verify race conditions and correctness of GPU kernel shared memory, atomics, and warp-level operations

Applied to files:

  • cpp/include/cuopt/linear_programming/utilities/cython_solve.hpp
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*test*.{cpp,cu,py} : Ensure test isolation: prevent GPU state, cached memory, and global variables from leaking between test cases; verify each test independently initializes its environment

Applied to files:

  • cpp/include/cuopt/linear_programming/utilities/cython_solve.hpp
📚 Learning: 2025-11-25T10:20:49.822Z
Learnt from: CR
Repo: NVIDIA/cuopt PR: 0
File: .github/.coderabbit_review_guide.md:0-0
Timestamp: 2025-11-25T10:20:49.822Z
Learning: Applies to **/*.{cu,cuh,cpp,hpp,h} : For concurrent CUDA operations (barriers, async operations), explicitly create and manage dedicated streams instead of reusing the default stream; document stream lifecycle

Applied to files:

  • cpp/include/cuopt/linear_programming/utilities/cython_solve.hpp
🪛 markdownlint-cli2 (0.18.1)
.github/AGENTS.md

4-4: Blank line inside blockquote

(MD028, no-blanks-blockquote)


19-19: Fenced code blocks should have a language specified

(MD040, fenced-code-language)

.github/SECURITY.md

15-15: Unordered list indentation
Expected: 2; Actual: 4

(MD007, ul-indent)

🔇 Additional comments (13)
ci/release/update-version.sh (3)

3-3: LGTM! Copyright year updated appropriately.

The copyright year has been updated to 2026, which aligns with the current date.


134-136: Good addition to automate version badge updates.

This change ensures the README version badge stays synchronized with version updates, reducing manual maintenance overhead.


138-138: LGTM! Nightly workflow uses short version tag appropriately.

The nightly workflow correctly uses NEXT_SHORT_TAG (major.minor format) rather than the full version, which is appropriate for tracking development branches.

README.md (1)

4-10: All badge URLs and identifiers are correct and accessible.

The badge additions are properly configured:

  • The hardcoded version "26.02.00" is automatically maintained by the update script (ci/release/update-version.sh line 135)
  • The Colab notebook path exists and is accessible
  • The NVIDIA Launchable ID is valid and active
  • All documentation, Docker Hub, and resources URLs are accessible (HTTP 200)
.github/ISSUE_TEMPLATE/bug_report.md (1)

2-2: Template name update aligns with repo conventions.

The emoji prefix improves visual distinction in GitHub's issue template picker and is consistent with updates to related templates.

.github/ISSUE_TEMPLATE/submit-question.md (1)

2-2: Template name update aligns with repo conventions.

The emoji prefix improves visual distinction in GitHub's issue template picker, consistent with the pattern applied across related templates.

.github/ISSUE_TEMPLATE/feature_request.md (1)

2-2: LGTM!

The rocket emoji adds visual appeal to the feature request template.

.github/AGENTS.md (1)

1-156: LGTM!

This documentation provides valuable context for AI coding assistants working with the cuOpt codebase. The content is comprehensive and well-structured.

Note: The static analysis hints about markdown formatting (blank line in blockquote, missing language for fenced code) can be safely ignored—they're overly strict for this type of documentation.

python/cuopt/cuopt/linear_programming/solver/solver_wrapper.pyx (3)

300-306: LGTM!

The MIP solution extraction correctly handles the transition from GPU device buffers to host vectors. The .copy() call is essential to avoid dangling pointers after the C++ solver_ret_t is destroyed.
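
To make the ownership hazard concrete, here is the same issue sketched in C++ terms (the real code handles this on the Python side via np.asarray(...).copy(); the struct and names below are illustrative, and std::span assumes C++20):

```cpp
#include <span>
#include <vector>

struct solver_ret_t {             // stand-in for the real return struct
  std::vector<double> solution_;
};

std::vector<double> extract_solution()
{
  solver_ret_t ret{{1.0, 2.0, 3.0}};

  std::span<const double> view{ret.solution_};  // non-owning, like np.asarray(...)
  (void)view;  // returning `view` would dangle: it points into `ret`, destroyed below

  return {ret.solution_.begin(), ret.solution_.end()};  // owning copy, like .copy()
}
```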


342-368: LGTM!

The LP solution extraction follows the same safe pattern as MIP, with proper empty-vector handling and memory copying.


384-470: LGTM!

The warm start data extraction consistently applies the safe extraction pattern across all 9 vectors. The .copy() operations are necessary for memory safety, even though they add overhead.

python/cuopt/cuopt/linear_programming/solver/solver.pxd (1)

123-135: LGTM!

The Cython declarations correctly reflect the C++ header changes from device buffers to host vectors.

Also applies to: 158-158

cpp/include/cuopt/linear_programming/utilities/cython_solve.hpp (1)

27-64: Add Doxygen documentation comments to struct definitions.

These internal Cython interface structs lack documentation comments while their wrapper functions (call_solve_lp, call_solve_mip) do. Add brief Doxygen-style comments explaining the purpose and key fields of linear_programming_ret_t and mip_ret_t for maintainability, even though they're internal to the Python bindings.

Likely an incorrect or invalid review comment.
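
For reference, a minimal sketch of the Doxygen style being suggested, using field names from the PR walkthrough; the actual structs carry more members than shown:

```cpp
#include <vector>

/**
 * @brief Host-side LP solve results handed back to the Cython layer.
 *
 * Solution arrays are copied from device to host before return, so the
 * struct owns its data and can be read without a GPU context.
 */
struct linear_programming_ret_t {
  std::vector<double> primal_solution_;  ///< Primal variable values
  std::vector<double> dual_solution_;    ///< Dual values per constraint
  double initial_step_size_;             ///< PDLP warm start scalar
  // ... remaining solution and warm start fields ...
};
```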

Comment on lines +14 to +15
- Please include the following information:
    - Product/Driver name and version/branch that contains the vulnerability

⚠️ Potential issue | 🟡 Minor

Fix markdown list indentation.

Line 15 has incorrect indentation for a nested list item. Markdown expects 2 spaces of indentation for sub-items, not 4.

🔎 Proposed fix for list indentation
 - Please include the following information:
-    - Product/Driver name and version/branch that contains the vulnerability
+  - Product/Driver name and version/branch that contains the vulnerability
🧰 Tools
🪛 markdownlint-cli2 (0.18.1)

15-15: Unordered list indentation
Expected: 2; Actual: 4

(MD007, ul-indent)


@rgsl888prabhu
Collaborator Author

Closing this issue as it overlaps with the remote execution work.


Labels

improvement (Improves an existing functionality), non-breaking (Introduces a non-breaking change)

5 participants