Commit 8c75abf

10.12.1
1 parent d3fe6c4 commit 8c75abf

File tree

4 files changed: +19 -15 lines changed

API_REFERENCE_FOR_REGRESSION.md

Lines changed: 1 addition & 1 deletion

@@ -50,7 +50,7 @@ Limits 1) the number of terms already in the model that can be considered as int
 Specifies the variance power when ***loss_function*** is "tweedie". Specifies a dispersion parameter when ***loss_function*** is "negative_binomial", "cauchy" or "weibull".

 #### validation_tuning_metric (default = "default")
-Specifies which metric to use for validating the model and tuning ***m***. The model will try to minimize the validation metric. Available options are "default" (using the same methodology as when calculating the training error), "mse", "mae", "negative_gini" (normalized), "group_mse", "group_mse_by_prediction", "neg_top_quantile_mean_response", "bottom_quantile_mean_response" and "custom_function". The default is often a choice that fits well with respect to the ***loss_function*** chosen. However, if you want to use ***loss_function*** or ***dispersion_parameter*** as tuning parameters then the default is not suitable. "group_mse" requires that the "group" argument in the ***fit*** method is provided. "group_mse_by_prediction" groups predictions by up to ***group_mse_by_prediction_bins*** groups and calculates groupwise mse. "neg_top_quantile_mean_response" calculates the negative of the sample weighted mean response for observations with predictions in the top quantile (as specified by the ***quantile*** parameter). For example, if ***quantile*** is 0.95, this metric will be the negative of the sample weighted mean response for the 5% of observations with the highest predictions. "bottom_quantile_mean_response" calculates the sample weighted mean response for observations with predictions in the bottom quantile (as specified by the ***quantile*** parameter). For example, if ***quantile*** is 0.05, this metric will be the sample weighted mean response for the 5% of observations with the lowest predictions. For "custom_function" see ***calculate_custom_validation_error_function*** below.
+Specifies which metric to use for validating the model and tuning ***m***. The model will try to minimize the validation metric. Available options are "default" (using the same methodology as when calculating the training error), "mse", "mae", "negative_gini" (normalized), "group_mse", "group_mse_by_prediction", "neg_top_quantile_mean_response", "bottom_quantile_mean_response" and "custom_function". The default is often a choice that fits well with respect to the ***loss_function*** chosen. However, if you want to use ***loss_function*** or ***dispersion_parameter*** as tuning parameters then the default is not suitable. "group_mse" requires that the "group" argument in the ***fit*** method is provided. "group_mse_by_prediction" groups predictions by up to ***group_mse_by_prediction_bins*** groups and calculates groupwise mse. "neg_top_quantile_mean_response" calculates the negative of the sample weighted mean response for observations with predictions in the top quantile (as specified by the ***quantile*** parameter). For example, if ***quantile*** is 0.95, this metric will be the negative of the sample weighted mean response for the 5% of observations with the highest predictions. "bottom_quantile_mean_response" calculates the sample weighted mean response for observations with predictions in the bottom quantile (as specified by the ***quantile*** parameter). For example, if ***quantile*** is 0.05, this metric will be the sample weighted mean response for the 5% of observations with the lowest predictions. For "custom_function" see ***calculate_custom_validation_error_function*** below. Please note that for non-default values a significantly higher ***early_stopping_rounds*** than the default of 200 might be needed.

 #### quantile (default = 0.5)
 Specifies the quantile to use when ***loss_function*** is "quantile" or when ***validation_tuning_metric*** is "neg_top_quantile_mean_response" or "bottom_quantile_mean_response".
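
Below is a minimal usage sketch illustrating the documented parameters (it is not part of this commit). It assumes the Python package installs as `aplr` and exposes `APLRRegressor` with constructor arguments named as in the API reference above (`validation_tuning_metric`, `quantile`, `early_stopping_rounds`); the synthetic data and the chosen values are purely illustrative.

```python
import numpy as np
from aplr import APLRRegressor  # assumed Python entry point for the aplr package

# Illustrative synthetic data with a strictly positive response, suitable for a log link.
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 5))
y = np.exp(X[:, 0] + 0.5 * X[:, 1] + rng.normal(scale=0.1, size=1000))

# With a non-default validation_tuning_metric, the updated documentation notes that a
# significantly higher early_stopping_rounds than the default of 200 might be needed.
model = APLRRegressor(
    loss_function="mse",
    link_function="log",
    validation_tuning_metric="neg_top_quantile_mean_response",
    quantile=0.95,                # metric is based on the top 5% of predictions
    early_stopping_rounds=1000,   # raised from the default of 200
)
model.fit(X, y)
predictions = model.predict(X)
```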

cpp/APLRRegressor.h

Lines changed: 17 additions & 13 deletions

@@ -945,7 +945,6 @@ void APLRRegressor::scale_response_if_using_log_link_function()
        {
            scaling_factor_for_log_link_function = 1 / inverse_scaling_factor;
            y_train *= scaling_factor_for_log_link_function;
-           y_validation *= scaling_factor_for_log_link_function;
        }
        else
            scaling_factor_for_log_link_function = 1.0;
@@ -1773,13 +1772,19 @@ void APLRRegressor::calculate_and_validate_validation_error(size_t boosting_step

double APLRRegressor::calculate_validation_error(const VectorXd &predictions)
{
+   VectorXd predictions_used{predictions};
+   if (link_function == "log")
+   {
+       predictions_used /= scaling_factor_for_log_link_function;
+   }
+
    if (validation_tuning_metric == "default")
    {
        if (loss_function == "custom_function")
        {
            try
            {
-               return calculate_custom_loss_function(y_validation, predictions, sample_weight_validation, group_validation, other_data_validation);
+               return calculate_custom_loss_function(y_validation, predictions_used, sample_weight_validation, group_validation, other_data_validation);
            }
            catch (const std::exception &e)
            {
@@ -1789,33 +1794,33 @@ double APLRRegressor::calculate_validation_error(const VectorXd &predictions)
        }
        else if (loss_function == "group_mse_cycle")
        {
-           return calculate_group_mse_by_prediction_validation_error(predictions);
+           return calculate_group_mse_by_prediction_validation_error(predictions_used);
        }
        else
-           return calculate_mean_error(calculate_errors(y_validation, predictions, sample_weight_validation, loss_function, dispersion_parameter, group_validation, unique_groups_validation, quantile), sample_weight_validation);
+           return calculate_mean_error(calculate_errors(y_validation, predictions_used, sample_weight_validation, loss_function, dispersion_parameter, group_validation, unique_groups_validation, quantile), sample_weight_validation);
    }
    else if (validation_tuning_metric == "mse")
-       return calculate_mean_error(calculate_errors(y_validation, predictions, sample_weight_validation, MSE_LOSS_FUNCTION), sample_weight_validation);
+       return calculate_mean_error(calculate_errors(y_validation, predictions_used, sample_weight_validation, MSE_LOSS_FUNCTION), sample_weight_validation);
    else if (validation_tuning_metric == "mae")
-       return calculate_mean_error(calculate_errors(y_validation, predictions, sample_weight_validation, "mae"), sample_weight_validation);
+       return calculate_mean_error(calculate_errors(y_validation, predictions_used, sample_weight_validation, "mae"), sample_weight_validation);
    else if (validation_tuning_metric == "negative_gini")
-       return -calculate_gini(y_validation, predictions, sample_weight_validation) / calculate_gini(y_validation, y_validation, sample_weight_validation);
+       return -calculate_gini(y_validation, predictions_used, sample_weight_validation) / calculate_gini(y_validation, y_validation, sample_weight_validation);
    else if (validation_tuning_metric == "group_mse")
    {
        bool group_is_not_provided{group_validation.rows() == 0};
        if (group_is_not_provided)
            throw std::runtime_error("When validation_tuning_metric is group_mse then the group argument in fit() must be provided.");
-       return calculate_mean_error(calculate_errors(y_validation, predictions, sample_weight_validation, "group_mse", dispersion_parameter, group_validation, unique_groups_validation, quantile), sample_weight_validation);
+       return calculate_mean_error(calculate_errors(y_validation, predictions_used, sample_weight_validation, "group_mse", dispersion_parameter, group_validation, unique_groups_validation, quantile), sample_weight_validation);
    }
    else if (validation_tuning_metric == "group_mse_by_prediction")
    {
-       return calculate_group_mse_by_prediction_validation_error(predictions);
+       return calculate_group_mse_by_prediction_validation_error(predictions_used);
    }
    else if (validation_tuning_metric == "custom_function")
    {
        try
        {
-           return calculate_custom_validation_error_function(y_validation, predictions, sample_weight_validation, group_validation, other_data_validation);
+           return calculate_custom_validation_error_function(y_validation, predictions_used, sample_weight_validation, group_validation, other_data_validation);
        }
        catch (const std::exception &e)
        {
@@ -1825,7 +1830,7 @@ double APLRRegressor::calculate_validation_error(const VectorXd &predictions)
    }
    else if (validation_tuning_metric == "neg_top_quantile_mean_response")
    {
-       double mean_response{calculate_quantile_mean_response(predictions, true)};
+       double mean_response{calculate_quantile_mean_response(predictions_used, true)};
        if (std::isinf(mean_response))
        {
            return mean_response;
@@ -1834,7 +1839,7 @@ double APLRRegressor::calculate_validation_error(const VectorXd &predictions)
    }
    else if (validation_tuning_metric == "bottom_quantile_mean_response")
    {
-       return calculate_quantile_mean_response(predictions, false);
+       return calculate_quantile_mean_response(predictions_used, false);
    }
    else
        throw std::runtime_error(validation_tuning_metric + " is an invalid validation_tuning_metric.");
@@ -2025,7 +2030,6 @@ void APLRRegressor::revert_scaling_if_using_log_link_function()
    if (link_function == "log")
    {
        y_train /= scaling_factor_for_log_link_function;
-       y_validation /= scaling_factor_for_log_link_function;
        intercept += std::log(1 / scaling_factor_for_log_link_function);
        for (Eigen::Index i = 0; i < intercept_steps.size(); ++i)
        {
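
The diff above changes where log-link scaling is handled during validation: previously the validation response was multiplied by the scaling factor, whereas now the predictions passed to the validation metrics are divided by it, so the metrics are computed on the original response scale. The following standalone Python sketch (not library code, and not part of this commit) mirrors that pattern; the weighted-mse computation stands in for the full dispatch on `validation_tuning_metric`.

```python
import numpy as np

def validation_error_sketch(y_validation, predictions, sample_weight,
                            link_function, scaling_factor_for_log_link_function):
    # Work on a copy of the predictions and, under a log link, bring them back
    # to the original response scale before evaluating the metric.
    predictions_used = predictions.copy()
    if link_function == "log":
        predictions_used = predictions_used / scaling_factor_for_log_link_function
    # Representative weighted-mse branch; the C++ method dispatches on
    # validation_tuning_metric and supports several other metrics.
    errors = (y_validation - predictions_used) ** 2
    return np.average(errors, weights=sample_weight)
```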
Binary file not shown.

setup.py

Lines changed: 1 addition & 1 deletion

@@ -28,7 +28,7 @@

 setuptools.setup(
     name="aplr",
-    version="10.12.0",
+    version="10.12.1",
     description="Automatic Piecewise Linear Regression",
     ext_modules=[sfc_module],
     author="Mathias von Ottenbreit",
