Unverified Commit bca2da97 authored by Belinda Trotta's avatar Belinda Trotta Committed by GitHub
Browse files

Interaction constraints (#3126)

* Add interaction constraints functionality.

* Minor fixes.

* Minor fixes.

* Change lambda to function.

* Fix gpu bug, remove extra blank lines.

* Fix gpu bug.

* Fix style issues.

* Try to fix segfault on MACOS.

* Fix bug.

* Fix bug.

* Fix bugs.

* Change parameter format for R.

* Fix R style issues.

* Change string formatting code.

* Change docs to say R package not supported.

* Remove R functionality, moving to separate PR.

* Keep track of branch features in tree object.

* Only track branch features when feature interactions are enabled.

* Fix lint error.

* Update docs and simplify tests.
parent f5e51649
...@@ -2185,3 +2185,28 @@ class TestEngine(unittest.TestCase): ...@@ -2185,3 +2185,28 @@ class TestEngine(unittest.TestCase):
'split_gain', 'threshold', 'decision_type', 'missing_direction', 'split_gain', 'threshold', 'decision_type', 'missing_direction',
'missing_type', 'weight', 'count'): 'missing_type', 'weight', 'count'):
self.assertIsNone(tree_df.loc[0, col]) self.assertIsNone(tree_df.loc[0, col])
def test_interaction_constraints(self):
    """Check that interaction_constraints restricts which features may co-occur in a tree.

    Three models are trained on the Boston housing data and compared:
    1. no constraint vs. a single constraint group containing ALL features
       (must produce identical predictions, since the constraint is vacuous);
    2. the features partitioned into two disjoint groups (strictly worse
       train MSE than the unconstrained model);
    3. every feature in its own singleton group, i.e. pure additive model
       (strictly worse train MSE than the two-group model).
    """
    X, y = load_boston(True)
    num_features = X.shape[1]
    train_data = lgb.Dataset(X, label=y)
    # check that constraint containing all features is equivalent to no constraint
    params = {'verbose': -1,
              'seed': 0}
    est = lgb.train(params, train_data, num_boost_round=10)
    pred1 = est.predict(X)
    # NOTE: parameter name fixed from 'interation_constraints' (typo) --
    # LightGBM silently ignores unknown parameters, so the typo made this
    # equivalence check vacuous (it compared two unconstrained models).
    est = lgb.train(dict(params, interaction_constraints=[list(range(num_features))]), train_data,
                    num_boost_round=10)
    pred2 = est.predict(X)
    np.testing.assert_allclose(pred1, pred2)
    # check that constraint partitioning the features reduces train accuracy
    est = lgb.train(dict(params, interaction_constraints=[list(range(num_features // 2)),
                                                          list(range(num_features // 2, num_features))]),
                    train_data, num_boost_round=10)
    pred3 = est.predict(X)
    self.assertLess(mean_squared_error(y, pred1), mean_squared_error(y, pred3))
    # check that constraints consisting of single features reduce accuracy further
    est = lgb.train(dict(params, interaction_constraints=[[i] for i in range(num_features)]), train_data,
                    num_boost_round=10)
    pred4 = est.predict(X)
    self.assertLess(mean_squared_error(y, pred3), mean_squared_error(y, pred4))
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment