CamlCATE

Here we’ll walk through an example of generating synthetic data, running CamlCATE, and visualizing results using the ground truth as reference.

CamlCATE is particularly useful when highly accurate CATE estimation is of primary interest in the presence of exogenous treatment, simple linear confounding, or complex non-linear confounding exists.

CamlCATE enables the use of various CATE models with varying assumptions on functional form of treatment effects & heterogeneity. When a set of CATE models are considered, the final CATE model is automatically selected is based on validation set performance.

Generate Synthetic Data

Here we’ll leverage the SyntheticDataGenerator class to generate a linear synthetic data generating process, with a binary treatment, continuous outcome, and a mix of confounding/mediating continuous covariates.

from caml.logging import configure_logging
import logging

configure_logging(level=logging.DEBUG)

[06/10/25 15:23:02] DEBUG    Logging configured with level: DEBUG                                     logging.py:70

from caml.extensions.synthetic_data import SyntheticDataGenerator

data_generator = SyntheticDataGenerator(
    n_obs=10_000,
    n_cont_outcomes=1,
    n_binary_treatments=1,
    n_cont_confounders=2,
    n_cont_modifiers=2,
    n_confounding_modifiers=1,
    causal_model_functional_form="linear",
    seed=10,
)

                    WARNING  SyntheticDataGenerator is experimental and may change in future         generics.py:44
                             versions.

We can print our simulated data via:

data_generator.df

	W1_continuous	W2_continuous	X1_continuous	X2_continuous	T1_binary	Y1_continuous
0	0.354380	-3.252276	2.715662	-3.578800	1	11.880305
1	0.568499	2.484069	-6.402235	-2.611815	0	-32.292141
2	0.162715	8.842902	1.288770	-3.788545	1	-48.696391
3	0.362944	-0.959538	1.080988	-3.542550	0	-1.899468
4	0.612101	1.417536	4.143630	-4.112453	1	-7.315334
...	...	...	...	...	...	...
9995	0.340436	0.241095	-6.524222	-3.188783	1	-27.578609
9996	0.019523	1.338152	-2.555492	-3.643733	1	-19.692436
9997	0.325401	1.258659	-3.340546	-4.255203	1	-26.087316
9998	0.586715	1.263264	-2.826709	-4.149383	1	-25.876331
9999	0.003002	6.723381	1.260782	-3.660600	1	-38.200522

10000 rows × 6 columns

To inspect our true data generating process, we can call data_generator.dgp. Furthermore, we will have our true CATEs and ATEs at our disposal via data_generator.cates & data_generator.ates, respectively. We’ll use this as our source of truth for performance evaluation of our CATE estimator.

for t, df in data_generator.dgp.items():
    print(f"\nDGP for {t}:")
    print(df)


DGP for T1_binary:
{'formula': '1 + W1_continuous + W2_continuous + X1_continuous', 'params': array([ 0.4609703 ,  0.2566887 , -0.03896251,  0.07238272]), 'noise': array([-0.51949108, -1.88624383,  0.86927397, ...,  0.87157749,
        0.0697439 , -0.72616319]), 'raw_scores': array([0.58800598, 0.13710535, 0.75412862, ..., 0.7549587 , 0.60527474,
       0.39290362]), 'function': <function SyntheticDataGenerator._create_dgp_function.<locals>.f_binary at 0x7f9b287b37f0>}

DGP for Y1_continuous:
{'formula': '1 + W1_continuous + W2_continuous + X1_continuous + X2_continuous + T1_binary + T1_binary*X1_continuous + T1_binary*X2_continuous', 'params': array([ 1.11129512, -4.1263484 , -4.82709212,  1.87319625,  2.60635605,
       -0.91633948,  0.71653213, -0.25067306]), 'noise': array([-1.15370094, -0.26681987,  0.05261899, ..., -0.18887322,
       -0.45736583, -0.57057603]), 'raw_scores': array([ 11.88030508, -32.29214063, -48.69639077, ..., -26.0873159 ,
       -25.87633114, -38.20052217]), 'function': <function SyntheticDataGenerator._create_dgp_function.<locals>.f_cont at 0x7f9b2918d3f0>}

data_generator.cates

	CATE_of_T1_binary_on_Y1_continuous
0	1.926628
1	-4.849035
2	0.956792
3	0.746245
4	3.083586
...	...
9995	-4.791812
9996	-1.834046
9997	-2.243283
9998	-1.901629
9999	0.904665

10000 rows × 1 columns

data_generator.ates

	Treatment	ATE
0	T1_binary_on_Y1_continuous	-0.55937

Running CamlCATE

Class Instantiation

We can instantiate and observe our CamlCATE object via:

Tip

W can be leveraged if we want to use certain covariates only in our nuisance functions to control for confounding and not in the final CATE estimator. This can be useful if a confounder may be required to include, but for compliance reasons, we don’t want our CATE model to leverage this feature (e.g., gender). However, this will restrict our available CATE estimators to orthogonal learners, since metalearners necessarily include all covariates. If you don’t care about W being in the final CATE estimator, pass it as X, as done below.

from caml import CamlCATE

caml_obj = CamlCATE(
    df=data_generator.df,
    Y="Y1_continuous",
    T="T1_binary",
    X=[c for c in data_generator.df.columns if "X" in c]
    + [c for c in data_generator.df.columns if "W" in c],
    discrete_treatment=True,
    discrete_outcome=False,
)

                    WARNING  CamlCATE is experimental and may change in future versions.             generics.py:44

print(caml_obj)

================== CamlCATE Object ==================
Data Backend: pandas
No. of Observations: 10,000
Outcome Variable: Y1_continuous
Discrete Outcome: False
Treatment Variable: T1_binary
Discrete Treatment: True
Features/Confounders for Heterogeneity (X): ['X1_continuous', 'X2_continuous', 'W1_continuous', 'W2_continuous']
Features/Confounders as Controls (W): []
Random Seed: None

Nuisance Function AutoML

We can then obtain our nuisance functions / regression & propensity models via Flaml AutoML:

caml_obj.auto_nuisance_functions(
    flaml_Y_kwargs={
        "time_budget": 30,
        "verbose": 0,
        "estimator_list": ["rf", "extra_tree", "xgb_limitdepth"],
    },
    flaml_T_kwargs={
        "time_budget": 30,
        "verbose": 0,
        "estimator_list": ["rf", "extra_tree", "xgb_limitdepth"],
    },
)

print(caml_obj.model_Y_X_W)
print(caml_obj.model_Y_X_W_T)
print(caml_obj.model_T_X_W)

ExtraTreesRegressor(max_features=0.9924623662362855, max_leaf_nodes=3267,
                    n_estimators=140, n_jobs=-1, random_state=12032022)
ExtraTreesRegressor(max_features=0.9924623662362855, max_leaf_nodes=3267,
                    n_estimators=140, n_jobs=-1, random_state=12032022)
XGBClassifier(base_score=None, booster=None, callbacks=[],
              colsample_bylevel=0.9366334928584987, colsample_bynode=None,
              colsample_bytree=0.7801788111200721, device=None,
              early_stopping_rounds=None, enable_categorical=False,
              eval_metric=None, feature_types=None, gamma=None,
              grow_policy=None, importance_type=None,
              interaction_constraints=None, learning_rate=0.42324459351542365,
              max_bin=None, max_cat_threshold=None, max_cat_to_onehot=None,
              max_delta_step=None, max_depth=1, max_leaves=None,
              min_child_weight=77.88614459419128, missing=nan,
              monotone_constraints=None, multi_strategy=None, n_estimators=14,
              n_jobs=-1, num_parallel_tree=None, random_state=None, ...)

Fit CATE Estimators

Now that we have obtained our first-stage models, we can fit our CATE estimators via:

Note

The selected model defaults to the one with the highest RScore. All fitted models are still accessible via the cate_estimators attribute and if you want to change default estimator, you can run caml_obj._validation_estimator = {different_model}.

🚀Forthcoming: Additional scoring techniques & AutoML for CATE estimators is on our roadmap.

caml_obj.fit_validator(
    cate_estimators=[
        "LinearDML",
        "CausalForestDML",
        "ForestDRLearner",
        "LinearDRLearner",
        "DomainAdaptationLearner",
        "SLearner",
        "TLearner",
        "XLearner",
    ],
    validation_size=0.2,
    test_size=0.2,
    n_jobs=-1,
)

[06/10/25 15:24:54] INFO     Best Estimator: LinearDRLearner                                            cate.py:854

                    INFO     Estimator RScores: {'LinearDML': 0.4330668250950459, 'CausalForestDML':    cate.py:855
                             0.42359894037321255, 'ForestDRLearner': 0.43008695668922525,                          
                             'LinearDRLearner': 0.433440820487627, 'DomainAdaptationLearner':                      
                             0.4220209999654573, 'SLearner': 0.3955966870782457, 'TLearner':                       
                             0.39094152379289226, 'XLearner': 0.41786415918575315}

caml_obj.validation_estimator

<econml.dr._drlearner.LinearDRLearner at 0x7f9b3d1a68c0>

caml_obj.cate_estimators

[('LinearDML', <econml.dml.dml.LinearDML at 0x7f9b29114d90>),
 ('CausalForestDML',
  <econml.dml.causal_forest.CausalForestDML at 0x7f9b29117df0>),
 ('ForestDRLearner', <econml.dr._drlearner.ForestDRLearner at 0x7f9b29157640>),
 ('LinearDRLearner', <econml.dr._drlearner.LinearDRLearner at 0x7f9b3d1a68c0>),
 ('DomainAdaptationLearner',
  <econml.metalearners._metalearners.DomainAdaptationLearner at 0x7f9b291963b0>),
 ('SLearner', <econml.metalearners._metalearners.SLearner at 0x7f9b4fe080d0>),
 ('TLearner', <econml.metalearners._metalearners.TLearner at 0x7f9b29156500>),
 ('XLearner', <econml.metalearners._metalearners.XLearner at 0x7f9b70950220>)]

Validate model on test hold out set

Here we can validate our model on the test hold out set. Currently, this is only available for when continuous outcomes and binary treatments exist.

caml_obj.validate()

[06/10/25 15:24:56] INFO     All validation results suggest that the model has found statistically      cate.py:499
                             significant heterogeneity.

   treatment  blp_est  blp_se  blp_pval  qini_est  qini_se  qini_pval  autoc_est  autoc_se  autoc_pval  cal_r_squared
0          1    1.018   0.023       0.0     0.975    0.029        0.0      2.611     0.089         0.0          0.973

Refit our selected model on the entire dataset

Now that we have selected our top performer and validated results on the test set, we can fit our final model on the entire dataset.

caml_obj.fit_final()

caml_obj.final_estimator

<econml.dr._drlearner.LinearDRLearner at 0x7f9b28651a50>

Validating Results with Ground Truth

First, we will obtain our predictions.

cate_predictions = caml_obj.predict()

Average Treatment Effect (ATE)

We’ll use the summarize() method after obtaining our predictions above, where our the displayed mean represents our Average Treatment Effect (ATE).

caml_obj.summarize()

	cate_predictions_0_1
count	10000.000000
mean	-0.571645
std	3.416533
min	-6.993616
25%	-3.540207
50%	-0.601345
75%	2.377039
max	6.127550

Now comparing this to our ground truth, we see the model performed well the true ATE:

data_generator.ates

	Treatment	ATE
0	T1_binary_on_Y1_continuous	-0.55937

Conditional Average Treatment Effect (CATE)

Now we want to see how the estimator performed in modeling the true CATEs.

First, we can simply compute the Precision in Estimating Heterogeneous Effects (PEHE), which is simply the Root Mean Squared Error (RMSE):

from sklearn.metrics import root_mean_squared_error

true_cates = data_generator.cates.iloc[:, 0]
root_mean_squared_error(true_cates, cate_predictions)

0.14776717903884

Not bad! Now let’s use some visualization techniques:

from caml.extensions.plots import cate_true_vs_estimated_plot

cate_true_vs_estimated_plot(
    true_cates=true_cates, estimated_cates=cate_predictions
)

from caml.extensions.plots import cate_histogram_plot

cate_histogram_plot(true_cates=true_cates, estimated_cates=cate_predictions)

from caml.extensions.plots import cate_line_plot

cate_line_plot(
    true_cates=true_cates, estimated_cates=cate_predictions, window=20
)

Overall, we can see the model performed remarkably well!~

Obtaining Model Objects & Artifacts for Production Systems

In many production settings, we will want to store our model, information on the features used, etc. We provide attributes that to pull key information (more to be added later as class evolves)

Grabbing final model object:

caml_obj.final_estimator

<econml.dr._drlearner.LinearDRLearner at 0x7f9b28651a50>

Grabbing input features:

caml_obj.input_names

{'feature_names': ['X1_continuous',
  'X2_continuous',
  'W1_continuous',
  'W2_continuous'],
 'output_names': 'Y1_continuous',
 'treatment_names': 'T1_binary'}

Grabbing all fitted CATE estimators:

caml_obj.cate_estimators

[('LinearDML', <econml.dml.dml.LinearDML at 0x7f9b29114d90>),
 ('CausalForestDML',
  <econml.dml.causal_forest.CausalForestDML at 0x7f9b29117df0>),
 ('ForestDRLearner', <econml.dr._drlearner.ForestDRLearner at 0x7f9b29157640>),
 ('LinearDRLearner', <econml.dr._drlearner.LinearDRLearner at 0x7f9b3d1a68c0>),
 ('DomainAdaptationLearner',
  <econml.metalearners._metalearners.DomainAdaptationLearner at 0x7f9b291963b0>),
 ('SLearner', <econml.metalearners._metalearners.SLearner at 0x7f9b4fe080d0>),
 ('TLearner', <econml.metalearners._metalearners.TLearner at 0x7f9b29156500>),
 ('XLearner', <econml.metalearners._metalearners.XLearner at 0x7f9b70950220>)]