See debug.py for an example of how everything can be run. Summary of contents:
- environments contains the environments (currently a single environment with a simple one-dimensional state space and uniformly distributed transitions, where one end of the state space is always preferred for reward maximization, so the adversarial transition function can be computed analytically)
- policies contains different policies I am experimenting with
- models contains nn.Module code for modelling the Q / beta / w functions (along with the critic functions for minimax methods)
- learners contains the learning algorithms (currently, a minimax algorithm for estimating Q/beta is implemented)
- utils contains some useful generic utilities
Libraries needed: torch, numpy, gymnasium.