The problem of learning treatment assignment policies from randomized or observational data arises in many fields. For example, in personalized medicine, we seek to map patient observables (like age, gender, heart pressure, etc.) to a treatment choice using a data-driven rule. There has recently been a considerable amount of work on statistical methodology for policy learning, including