Do health economic evaluations using observational data provide reliable assessment of treatment effects?

Rovithis, Dimitrios

doi:10.1186/2191-1991-3-21

Review
Open access
Published: 19 September 2013

Do health economic evaluations using observational data provide reliable assessment of treatment effects?

Dimitrios Rovithis¹

Health Economics Review volume 3, Article number: 21 (2013) Cite this article

121k Accesses
12 Citations
3 Altmetric
Metrics details

Abstract

Economic evaluation in modern health care systems is seen as a transparent scientific framework that can be used to advance progress towards improvements in population health at the best possible value. Despite the perceived superiority that trial-based studies have in terms of internal validity, economic evaluations often employ observational data. In this review, the interface between econometrics and economic evaluation is explored, with emphasis placed on highlighting methodological issues relating to the evaluation of cost-effectiveness within a bivariate framework. Studies that satisfied the eligibility criteria exemplified the use of matching, regression analysis, propensity scores, instrumental variables, as well as difference-in-differences approaches. All studies were reviewed and critically appraised using a structured template. The findings suggest that although state-of-the-art econometric methods have the potential to provide evidence on the causal effects of clinical and policy interventions, their application in economic evaluation is subject to a number of limitations. These range from no credible assessment of key assumptions and scarce evidence regarding the relative performance of different methods, to lack of reporting of important study elements, such as a summary outcome measure and its associated sampling uncertainty. Further research is required to better understand the ways in which observational data should be analysed in the context of the economic evaluation framework.

Introduction

Trial-based studies are regarded as the gold standard in evaluative research since the assignment of individuals into treatment is typically random, independent of covariates and potential outcomes, ensuring in this way the highest possible internal validity [1]. Nevertheless, pragmatic reasons require economic evaluations to also rely on observational economic and clinical data [2]. Studies using observational data are prone to selection bias, with the treatment effect potentially being confounded with individual, provider or other characteristics [3]. Selection bias constitutes a major threat to the internal validity of a study and unless its presence can be minimised the estimated treatment effects do not necessarily imply a cause and effect relationship [4].

Over the years, a number of econometric methods that deal with selection bias have been developed, depending on whether the source of bias is observed or not [5]. Such analytical approaches operate in the context of the potential outcomes framework and include matching, regression analysis, propensity scores, instrumental variables, regression discontinuity designs, difference-in-differences approaches and control functions [6]. Their key task is the construction of the counterfactual outcome, when the evaluation problem is the measurement of a treatment effect in the presence of non-random selection into treatment. Non-experimental evaluation methods have the potential to estimate a single average effect, or look into the heterogeneity of individuals’ responses to the intervention of interest, depending on the nature of the research question, the richness and type of the available data, as well as the postulated model for outcome and selection processes [7].

Detailed exposition of these methods and examples of their use in a range of applications has been reviewed elsewhere [8, 9]. The aim of this study is to identify which methods are currently used in economic evaluation studies employing observational data and discuss the scope and scientific quality of the current evidence-base, in order to identify gaps in our knowledge and to consider the future research agenda.

Review

Eligibility criteria and identification strategy

A review of the international English language literature was undertaken. The eligibility criteria for inclusion required studies to be full economic evaluations as defined by Drummond et al. [10] and use observational microdata referring to the same population for both the cost and effectiveness outcome. The review placed particular emphasis on identifying methods than including all applications. As such, only studies that demonstrated some modification in the methodology addressing selection bias were included. In addition, studies should use an econometric method to adjust at least one of the cost, effectiveness and cost-effectiveness outcomes.

The studies reviewed were identified using a four-stage process. First, three generic electronic bibliographic databases, namely MEDLINE, EMBASE, and Econlit were searched through the OvidSP interface in order to generate as many papers of potential methodological interest as possible for the years 1990-2010. Second, additional searches for the same time period were carried out in three specialised databases: NHS EED, HEED and CEA Registry. The search strategy, which was adapted for each database, combined and interacted the terms “cost*”, “effect*”, “benefit*”, “cost-effective*” and “cost-benefit*”, with “matching”, “stratification”, “regression*”, “propensity score*”, “instrumental variable*”, “difference in difference*”, “control function” and “discontinuity”. Third, the database searching was supplemented by communicating with other experts. The expert communication involved sending a brief outline of the review objectives, together with a list of key publications to individuals working in similar research areas. Colleagues were requested to suggest further published, unpublished or work-in-progress research for inclusion in the review. Experts were identified through the literature, known contacts and posting on relevant online discussion lists. Fifth, an examination of the references and the citations of all eligible studies was undertaken, with a view to identify further papers that were not already captured during the previous stages.

Review process

In methodologically allied sciences such as epidemiology, a checklist of items that should be included in studies reporting observational research has been established [11]. In the absence of a similar methodological inquiry for economic evaluation, all papers here were reviewed using a custom-made structured template, (Additional file 1) the development of which was informed by a conceptual review of non-experimental methods that can be used for the evaluation of treatment effects. The template comprised of three parts aiming to extract relevant factual information from each study and critically appraise important aspects of their methodology. More specifically, the first part recorded general characteristics such as bibliographic information, the type of economic evaluation undertaken, whether a summary cost-effectiveness outcome measure was used, as well as the interventions evaluated. In the second part, the template focused on extracting the method(s) adjusting for selection bias, the estimation techniques employed and whether adjustment was undertaken for costs, effectiveness or cost-effectiveness. Any comparisons with other methods or studies, the types of uncertainty evaluated and the authors’ conclusions with respect to the ability of methods to adjust for selection bias were also extracted. Finally, the third part recorded information relating to the justification of choice of method and the specification(s) used, as well as whether any relevant tests or graphical analyses were carried out. A reviewer’s assessment concerning potential weakness of the study was also included. This was based on the extracted information as these were provided by the authors and placed particular emphasis on assessing the plausibility of the assumptions postulated by the analytical method employed.

Methods such as regression analysis, matching, ‘sharp’ regression discontinuity designs, as well solutions relying on the propensity score require the selection on observables assumption and common support for baseline covariates between the treatment groups [7]. The idea here is that in groups with sufficient overlapping baseline characteristics, treatment assignment of individuals is said to be “as good as random”, with potential outcomes being independent of treatment status [12]. In cases of imbalance, once the analyst conditions on a set of observable confounders, it is assumed that there are no differences in the distributions of unobserved confounders which are correlated with those that are observed [13]. Regression analysis and difference-in-differences quasi-experimental designs can obtain the treatment effect under weaker conditions as long as longitudinal data are used. In such cases, what is typically assumed is that unobserved confounders correlated with treatment assignment and the outcomes are time-invariant [14]. For example, the conventional difference-in-differences approach assumes that the composition of individuals in the treatment and control groups follows a parallel path over time [15]. On the other hand, methods making use of exclusion restrictions (instrumental variables, control functions, ‘fuzzy’ regression discontinuity designs) can explicitly address selection on unobservables, arising from specification errors, endogeneity and unobserved heterogeneity [6]. The principal assumptions under which these methods operate require that the exclusion restriction is strongly correlated with treatment, it only influences outcomes through treatment and it is independent of unobserved confounders [16]. Finally, a further assumption for unbiased treatment effect estimates in parametric implementations of these methods is that the model reflects the true relationship between the outcome of interest and the covariates used for the adjustment [17].

Results

A schematic diagram of the overall identification process is presented in Figure 1. The original search strategy yielded a total of 6,647 unique studies. Additional independent search strategies in the three specialised databases returned 3,640 studies in NHS EED, 2,708 in HEED and 60 in the CEA Registry. Requests for studies from other experts yielded a further 18 studies. All studies underwent a screening process to ensure that they met the eligibility criteria of the review. It should be noted that when it was apparent from the title or abstract that a study failed on any of these criteria, it was discarded. When it was unclear or if any doubt remained, the full paper was examined. Following the review of titles and abstracts, full text copies were obtained for 383 potentially relevant studies. After assessment, 342 were excluded from the review because they did not meet the eligibility criteria. Cross-reference checks of the selected studies yielded another 2 relevant studies that satisfied the eligibility criteria. The final sample of the studies fully reviewed using an equivalent number of structured templates comprised of 43 studies.

Table 1 provides a summary of the results of the review (Additional file 2). As it can be seen, around a third of the economic evaluations included in the review did not report a summary outcome measure, whereas half of the studies failed to report in a precise and transparent manner information relating to sampling uncertainty for cost-effectiveness. A relatively small percentage of studies evaluated multiple interventions but the majority did not consider explicitly the issue of how these should be handled in the analysis and relied on pairwise comparisons of interventions. In terms of analytical methods currently employed to adjust for selection bias, the review identified five broad categories, which were mostly applied in sample sizes of 5000 individuals or less. These include different methods matching on individual covariates, some form of regression analysis using cross-sectional or longitudinal data; propensity score analysis either through matching, or regression modelling using the propensity score as a covariate; difference-in-differences approaches; and instrumental variables analysis. Solutions based on the propensity score dominated the sample of the reviewed studies.

Table 1 Main characteristics of the reviewed studies

Full size table

The majority of studies failed to adequately assess the assumptions postulated by each method. For example, studies relying on matching, regression and propensity score analysis usually justified the selection on observables assumption by providing a simple description for the confounders adjusting. Overlap between treatment groups was explored mostly through standard tests that assess heterogeneity, although in studies where regression analysis was employed, balance between groups was sometimes investigated using models that assessed whether significant interactions were present between treatment and covariates. Studies employing some form of matching assessed covariate balance post-matching by comparing means in the resulting groups. Economic evaluations relying on difference-in-differences approaches typically assessed the parallel path assumption by comparing pre-existing time trends, whereas analyses exploiting the use of instruments to achieve quasi-randomisation mostly assessed the relevance of the instrument but not its validity. Studies using parametric models rarely assessed functional form assumptions through formal statistical tests and no studies employed any graphical analysis for visual inspection. Finally, almost half of the reviewed studies attempted to contrast their findings with those obtained from other studies, while a small percentage directly compared the results that different methods produced.

Discussion

Economic evaluations often employ observational data. Econometric methods can only adjust for selection bias by relying on assumptions that should approximately be met. Although these assumptions are mostly untestable, their credibility in a particular setting can be assessed and analysts engaging in applied economic evaluation should always undertake extensive checking procedures to confirm the robustness of their results. In fact, this process may often require more effort than the estimation of the treatment effect itself.

The review revealed that this is typically not the case. The published economic evaluation literature currently routinely applies econometric methodology without carefully considering whether the key assumptions under which these methods operate hold. For example, reviewed studies making use of methods that assume selection on observables rarely used findings from prior research or expert advice to establish the causal pathway between interventions and outcomes. These studies also did not report the conduct of observational ‘placebo’ tests [18], or sensitivity analyses such as Rosenbaum’s bounds simulating the likely presence and impact of unobserved confounders [19]. Similarly, alternative specifications considering varying sets of covariates or different functional forms in regression models were only reported in the few studies published in statistical and health economics journals, aiming to offer a more methodological treatment of the evaluation. In addition, in studies where individuals between treatment groups are not comparable, parametric methods may extrapolate beyond the support provided by the available data [20]. The literature currently is dominated by studies that do not adequately assess the common support assumption. For instance, although no statistically significant interactions between treatment and covariates may be identified, imbalances may still remain [21]. Graphical analysis such as histograms or smoothed density plots can be very effective in detecting areas of insufficient overlap and can complement statistical tests.

Matching methods can act as a data pre-processing stage before inference [17]. This approach allows the analyst to consider problems of imbalance in individual covariates and assess overlap between treatment groups in a direct and explicit manner, both before and after adjustment [13]. Authors of economic evaluations employing some form of matching often failed to report details regarding the type of matching performed and generally did not consider different methods in their analyses. Alternative matching procedures, as well as different values of key estimator parameters can have important implications with regards to the bias-efficiency trade-off inherent in all these methods and may also result in varying interpretations of the treatment effect [12]. Studies using matching reported balance tests relying on mean differences, as well as standardised differences. The latter will typically be preferred as they are not affected by sample size and therefore can be used for comparisons between treatment groups that contain different numbers of individuals [22]. Nevertheless, solutions relying on the propensity score require post-match balance in the entire distribution of individual covariates. As such, higher moments including variance, skewness, and kurtosis, as well as cross-moments such as the covariance, should ideally be examined [17]. This was something that nearly all relevant studies included in the review failed to report. For continuous covariates, graphical analyses using quantile–quantile plots and side-by-side boxplots, or Kolmogorov–Smirnoff non-parametric tests of the equality of distributions can consider the full covariate distribution and thus be more informative [20, 23].

For studies relying on exclusion restrictions to account for unobservables, it is crucial for the analyst to determine whether these are relevant and valid in a particular setting [7]. In a purely statistical context, random assignment in a trial meets all the required assumptions and as such it is an exclusion restriction by definition. In contrast, in observational studies, particularly in socially behavioural settings, choice of convincing exclusion restrictions will often be less straightforward and will have to be justified on qualitative rather than empirical grounds [24]. For economic decision-making, strong exclusion restrictions will require the analyst to go through the challenging process of crafting plausible natural experiments, which exploit extensive demand and supply side information to construct variables that can induce strong external variation in the treatment assignment of individuals [25]. In their quest for finding an instrument that satisfies the assumed properties laid out above, studies included in this review often employed a well-known tactic in economics, which involves exploiting the use of geographical variables. Nevertheless, the externality of an instrument does not necessarily also assures exogeneity; that is, it does not automatically fulfil the orthogonality condition required for consistent estimation in the instrumental variable context [26]. Indeed, the economic evaluation by Polsky and Basu [27] should act as a reminder that the performance of an instrument will not always be guaranteed.

At this point, it is important to stress that the application of different methods will ultimately depend on the availability of data. Econometric methods are all data driven, being applicable only in situations where relevant microdata can be accessed to support them and as long as the analysis takes advantage of their availability. Administrative data can potentially provide the analyst with the ability to link information from multiple databases creating datasets containing more complete data on individuals over time, additional background and demographic variables, as well as data on participants and non-participants [28]. In addition, routinely collected information is increasingly shifting from data related to processes of care and patient outcomes such as mortality and morbidity, to data related to more complex measures of health status [29]. These considerations can expand the range of non-experimental methods that can be used to measure treatment effects.

A key aspect of the review appraisal was to consider whether a comparison of cost-effectiveness estimates with existing evidence was attempted, or whether studies explored the sensitivity of their results to alternative methods. The motivation for the former rests on the fact that estimates from other relevant studies, when available, can potentially offer a prior indication regarding the direction of the treatment effect. This is particularly true for evidence generated from randomised trials, which in principle can constitute an important benchmark for learning about non-experimental methods [30, 31]. Unfortunately, when comparisons of this kind were attempted in the reviewed studies, these were mostly qualitative in nature, relating to overall conclusions or comparing only certain outcomes such as costs, survival or hospitalisations. Some economic evaluations restricted the scope of their comparisons to those across methods. Such comparisons can also act as sensitivity analysis when the availability of data allows the use of alternative analytical approaches, which rely on different assumptions and have the potential to exhibit variable performance in different settings. For example, in the econometrics literature, choice among estimators that rely on the selection on observables assumption is normally warranted on small sample arguments [6]. Currently, the embryonic nature of such evidence in the studies reviewed does not allow any firm conclusions to be drawn regarding the relative ability of different methods or their combinations to reduce selection bias in the context of cost-effectiveness. However, what is clear is that choice of method may not only influence estimates, but can also fundamentally alter conclusions [26].

In addition, no economic evaluations identified by this review employed any ‘doubly robust’ approaches, which typically involve the use of regression analysis in combination with some form of weighting. For example, Robins and colleagues [32] proposed the use of the inverse propensity score to weigh a regression model, offering in this way additional protection against misspecification. More recently, doubly robust estimation has been extended to instrumental variable analysis [33, 34]. Another strand of this type of research that gets increasing attention in the econometrics literature is the use of regression analysis after matching. This is a ‘bias-correction’ solution that has been shown to correct for remaining finite sample bias, while potentially also making violations of functional form assumptions less consequential [35, 36]. Given the greater potential for misspecification that arises from the consideration of economic and clinical endpoints in the analysis, the development of such approaches for evaluating cost-effectiveness and their comparison with standalone solutions is highly desirable.

In economic evaluation for decision-making, three additional issues merit attention [10]. First, incremental costs and effectiveness should be combined in a summary outcome measure. Second, the analyst must quantify and evaluate the sampling variability in this cost-effectiveness estimate. Third, the analysis must ideally take into consideration all relevant comparators. The review revealed that a number of studies did not combine incremental costs and effectiveness. Summary outcome measures are used by decision-makers to help make policy recommendations on the allocation of resources for competing health care interventions [37]. In the absence of a summary outcome measure, evaluating sampling uncertainty for the purpose of cost-effectiveness will not be possible. In addition, there seems to be a lack of transparency in the reporting of such information. For example, reviewed studies failed to report which bootstrap method was used to construct the reported confidence intervals, or did not provide any justification for the number of replications employed. The use of multiple comparators in an economic evaluation also raises the question of how these should be handled in the econometric analysis. Some studies identified by the review have shown that in regression analysis the use of multinomial choice models can act as an alternative to pairwise comparisons of interventions. Although these approaches have also been exemplified in the econometrics literature for propensity score matching [38] and doubly robust methods [39], no such extensions in the context of cost-effectiveness were identified.

This review is subject to certain caveats, which must be acknowledged. First, the conclusions of this review do not apply to all economic evaluations that use observational data. Decision analytical modelling-based studies, as well as studies employing hypothetical data or summary evidence for costs and effectiveness were considered beyond the scope of this review and were excluded. In addition, methods dealing with issues relevant to missing and censored data were also not included. Second, the review should not be considered an exhaustive investigation of the applied economic evaluation literature employing observational microdata. Nevertheless, a four-stage identification process ensured that as many studies as possible exemplifying modifications of analytical approaches were captured. Finally, it should also be acknowledged that only one reviewer carried out the review of studies. As such, although a structured template was used in an attempt to streamline the review process and render the appraisal of studies more rigorous, the categorisation of the collected information and the interpretation of the findings presented here may be subject to a certain degree of subjectivity.

Conclusions

Estimation of treatment effects in economic evaluation involves considerable challenges when observational data are used. The aim of this structured review was to identify econometric methods that can be used to evaluate the cost-effectiveness of health care interventions and critically appraise their application in economic evaluations employing such data. Available methods adjust for selection bias using an array of mostly untestable assumptions, with a cost-effectiveness evaluation requiring consideration of a broader range of issues compared to other observational studies. Current limitations include inadequate assessment of the credibility of fundamental assumptions; absence of good quality evidence regarding the sensitivity of results to different analytical approaches or variations in crucial estimator parameters; failure to combine incremental costs and effectiveness in a summary outcome measure; no consideration of sampling uncertainty for the purpose of evaluating cost-effectiveness; and unclear handling of multiple interventions. Future research should exemplify robust analyses that explicitly acknowledge these issues and address them in a convincing manner.

References

Collins R, MacMahon S: Reliable assessment of the effects of treatment on mortality and major morbidity, I: clinical trials. Lancet 2001,357(9253):373–380. 10.1016/S0140-6736(00)03651-5
Article CAS PubMed Google Scholar
Drummond M: Experimental versus observational data in the economic evaluation of pharmaceuticals. Med Decis Making 1998,18(2):S12-S18. 10.1177/0272989X9801800203
Article CAS PubMed Google Scholar
Buxton M, Drummond M, Van Hout B, Prince R, Sheldon T, Szucs T, Vray M: Modelling in economic evaluation: An unavoidable fact of life. Health Econ 1997,6(3):217–227. 10.1002/(SICI)1099-1050(199705)6:3<217::AID-HEC267>3.0.CO;2-W
Article CAS PubMed Google Scholar
Jones A: Identification of treatment effects in Health Economics. Health Econ 2007,16(11):1127–1131. 10.1002/hec.1302
Article PubMed Google Scholar
Heckman J: Building Bridges Between Structural and Program Evaluation Approaches to Evaluating Policy. J Econ Lit 2010,48(2):356–398. 10.1257/jel.48.2.356
Article PubMed Central PubMed Google Scholar
Imbens G, Wooldridge J: Recent developments in the econometrics of program evaluation. J Econ Lit 2009,47(1):5–86. 10.1257/jel.47.1.5
Article Google Scholar
Blundell R, Dias MC: Alternative approaches to evaluation in empirical microeconomics. J Hum Resour 2009,44(3):565–640. 10.1353/jhr.2009.0009
Article Google Scholar
Jones A: Panel Data Methods and Applications to Health Economics. In Palgrave Handbook of Econometrics, Volume 2: Applied Econometrics. Edited by: Mills T, Petterson K. Palgrave Macmillan; 2009.
Google Scholar
Jones A, Rice N: Econometric evaluation of health policies. In The Oxford Handbook of Health Economics. Edited by: Glied S, Smith P. Oxford: Oxford University Press; 2011.
Google Scholar
Drummond M, Sculpher M, Torrance G, O'Brien B, Stoddart G: Methods for the economic evaluation of health care programmes. 3rd edition. Oxford: Oxford University Press; 2005.
Google Scholar
Von Elm E, Altman D, Egger M, Pocock S, Gøtzsche P, Vandenbroucke J: The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement: guidelines for reporting observational studies. Prev Med 2007,45(4):247–251. 10.1016/j.ypmed.2007.08.012
Article PubMed Google Scholar
Imbens G: Nonparametric estimation of average treatment effects under exogeneity: a review. Rev Econ Stat 2004,86(1):4–29. 10.1162/003465304323023651
Article Google Scholar
Stuart E: Matching methods for causal inference: a review and a look forward. Stat Sci 2010,25(1):1. 10.1214/09-STS313
Article PubMed Central PubMed Google Scholar
Wooldridge J: Econometric Analysis of Cross Section and Panel Data. 2nd edition. MIT Press; 2010.
Google Scholar
Abadie A: Semiparametric difference-in-differences estimators. Rev Econ Stud 2005,72(1):1–19. 10.1111/0034-6527.00321
Article Google Scholar
Angrist J, Krueger A Working Paper No. w8456. In Instrumental variables and the search for identification: From supply and demand to natural experiments. National Bureau of Economic Research; 2001.
Chapter Google Scholar
Ho D, Imai K, King G, Stuart E: Matching as nonparametric preprocessing for reducing model dependence in parametric causal inference. Political Analysis 2007,15(3):199–236.
Article Google Scholar
Sekhon J: Opiates for the matches: matching methods for causal inference. Annu Rev Political Sci 2009, 12: 487–508. 10.1146/annurev.polisci.11.060606.135444
Article Google Scholar
Rosenbaum P: Observational studies. 2nd edition. Springer; 2002.
Book Google Scholar
Sekhon J, Grieve R: A Nonparametric Matching Method for Covariate Adjustment with Application to Economic Evaluation. 2009. [online] Available at: [Accessed 17 March 2010] http://sekhon.berkeley.edu/papers/GeneticMatching_SekhonGrieve.pdf
Google Scholar
Mihaylova B, Pitman R, Tincello D, Van Der Vaart H, Tunn R, Timlin L, Quail D, Johns A, Sculpher M: Cost-effectiveness of duloxetine: the Stress Urinary Incontinence Treatment (SUIT) study. Value Health 2010,13(5):565–572. 10.1111/j.1524-4733.2010.00729.x
Article PubMed Google Scholar
Flury B, Riedwyl H: Standard distance in univariate and multivariate analysis. Am Stat 1986,40(3):249–251.
Google Scholar
Austin P: The relative ability of different propensity score methods to balance measured covariates between treated and untreated subjects in observational studies. Med Decis Making 2009,29(6):661–677. 10.1177/0272989X09341755
Article PubMed Google Scholar
Heckman J: Econometric causality. Int Stat Rev 2008,76(1):1–27. 10.1111/j.1751-5823.2007.00024.x
Article Google Scholar
Angrist J, Imbens G, Rubin D: Identification of causal effects using instrumental variables. J Am Stat Assoc 1996,91(434):444–455. 10.1080/01621459.1996.10476902
Article Google Scholar
Deaton A: Instruments, randomization, and learning about development. J Econ Lit 2010, 48: 424–455. 10.1257/jel.48.2.424
Article Google Scholar
Polsky D, Basu A: Selection bias in observational data. In The Elgar Companion to Health Economics. Edited by: Jones A. Edware Elgar Publishing; 2006.
Google Scholar
Hotz V, Goerge R, Balzekas J, Margolin F Report of the Advisory Panel on Research Uses of Administrative Data of the Northwestern University/University of Chicago Joint Center for Poverty Research. Administrative data for policy-relevant research: Assessment of current utility and recommendations for development 1998.
Google Scholar
Hutchings H, Cheung W, Williams J, Cohen D, Longo M, Russell I: Can electronic routine data act as a surrogate for patient-assessed outcome measures? Int J Technol Assess Health Care 2005,21(1):138–143.
Article PubMed Google Scholar
LaLonde R: Evaluating the econometric evaluations of training programs with experimental data. Am Econ Rev 1986,76(4):604–620.
Google Scholar
Smith J, Todd P: Does matching overcome LaLonde’s critique of nonexperimental estimators? J Econometrics 2005,125(1):305–353.
Article Google Scholar
Robins J, Rotnitzky A, Zhao L: Estimation of regression coefficients when some regressors are not always observed. J Am Stat Assoc 1994,89(427):846–866. 10.1080/01621459.1994.10476818
Article Google Scholar
Uysal S: Doubly Robust IV Estimation of the Local Average Treatment Effects 2011. Available at: [Accessed 11 October 2012] http://www.ihs.ac.at/vienna/resources/Economics/Papers/Uysal_paper.pdf
Okui R, Small D, Tan Z, Robins J: Doubly Robust Instrumental Variable Regression. Statistica Sinica 2012, 22: 173–205.
Article Google Scholar
Abadie A, Imbens G: Bias-corrected matching estimators for average treatment effects. J Bus Econ Stat 2011,29(1):1–11. 10.1198/jbes.2009.07333
Article Google Scholar
Iacus S, King G, Porro G: Multivariate matching methods that are monotonic imbalance bounding. J Am Stat Assoc 2011,106(493):345–361. 10.1198/jasa.2011.tm09599
Article CAS Google Scholar
National Institute for Health and Clinical Excellence: Guide to the Methods of Technology Appraisal. London: NICE; 2008.
Google Scholar
Lechner M: Identification and estimation of causal effects of multiple treatments under the conditional independence assumption. Econometric Eval Labour Market Policies 2001, 13: 43–58. 10.1007/978-3-642-57615-7_3
Article Google Scholar
Uysal S: Doubly Robust Estimation of Causal Effects with Multivalued Treatments 2012. Available at: [Accessed 20 November 2012] http://www.ihs.ac.at/vienna/resources/Economics/Papers/20121115_Paper_Uysal.pdf

Download references

Acknowledgments

I acknowledge partial financial support from the National Institute for Health Research (NIHR) Programme Grants for Applied Research funding scheme (RP-PG-0707-10010). The views expressed in this publication are those of the author and not necessarily those of the NHS, the NIHR or the Department of Health.

Author information

Authors and Affiliations

National Perinatal Epidemiology Unit, University of Oxford, Old Road Campus, Headington, Oxford, OX3 7LF, UK
Dimitrios Rovithis

Authors

Dimitrios Rovithis
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Dimitrios Rovithis.

Additional information

Competing interests

The author declares that he has no competing interests.

Electronic supplementary material

Additional file 1: Structured template used for the review. (PDF 51 KB)

13561_2013_62_MOESM2_ESM.pdf

Additional file 2: Table S1: General information for the reviewed studies. Table S2. Analytical approaches employed in the reviewed studies. Table S3. Reviewer’s appraisal and comments. Table S4. Key data sources identified from the reviewed studies. (PDF 614 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Rovithis, D. Do health economic evaluations using observational data provide reliable assessment of treatment effects?. Health Econ Rev 3, 21 (2013). https://doi.org/10.1186/2191-1991-3-21

Download citation

Received: 03 June 2013
Accepted: 30 August 2013
Published: 19 September 2013
DOI: https://doi.org/10.1186/2191-1991-3-21

Do health economic evaluations using observational data provide reliable assessment of treatment effects?

Abstract

Introduction

Review

Eligibility criteria and identification strategy

Review process

Results

Discussion

Conclusions

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Electronic supplementary material

Additional file 1: Structured template used for the review. (PDF 51 KB)

13561_2013_62_MOESM2_ESM.pdf

Authors’ original submitted files for images

Authors’ original file for figure 1

Rights and permissions

About this article

Cite this article

Keywords

Health Economics Review

Contact us

Do health economic evaluations using observational data provide reliable assessment of treatment effects?

Abstract

Introduction

Review

Eligibility criteria and identification strategy

Review process

Results

Discussion

Conclusions

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Electronic supplementary material

Additional file 1: Structured template used for the review. (PDF 51 KB)

13561_2013_62_MOESM2_ESM.pdf

Authors’ original submitted files for images

Authors’ original file for figure 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Health Economics Review

Contact us