Skip to main content

A reconstruction of a medical history from administrative data: with an application to the cost of skin cancer


The medical record is a repository of clinical data, which can greatly enhance the quality of health and healthcare analysis. Administrative data are collected for the purpose of billing and reimbursement, and are valued by health researchers because the data are routinely audited to maintain accurate financial records. However, the quantity of incorporated clinical data can be variable. In this paper we reconstruct the medical record from health service invoices to estimate the cost of treating keratinocytic cancer (KC). The data from an epidemiological survey were linked to an administrative data set supplied by the national health insurer. A matched sampling technique with multivariable analysis was used to estimate cost. A KC treatment was identified with 42 service codes which explicitly nominated treatment of a KC. Algorithms identifying comorbities potentially correlated with KC were constructed from the service codes. The annual cost of a KC treatment was estimated to be AU$667 per individual. The average cost of explicit KC treatments was AU$231, while the cost of generic procedures used to treat KC was AU$436. Our ability to accurately control for the medical history enabled our analysis to quantify and describe the constituent costs of KC treatment.


Empirical analysis in healthcare can be enhanced by controlling for the confounding effects of comorbidities documented within a patient’s medical record. A patient’s medical history can be obtained by direct interview or interrogation of their medical record. However, self-reported medical histories can be subject to a reporting bias, while review of the medical record may be unfeasible or costly. There is a growing appreciation of the benefits of using administrative data to conduct health research [1-3]. Many health insurers, both public and private, generate large datasets for the purpose of either reimbursing physicians or invoicing patients. Although not designed for research, these administrative data, which include item codes identifying discrete episodes of care, are a potentially rich source of clinical information.

In this paper, our aim is to reconstruct a patient’s medical history from the service codes contained within an administrative dataset, to facilitate the estimation of the cost of treating the non-melanoma skin cancers (and which are more accurately described as keratinocyte cancers (KC)). Keratinocyte cancers, which comprise both basal cell carcinomas (BCC) and squamous cell carcinoma (SCC), are cancers with high incidence but low mortality. Worldwide, KC are the most prevalent cancers affecting white-skinned individuals and their incidence is rising rapidly in many countries [4]. High reported incidence rates of KC in Australia (1,170 per 100,000) [5] and the United States (233 per 100,000) [6] ensure that these cancers remain the most costly and fifth most costly to treat in Australia [7] and the United States [8], respectively. However, due to their low mortality rate, many national cancer registries have incomplete or non-existent reporting of KC [4,9]. Therefore, the analysis of administrative data may be particularly useful for health service research of diseases such as KC, where conventional data sources may otherwise be incomplete.

Our literature review identified nine studies that estimated the cost of treating KC using administrative data [7,8,10-16]. Although each study analysed a different dataset, their methods were broadly similar. Typically, they identified an episode of treatment for KC within their data. They then ascribed a mean cost per KC episode before reporting an aggregate cost, for their jurisdiction of interest. However, there was significant heterogeneity with respect to how KC treatments were defined and costed. For example Souza et al. [17] relied on “expert opinion” to define a KC treatment. Data from the Brazilian National Health Service and medical costs supplied by the Brazilian Medical Association were used to estimate aggregate costs. An Australian study published by Fransen et al. analysed an administrative dataset obtained from the national health insurer, Medicare Australia [7]. These data included a unique item code for each medical service delivered. A treatment for KC was identified if one of 37 item codes, which denoted excision of a BCC or SCC, were identified within the data. The corresponding costs were summed and reported. However, the costs of ancillary services such as histology or pharmaceuticals were not included nor were the costs of attendance fees.

The surveyed literature almost entirely reports average and aggregate costs. Three European studies [10-12] identified a KC using the International Classification of Diseases (ICD) codes and costs of treatment were estimated by using national Diagnostic Related Group (DRG) cost weights. Outpatient costs were included in an ad hoc manner, using sub-samples of outpatient cost data. The three US studies [8,14,18] used data collected for the Medicare Current Beneficiary Survey (MCBS) 1992–95 to derive cost estimates for KC. Data from each Medicare claim was linked to the appropriate specialty [14] and costs summed and reported. While these costing methods were no doubt sound, simply reporting an aggregate cost provides little opportunity for researchers and policy makers to further integrate these estimates. As the focus of these studies tended to be on the procedure rather than the individual questions concerning who is consuming which KC treatments remained largely unanswered.

We could identify one study, which employed a different method. Bentzen et al. [16] estimate the cost of treating KC in Denmark. The Danish National Patient Register tracks all inpatient and outpatient health costs. A KC was identified using ICD codes. All individuals treated for a KC in the period 2004 to 2008 were matched to a set of controls (at a ratio of 1:4) on four criteria (age, sex, civil status and residence). The costs of treating KC were calculated as the average annual excess costs per year for patients after diagnosis relative to the matched control cohort. The principal strength of the paper by Bentzen et al. [16] lay in its capacity to analyse patient records, which linked cost and demographic data. While KC incidence was identified by ICD code, the cost of a treatment was not predetermined. Instead Bentzen et al. [16] estimated the cost of treating KC conditional upon a set of demographic controls. An advantage of this approach is that a description of treatment costs can be developed. The principal limitation was that Bentzen et al. [16] only controlled for age, sex, civil status and residence. Human disease can be correlated for a variety of genetic, environmental and social reasons. If available, controls for medical history may have been beneficial.

In this paper, we derive a set of dichotomous variables from treatments documented in an administrative dataset to capture the medical history. Our aims were twofold. Firstly, we control for cost of treating comorbid disease to report an estimate the cost of KC treatment. Secondly, we identified and classified the medical treatments utilised by patients with KC. Controlling for medical history can not only result in more accurate measures of cost but also offer a deeper understanding of component costs.



In 2011, the QSkin study enrolled 43,794 individuals aged 40 to 69 years selected at random from the Queensland electoral roll [19,20]. Overall, 46% of the respondents were male with a mean age of 56 years [19]. The respondents reported their level of sun exposure, skin phenotype, history of skin cancer, demographic and socio-economic characteristics [19]. Ethical approval for the study was received from the QIMR Berghofer Institute of Medical Research Human Research Ethics Committee and the Department of Health. Consent was obtained to link survey data supplied by the respondent to individual level cost data obtained from two publically funded health programs administered by Medicare Australia, the Pharmaceutical Benefits Scheme (PBS) and the Medical Benefits Scheme (MBS). The PBS subsidises the cost of approved pharmaceuticals. The MBS subsidises (i) fee-for-service medical care provided by GPs and specialist physicians delivered in their private consulting rooms and (ii) medical care provided to private patients treated in private and public hospitals. However, the MBS excludes medical services provided to public inpatients. Other inpatient costs (private and public) not included in this cost analysis are non-medical services (nursing, allied health and ancillary services), hospital consumables, administrative overheads and capital depreciation.

Thus, the proportion of the total KC treatment costs captured by this sub-set of healthcare costs identifiable by a MBS item number is uncertain. A national survey of individuals treated for KC (n = 2502) in 2002 reported that 51.1% respondents were treated by general practitioners, 17.6% by dermatologists, 10.3% in skin cancer clinics, 5.9% by plastic surgeons, 3.4% other surgeons and 1% other and 9.2% not stated [21]. However, only 1.6% of respondents indicated that their last KC treatment was conducted in hospital [21]. While these categories are (i) not necessarily mutually exclusive (e.g., plastic surgery can be conducted in a hospital), and (ii) report treatments not costs, it is likely that medical treatments denoted by an MBS item number comprise a very high proportion of total KC treatment costs.

A sub-sample comprised of 2,000 randomly selected individuals with KC matched 1:1 on gender and 5-year age categories, to a group of controls. KC was identified by 42 MBS codesa, which unequivocally indicated a treatment for a BCC or SCC (see Table 1). Data cleaning identified inconsistencies in 0.4% of the cases and 11.95% of the controls, which were subsequently removed. The final sample of 3,753 individuals was comprised of 1,992 cases with KC and 1,761 controls. After matching the case and control cohorts contained an equal proportion of males and females. Individuals with KC were slightly older (57.2 years versus 55.7 years) more likely to be white (95.8% versus 92.3%), born in Australia (85.2% versus 79.6%), less likely to be employed full-time (39.8% versus 45.5%) and not have private health insurance (73.6% versus 67.6%).

Table 1 Observed category 1 costs

Theoretical model

Conceptually an individual with KC could incur three categories of medical costs related to the treatment of KC, two categories of direct treatment costs and one category for related costs. Category 1 costs refer to those procedures, which explicitly identify the treatment of a KC (i.e. the 42 MBS codes detailed above). Category 2 costs refer to non-specific medical treatments, which could also apply to the overall clinical management of KC, for example histopathology, or treatment with antibiotics. Thus, the total cost of treating KC is given by the sum of Category 1 and 2 costs. Category 3 costs refer to the treatment of comorbidities correlated with incidence of KC (e.g. melanoma).

The existence of correlated diseases that generate Category 3 costs could occur because of physiological, environmental, or psychosocial processes. KC is known to be correlated with other cancers [13,22]. Environmental factors such as ultraviolet (UV) radiation are positively correlated with melanoma [23,24] and KC [25]. Negative correlations may also exist, since UV radiation is responsible for the synthesis of vitamin D. Nowson et al. [26] have stated that Vitamin D deficiency is correlated with several diseases including heart disease [27], breast and colon cancer [28], autoimmune diseases such as multiple sclerosis [29], osteoporosis [30,31] and depression [32]. Other implicated diseases include Parkinson’s disease [33], tuberculosis [34] and infectious diseases [31]. Psychosocial factors could affect an individual’s capacity to implement disease prevention measures. Individuals who ignore public health campaigns to mitigate KC might also disregard other disease prevention initiatives.

Empirical model

To control for the potentially confounding effects of the cost of treating correlated comorbidities the following empirical model was estimated.

$$ C=f\left(KC,\mathbf{R}\mathbf{x},\mathbf{H}\mathbf{x},G,\ A\right) $$


$$ \begin{array}{l}\hfill \\ {}\hfill \\ {}C = \left({\mathrm{MBS}}_{\mathrm{Subsidy}} + {\mathrm{MBS}}_{\mathrm{Co}-\mathrm{payment}}\right) + \left({\mathrm{PBS}}_{\mathrm{Subsidy}} + {\mathrm{PBS}}_{\mathrm{Co}-\mathrm{payment}}\right)\hfill \\ {}\begin{array}{l}KC = 1\ \mathrm{if}\ \mathrm{one}\ \mathrm{of}\ 42\ \mathrm{Category}\ 1\ \mathrm{treatments}\\ {}\mathbf{R}\mathbf{x}=16\ \mathrm{concurrent}\ \mathrm{treatments}\\ {}\mathbf{H}\mathbf{x}=16\;\mathrm{past}\ \mathrm{treatments}\end{array}\hfill \\ {}G=\mathrm{Gender}\hfill \\ {}A=\mathrm{Age}\hfill \end{array} $$

The dependent variable Cost, measured in 2012 Australian dollars, was the sum of all MBS and PBS government subsidies and patient co-payments for services utilised from July 2011 to June 2012. Our explanatory variable of interest KC was a dichotomous variable equal to one if the participant received one of 42 identified treatments listed in Table 1.

The principal requirement of our empirical model was that it controlled for the cost of concurrent treatments. The vector Rx, which contained 16 dichotomous variables indicating the treatment of; three autoimmune diseases (asthma, rheumatoid arthritis and multiple sclerosis), two mental illnesses (depression and anxiety), cardiovascular disease and two associated risk factors (hypertension and hyperlipidaemia), four cancers (melanoma, breast, prostate and colorectal), osteoporosis, Parkinson’s disease, tuberculosis and bronchitis, was constructed from item codes supplied by Medicare Australia.

The Merck Manual [35] was reviewed to formulate a complete list of all diagnostic, medical, surgical and pharmacological treatments used to managed each comorbidity of interest. A search of the MBS [36] and PBS [37] websites was conducted to match each identified treatment to its corresponding item codes. These codes were then used to write the identification algorithms. While not all treatments could uniquely identify a diagnosis, many treatments do indicate a diagnosis. For example, antihypertensive medications are indicative of treatment for hypertension and antidepressants are indicative of treatment for depression. We exercised our clinical judgement to ensure that all included item codes could identify a clinical diagnosis. Nonspecific treatments were removed from the algorithms. For example, morphine, which is sometimes used to treat angina, was not used as an indicator of cardiovascular disease. An itemised list of the 1500 MBS and PBS item codes used to identify the 16 comorbidities are attached in Additional file 1. Comorbidity frequencies are reported in the appendix.

The QSkin survey collected information on co-morbidities as free text. The respondents could report up to two medical conditions that required treatment from a specialist doctor and two cancers (other than skin cancer). The written responses were analysed and used to generate Hx, a vector of 16 dichotomous variables indicating previous treatment for the aforementioned diagnoses. The vectors Rx and Hx are complementary. The former is derived from administrative data and reflects concurrent medical treatment, while the latter is derived from self-reported data and captures prior medical treatment. The costs of treating individuals who report a medical history vis-à-vis those who do not, are likely to be systematically different. Therefore, the vector Hx was included to capture severity of disease.

A residual treatment category, treated for other disease, was created to indicate if the respondent was treated for any other disease. Thus, the reference group for our empirical model were the 51 (1.35%) individuals who incurred no medical or pharmacological costs. Two demographic controls for gender and age were included in the specification of the empirical model. The coefficient on KC reflects the annual cost to society of 12 months of KC treatment.

Regression using ordinary least squares (OLS) with skewed cost data can result in heteroskedastic errors [38] and biased variance estimates, invalidating t-statistics and confidence intervals for regression coefficients [39]. Therefore, a generalised linear model (GLM) with the appropriate distributional family was selected using a modified Park test [40]. The advantage of this approach is that a GLM can accommodate heteroskedasticity through selection of the correct distributional family while generating predictions on the cost scale. This approach also enables one to infer the mean cost directly, without the need to retransform OLS estimates obtained with a logged dependent variable [38].


The average cost of all medical services utilised by the QSkin respondents, adjusted for age and sex, was AU$2,477. Individuals who were treated for KC consumed an average AU$2,971, while those who were not treated for KC consumed AU$1,918. The difference in means was statistically significant (p = 0.01). Conceptually, the AU$1,053 differential could be composed of Category 1, 2 and 3 costs. These costs are distilled as follows.

3.1 Category 1 costs

A Category 1 cost was defined as one any of 42 MBS items codes, which directly identified a KC treatment (see Table 1). Columns 1 and 2 report the MBS item description and code, respectively. Column 3 reports service frequency. Columns 4 and 5 report the mean and total costs. The 1,992 individuals treated for KC utilised AU$459,664 in Category 1 services. The mean cost per treated individual was AU$231, of which 77.7% was due to the MBS subsidy and 22.3% was due to the co-payment.

3.2 Direct costs

Direct costs were estimated using a GLM with a log-link function and Poisson family distribution, as determined by modified Park test [40]. Table 2 reports the GLM coefficients and a set of marginal effects. The direct cost (i.e., Category 1 and 2 services) of 12 months of KC treatment (AU$667 (p-value < 0.01)) is the marginal effect of a dichotomous change in KC from zero to one, with covariates held constant at their means. When gender and age were removed, the estimate increased to AU$676 (p-value < 0.01) indicating our estimate is robust with respect to these two covariates. In other specifications, dichotomous variables for education (Nil, School, High school, Trade, Certificate and University) and employment (Full-time, Part-time, Home duties, Student, Retired and Other) were included to control for socioeconomic status. However, F tests for joint statistical significance were rejected and their inclusion had no material effect on the coefficient for KC.

Table 2 Coefficients and marginal effects for a dichotomous change in keratinocyte cancer (KC)

The marginal effects of KC were also estimated controlling for comorbidity and age. Table 2, Column 4 reports the marginal effect of a dichotomous change in KC with each comorbidity set equal to one, and all other covariates held constant at their mean. Hence, for individuals treated for melanoma (n = 53), the marginal effect of 12 months of KC treatment was AU$988. The marginal effect of KC was estimated for each age, 40 through to 70. At age 40, the marginal effect of 12 months of KC treatment was AU$614. The marginal effect increased linearly, by AU$3.30 per year, to AU$713 at age 70.

Cost summary

The results presented in Table 3, Column 2 summarise the principal findings of this paper. The annual MBS subsidy per KC treatment was AU$677 per individual. As this estimate is a derived value, we cannot directly differentiate between the subsidy and co-payment. If the cost distribution was comparable to Category 1 services, this would imply the MBS subsidy was AU$518 (77.7%) and the co-payment AU$149 co-payment (22.3%). The average cost of Category 1 services was AU$230 (see Table 1 for description costs). The cost of Category 2 services used to treat KC was AU$437 (i.e. AU$667 – AU$230). A further AU$386 (i.e. AU$1,053 – AU$667) was spent on Category 3 services treating diseases correlated with KC.

Table 3 Costs of medical care used utilised by individuals with KC

Category 2 costs

When estimated with OLS the errors were not normally distributed (Shapiro-Wilk test: W = 0.48 (p < 0.01)) and heteroskedastic (χ 2 (1) = 4148.8 (p-value < 0.01)). We estimate our model using a GLM. Category 2 costs account for 66% of the costs attributable to the management of KC. Thus our best estimate of total Category 2 costs related to the treatment of KC is AU$868,512 (i.e. 1,992 * AU$436). Due to their magnitude, we sought to identify those Category 2 costs in the following way. First, the data were transformed into wide format, such that each observation was now a medical service. All Category 1 services were removed. The frequencies of the residual services were cross-tabulated with a dichotomous variable equal to one if the service was delivered to an individual with KC and zero if otherwise. The frequency difference, estimates the number of Category 2 and 3 services utilised.

$$ \mathrm{Freq}{.}_{\mathrm{KC}=1}\hbox{--}\ \mathrm{Freq}{.}_{\mathrm{KC}=0} = \mathrm{Category}\ 2\ \mathrm{services} + \mathrm{Category}\ 3\ \mathrm{services}. $$

Table 4 presents a summary of our findings. Column 1 lists clinical services groups, with their MBS item codes listed in the table notes. Columns 2 and 3 list the treatment frequencies for the cohorts with and without a KC and Column 4 reports the frequency differences. The cost of each service category is given by the product of Columns 4 and 5 and is reported in Column 6. After inspecting the service descriptors, we could identify three groups of medical services, which could plausibly be attributed to the treatment of KC. In our study, the KC cohort consumed an additional AU$191,115 on reconstructive surgeries, AU$167,096 on pathology and AU$453,623 on consultation fees.

Table 4 Potential category 2 costs

The data presented in Figure 1, summarise the analyses presented in the paper. In our sample, the total cost of all additional medical services consumed by people with a KC is AU$1,053 per year. The costs of direct treatments or Category 1 costs account for 22% of total costs. A detailed enumeration of these costs is reported in Table 1. Other generic treatments of KC or Category 2 costs account for 41% of the total. The principal cost components are approximately medical attendance fees 10.1%, other surgical costs 10.2%, anaesthetic fees 2.8% and pathology 8.9%. Other unspecified services contribute a further 9.3% to total costs. The cost of correlated comorbidities accounts for 37% of the total cost.

Figure 1
figure 1

The costs of treating individuals with KC (Total cost AU$1,053).


The medical record is comprised of a complex array of clinical data, which if available, can enhance the quality of analysis in health research. For this work, we utilised an administrative dataset to reconstruct the medical record of a sample of study participants from first principals. Our motivation was to estimate and identify the direct and associated costs of treating KC. Including controls for the medical record enabled the cost of KC treatment to be estimated. We identified 16 comorbidities of interest. Each comorbidity was identified by a list of diagnostic, medical, surgical and pharmacological item codes associated with the treatment of that disease. Multivariate regression was used to generate an estimate of cost, which controlled for the patient’s medical record with 16 dichotomous covariates.

The decision to identify three categories of medical costs was motivated by our desire to provide a more comprehensive description of the medical treatments utilised by patients with KC, than is currently available in the literature. Whereas Fransen et al. utilised a minimalist definition of KC, where a “treatment” was defined by 37 Category 1procedures directly associated with KC excision, our method has included Category 2 procedures in the analysis. Results derived from our GLM suggest that 12 months of KC treatment cost AU$667 per patient. Given a national age-standardised rate (ASR) of 3,271 KC services per 100,000 people [7], the implied cost of MBS services to the nation is AU$228 million per yearb, is considerably higher than AU$93.5 million reported by Fransen et al. [7].

Category 2 costs were found to account for 65% (AU$436) of the total KC treatment (AU$667). The robustness of this result was tested by constructing an ad hoc tabulation of medical services we could attribute to the treatment of KC. In addition to the Category 1 services, we could identify three distinct sub-groups of Category 2 costs, which were utilised by the KC cohort. Firstly, AU$191,000 was spent on reconstructive surgeries. Localised disfigurement is often a consequence of KC. Secondly, an additional AU$167,000 was spent on pathology. Histological examination of excised tissue plays a pivotal role in differentiating benign and malignant KC and therefore we believe that a significant proportion of these costs were due to KC. Thirdly the KC cohort consumed an additional AU$453,623 in medical fees.

While not all medical fees can be attributed to the treatment of KC, a pro rata adjustment implied by our GLM model, suggests that at least 63% [i.e. (667/1053)*100] of these medical fees were incurred managing KC. The surgical management of KC vis-à-visc correlated comorbidities is likely to be labour intensive as surgical management in the ambulatory setting can require separate appointments to diagnose biopsy, treat and provide follow-up care, unlike medical conditions, which can be diagnosed and treated within a single appointment. Inspection of the data suggests that in excess of AU$644,000 (74%) of the Category 2 costs implied by the GLM estimate can readily be identifiedc.

A strength of this study was our utilisation of data from a matched population-based sample of individuals linked to an administrative cost dataset supplied by the Department of Health, Australian Government. Our decision to control for medical history was justified because the distribution of comorbid disease was not randomly distributed amongst cases and controls. Treatment rates for hypertension (18.5% vs. 15.9%), colorectal cancer (8.5% vs. 6.1%) and prostate cancer (6.9% vs. 5.1%) were higher for individuals treated for KC. As a result, a large proportion of the comorbities were statistically significant. For individuals who received treatment for one of the 16 comorbidities the marginal effect of a KC was greater than the mean (AU$667). Patients with other cancer diagnoses (e.g. melanoma, breast, colorectal and prostate) had KC costs about AU$300 (45%) higher than those without. Furthermore, each additional year of life increased the marginal cost of KC treatment by AU$3.30 per year.

A recently published paper by Ong et al. [42] analysed the ICD-10 codes included in 8 million medical records from the United Kingdom. In patients aged 45–69 years, they report the relative risks of being treated for melanoma (9.4), breast cancer (1.25), prostate cancer (1.21), colon cancer (1.30) and rectal cancer (2.59) given a diagnosis of KC, which closely correspond to our estimates for melanoma (14.73), breast cancer (1.24), prostate cancer (1.37) and colorectal cancer (1.38)d. This suggests that the analysis of medical invoices can offer an accurate insight into the patients’ medical history.

Our study has a number of limitations. While our study collected administrative data that is likely to be comprehensive in most medical services for KC, other omitted costs to the health system will underestimate our figures. We did not have access to linked data for non-Medicare services, the inpatient costs for a hospital admission or other community services. Nor did our survey data include out-of-pocket expenses or indirect costs related to interruptions to employment associated with treatments. Furthermore, the sample includes individuals from one state of Australia with very high rates of skin cancer and preventive behaviour may differ from elsewhere. In Queensland, the per patient costs for treating KC may be minimised through the predominance of care delivered in office-based GP clinics and dedicated GP-run skin cancer clinics. The principal methodological limitation of our approach remains the exclusion of other relevant comorbidities from our statistical models. If relevant, the non-inclusion of these variables is likely to result in an over estimation of the Category 2 costs associated with treating KC and an underestimation of the Category 3 costs of comorbidities correlated with KC.


The methodology used in this paper has a much broader application than estimating the costs of KC. Being able to control for confounding effects within an individual’s medical history can enhance empirical analysis in healthcare. In this paper, medial service codes were utilised to reconstruct a medical record from first principles. These data have greatly enriched the breadth of our analysis, thus enabling further examination and reporting of a much richer description of the determinants of cost.


aOur list of 42 MBS codes contained all of the 37 codes identified by Fransen et al. [7].

b-Age Standardised Rate (ASR) of KC services per 100,000 in 2011 = 3,271 [6].

- Population = 22,319,006 [6].

- MBS subsidy per 12 month KSC treatment = AU$518 (i.e. 667 * 0.777) (QSkin).

- KSC treatments per year = 1.66 (QSkin).

- ASR/100,000 * Population* MBS subsidy/ KSC treatments per year = AU$228 m.

cAU$191,115 + AU$167,096 + (0.63 * AU$453,623) = AU$643,933.

dA complete set of estimated relative risks for each comorbidity are available from the corresponding author upon request.


  1. Ayanian J. Using administrative data to assess health care outcomes. Eur Heart J. 1999;20(23):1689–91.

    Article  CAS  PubMed  Google Scholar 

  2. Quan H, Sundararajan V, Halfon P, Fong A, Burnand B, Luthi J-C, et al. Coding algorithms for defining comorbidities in ICD-9-CM and ICD-10 administrative data. Med Care. 2005;43(11):1130–9.

    Article  PubMed  Google Scholar 

  3. Riley GF. Administrative and claims records as sources of health care cost data. Med Care. 2009;47(7_Supplement_1):S51–S5.

    Article  PubMed  Google Scholar 

  4. Lomas A, Leonardi‐Bee J, Bath‐Hextall F. A systematic review of worldwide incidence of nonmelanoma skin cancer. Br J Dermatol. 2012;166(5):1069–80.

    Article  CAS  PubMed  Google Scholar 

  5. Staples MP, Elwood M, Burton RC, Williams JL, Marks R, Giles GG. Non-melanoma skin cancer in Australia: the 2002 national survey and trends since 1985. Med J Aust. 2006;184(1):6–10.

    PubMed  Google Scholar 

  6. Miller DL, Weinstock MA. Nonmelanoma skin cancer in the United States: incidence. J Am Acad Dermatol. 1994;30(5):774–8.

    Article  CAS  PubMed  Google Scholar 

  7. Fransen M, Karahalios A, Sharma N, English DR, Giles GG, Sinclair RD. Non-melanoma skin cancer in Australia. Med J Aust. 2012;197(10):565–8.

    Article  PubMed  Google Scholar 

  8. Housman TS, Feldman SR, Williford PM, Fleischer Jr AB, Goldman ND, Acostamadiedo JM, et al. Skin cancer is among the most costly of all cancers to treat for the Medicare population. J Am Acad Dermatol. 2003;48(3):425–9.

    Article  PubMed  Google Scholar 

  9. Eisemann N, Waldmann A, Geller AC, Weinstock MA, Volkmer B, Greinert R, et al. Non-Melanoma skin cancer incidence and impact of skin cancer screening on incidence. Journal of Investigative Dermatology. 2013.

  10. Nilsson GH, Carlsson L, Dal H, Ullen H. Skin diseases caused by ultraviolet radiation: the cost of illness. Int J Technol Assess Health Care. 2003;19(4):724–30.

    Article  PubMed  Google Scholar 

  11. Tinghog G, Carlsson P, Synnerstad I, Rosdahl I. Societal cost of skin cancer in Sweden in 2005. Acta Derm Venereol. 2008;88(5):467–73.

    Article  PubMed  Google Scholar 

  12. Stang A, Stausberg J, Boedeker W, Kerek-Bodden H, Jockel KH. Nationwide hospitalization costs of skin melanoma and non-melanoma skin cancer in Germany. J Eur Acad Dermatol Venereol. 2008;22(1):65–72.

    CAS  PubMed  Google Scholar 

  13. Chen J, Ruczinski I, Jorgensen TJ, Yenokyan G, Yao Y, Alani R, et al. Nonmelanoma skin cancer and risk for subsequent malignancy. J Natl Cancer Inst. 2008;100(17):1215–22.

    Article  PubMed Central  PubMed  Google Scholar 

  14. Rogers HW, Coldiron BM. Analysis of skin cancer treatment and costs in the United States medicare population, 1996–2008. Dermatol Surg. 2013;39(1pt1):35–42.

    Article  CAS  PubMed  Google Scholar 

  15. Souza RJ, Mattedi AP, Correa MP, Rezende ML, Ferreira AC. An estimate of the cost of treating non-melanoma skin cancer in the state of Sao Paulo, Brazil. An Bras Dermatol. 2011;86(4):657–62.

    Article  PubMed  Google Scholar 

  16. Bentzen J, Kjellberg J, Thorgaard C, Engholm G, Phillip A, Storm HH. Costs of illness for melanoma and nonmelanoma skin cancer in Denmark. Eur J Cancer Prev. 2013;22(6):569–76.

    Article  PubMed  Google Scholar 

  17. Souza RJ, Mattedi AP, Rezende ML, Correa Mde P, Duarte EM. An estimate of the cost of treating melanoma disease in the state of Sao Paulo - Brazil. An Bras Dermatol. 2009;84(3):237–43.

    Article  PubMed  Google Scholar 

  18. Chen JG, Fleischer AB, Smith ED, Kancler C, Goldman ND, Williford PM, et al. Cost of nonmelanoma skin cancer treatment in the United States. Dermatol Surg. 2001;27(12):1035–8.

    CAS  PubMed  Google Scholar 

  19. Olsen CM, Green AC, Neale RE, Webb PM, Cicero RA, Jackman LM, et al. Cohort profile: the QSkin sun and health study. Int J Epidemiol. 2012;41(4):929-i.

    Article  Google Scholar 

  20. Morze CJ, Olsen CM, Perry SL, Jackman LM, Ranieri BA, O’Brien SM, et al. Good test–retest reproducibility for an instrument to capture self-reported melanoma risk factors. J Clin Epidemiol. 2012;65(12):1329–36.

    Article  PubMed  Google Scholar 

  21. National Cancer Control Initiative. The 2002 National Non-Melanoma Skin Cancer Survey. A report by the NCCI Non-Melanoma Skin Cancer Working Group. Edited by MP Staples. Melbourne: NCCI; 2003.

    Google Scholar 

  22. Wheless L, Black J, Alberg AJ. Nonmelanoma skin cancer and the risk of second primary cancers: a systematic review. Cancer Epidemiol Biomarkers Prev. 2010;19(7):1686–95.

    Article  PubMed  Google Scholar 

  23. Jhappan C, Noonan FP, Merlino G. Ultraviolet radiation and cutaneous malignant melanoma. Oncogene. 2003;22(20):3099–112.

    Article  CAS  PubMed  Google Scholar 

  24. Australian Institute of Health and Welfare (AIHW). Health Expenditure Australia 2010–11. AIHW cat. no. HWE 56. Canberra: Australian Institute of Health and Welfare; 2011. Report No.

    Google Scholar 

  25. Cristofolini M, Bianchi R, Boi S, Decarli A, Hanau C, Micciolo R, et al. Analysis of the cost-effectiveness ratio of the health campaign for the early diagnosis of cutaneous melanoma in Trentino, Italy. Cancer. 1993;71(2):370–4.

    Article  CAS  PubMed  Google Scholar 

  26. Nowson CA, McGrath JJ, Ebeling PR, Haikerwal A, Daly RM, Sanders KM, et al. Vitamin D and health in adults in Australia and New Zealand: a position statement. Med J Aust. 2012;196(11):686–7.

    Article  PubMed  Google Scholar 

  27. Scragg R, Holdaway I, Jackson R, Lim T. Plasma 25-hydroxyvitamin D and its relation to physical activity and other heart disease risk factors in the general population. Ann Epidemiol. 1992;2(5):697–703.

    Article  CAS  PubMed  Google Scholar 

  28. Gandini S, Boniol M, Haukka J, Byrnes G, Cox B, Sneyd MJ, et al. Meta‐analysis of observational studies of serum 25‐hydroxyvitamin D levels and colorectal, breast and prostate cancer and colorectal adenoma. Int J Cancer. 2010;128(6):1414–24.

    Article  Google Scholar 

  29. van der Mei PDI. Vitamin D levels in people with multiple sclerosis and community controls in Tasmania, Australia. J Neurol. 2007;254(5):581–90.

    Article  PubMed  Google Scholar 

  30. Avenell A, Gillespie W, Gillespie L, O’Connell D. Vitamin D and vitamin D analogues for preventing fractures associated with involutional and post-menopausal osteoporosis. ACP J Club. 2006;144(1):14.

    Google Scholar 

  31. Holick MF. Vitamin D, deficiency. New Engl J Med. 2007;357(3):266–81.

    Article  CAS  PubMed  Google Scholar 

  32. Milaneschi Y, Shardell M, Corsi AM, Vazzana R, Bandinelli S, Guralnik JM, et al. Serum 25-hydroxyvitamin D and depressive symptoms in older women and men. J Clin Endocrinol Metabol. 2010;95(7):3225–33.

    Article  CAS  Google Scholar 

  33. Knekt P, Kilkkinen A, Rissanen H, Marniemi J, Saaksjarvi K, Heliovaara M. Serum vitamin D and the risk of Parkinson disease. Arch Neurol. 2010;67(7):808.

    Article  PubMed Central  PubMed  Google Scholar 

  34. Nnoaham KE, Clarke A. Low serum vitamin D levels and tuberculosis: a systematic review and meta-analysis. Int J Epidemiol. 2008;37(1):113–9.

    Article  PubMed  Google Scholar 

  35. Porter R, Kaplan J. The Merck Manual Online. 2011;2: (accessed June 2013).

  36. Australian Government Department of Health. MBS Online Medical Benefits Schedule Canberra Australian Government Department of Health. 2014. Available from:

    Google Scholar 

  37. Pharmaceutical Benefits Scheme (PBS). MBS Online Medical Benefits Schedule Canberra Australian Government Department of Health. 2013. Available from:

    Google Scholar 

  38. Jones AM. Models for Health Care: HEDG Working Paper 10/01York: University of York 2010.39. In: Wooldridge JM, editor. Introductory econometrics: a modern approach. USA: Southern-Western College Publishing; 2000.

    Google Scholar 

  39. Manning WG, Mullahy J. Estimating log models: to transform or not to transform? J Health Econ. 2001;20(4):461–94.

    Article  CAS  PubMed  Google Scholar 

  40. Mendenhall W, Reinmuth J. Statistics for Management and Economics North Scituate. Massachusetts: Duxbury Press; 1978.

    Google Scholar 

  41. Ong ELH, Goldacre R, Hoang U, Sinclair R, Goldacre M. Subsequent primary malignancies in patients with nonmelanoma skin cancer in England: a national record-linkage study. Cancer Epidemiol Biomarkers Prev. 2014;23(3):490–8.

    Article  PubMed  Google Scholar 

  42. Ong ELH, Goldacre R, Hoang U, Sinclair R, Goldacre M. Subsequent primary malignancies in patients with nonmelanoma skin cancer in england: a national record-linkage study. Cancer Epidemiol Biomarkers Prev. 2014;23(3):490–8.

    Article  PubMed  Google Scholar 

Download references


Dr Rowell received funding through a postdoctoral fellowship from the National Health and Medical Research Council (NHMRC) Centre for Research Excellence in Sun and Health. Prof Whiteman is supported by a Research Fellowship from the NHMRC. The QSkin Study is supported by a Program Grant from the NHMRC (552429).

Author information

Authors and Affiliations


Corresponding author

Correspondence to David Rowell.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

DR analyzed the data and wrote the manuscript. LGG analyzed the data and revised the manuscript. CMO collected the data and revised the manuscript. DCW collected the data and revised the manuscript. All authors read and approved the final manuscript.

Additional file

Additional file 1:

The Medicare Benefits Scheme item codes used to identify the covariates used to reconstruct the a medical history.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits use, duplication, adaptation, distribution, and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Rowell, D., Gordon, L.G., Olsen, C.M. et al. A reconstruction of a medical history from administrative data: with an application to the cost of skin cancer. Health Econ Rev 5, 4 (2015).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:

JEL codes