Skip to main content

Using administrative register data for adjusting non-response bias in the finnish gambling harms survey

Abstract

Background

Low response rates are an increasing problem in population-based gambling surveys. Selective non-response may cause biased findings. Supporting information from administrative registers, whenever available for non-respondents can be utilized to estimate the effect of non-response to the gambling-related outcomes. The aim of this study is to evaluate the effect of non-response to the prevalences of two gambling measures: gambling participation and problem gambling.

Methods

Population-based Finnish Gambling Harms mixed-mode (online and postal) Survey 2016 was conducted among 18-year-olds or older in three geographical regions in Finland (response rate 36.2%). Weighted prevalences of gambling measures were calculated exploiting the respondents’ data (n = 7,153). The study sample (N = 19,741) was individually linked to socio-demographic data from Statistics Finland to obtain information on both respondents and non-respondents. Multiple imputation was utilized to calculate the adjusted prevalences of gambling measures by register-based variables: sex, age, residential area, family structure, household equivalised disposable income, highest education degree, employment status, and native language. Crude prevalences were compared against weighted and non-response adjusted prevalences.

Results

For gambling participation, there was no difference between the crude (81.9% [95% CI 81.0–82.8%]) and the weighted (83.2% [95% CI 82.3–84.0%]) prevalences (p-value 0.09), or between the crude and the non-response adjusted (82.3% [95% CI 81.6–83.0%]) prevalences (p-value 0.49). However, the non-response adjusted (2.8% [95% CI 2.4–3.3%]) prevalence of problem gambling was higher compared to the crude (1.9% [95% CI 1.6–2.3%]) prevalence (p-value 0.002), while there was no difference between the crude and the weighted (2.2% [95% CI 1.9–2.7%]) prevalences (p-value 0.26).

Conclusions

Non-response had an effect of problem gambling prevalence in a Finnish Gambling Harms Survey 2016. The presence of non-response bias should be checked when analysing population surveys. Using administrative register data enables unique opportunities to increase the reliability of the results and to adjust the estimates for non-response.

Trial registration

Clinical trial number: not applicable.

Peer Review reports

Background

Trends in Finnish gambling participation and problem gambling has been monitored using population-based studies [1], as in many other Western countries [2,3,4,5]. Random sampling methods are widely accepted as the best foundation for representative data. The quality of the measurement, systematic and random population coverage-related errors as well as representativeness of the target population has an impact on interpretation of survey results. However, low response rates are an increasing problem with population-based gambling surveys and vary depending on the data collection mode/methods and time [6]. The response rates of all types of surveys in general have continued to decrease internationally in the last ten years, and often already 30–40% response rate is considered good [7, 8]. At the same time, combinations of online and postal surveys have become more popular, even though challenges with their response rates are known. Understanding non-response and finding methods to increase representativeness of the data are essential, while these actions also provide information which can be exploited for reaching higher response rates. Therefore, the impact of non-response is well worth assessing [6].

Non-response, if selective, may cause biased findings. Supporting information from administrative registers, whenever available for non-respondents, can be used to define the profile of non-respondents. In gambling surveys, the association of non-response and socio-demographic factors, for instance sex, age, family structure, socio-economic status, residential area, ethnic background, has been studied with the results being somewhat inconsistent.

Non-respondents are typically more often men [6, 9,10,11,12,13,14,15,16]. On the other hand, based on the Finnish Gambling 2015 and 2019 studies, the response rate was higher among men compared with women [1, 17]. Further analysis with the 2015 data indicated that the sex difference disappeared when adjusting for net income [18]. The Finnish Gambling 2023 study had higher response rate for women compared to men [19].

Younger age is often associated with lower response rate [1, 12,13,14,15,16, 19, 20]. Furthermore, if there was no upper age limit for a study, the highest age group (85-year-olds or older) also had lower response rate [6]. In Sweden, however, there has been inconsistency in results. In 2015 study, the youngest (16–19-year-olds) were most active to participate, and 20–39-year-olds had the lowest response rate [9], while in most recent Swedish study conducted in 2021, 20–24-year-olds had the lowest response rate, while 70–84-year-olds had the highest [11].

The association between family structure and non-response is inconsistent. Non-respondents were more likely to live in large families or adult households [16]. Also in the most recent Swedish study, living in family without children was associated with higher response rate compared to living in family with children or being single with or without children [21]. In a Danish population study, the response rate was especially low among unmarried persons [6].

In Great Britain, having a responsible adult or household reference person who has been engaged in routine occupation have been associated with non-response [16]. Low socio-economic status, such as low education, low income, unemployment and/or receiving social welfare, has been linked with lower response rate [6, 18, 21]. However, Kontto et al. noticed that when net income was added to the non-response model with the above-mentioned Finnish data, the link between unemployed status (vs. self-employed) and non-response became non-significant, while the non-response was more prevalent among the lowest quintile of net income [18]. This may cause bias while studying gambling behaviour of socio-economically vulnerable individuals.

The impact of the residential area is not clear. In Sweden, the response rates were similar between those living in urban and non-urban areas [21]. On the other hand, some studies have reported differences between response rates by area [14, 16]. In Denmark and Iceland, those living in capital area participated significantly less often than those living in other areas [6, 13].

The response rate was especially low among persons with a different ethnic background than Danish which might be explained by the fact that the questionnaire was only available in Danish [6]. Also, people born outside Nordic countries had a low response rate [21].

The profile of non-response may cause bias in the prevalence of gambling measures, for instance gambling participation and problem gambling. In recent Nordic gambling studies, the prevalence of gambling participation was higher among men, older age groups, higher socio-economic groups, and individuals living in rural areas [11, 13, 14, 19]. However, the same socio-demographic factors which are associated with problem gambling are also associated with non-response [11, 13, 14, 19, 20, 22,23,24]. This may cause more bias to the prevalence estimates of problem gambling compared to gambling participation.

A method widely used for lowering the bias caused by non-response is weighting [25]. Auxiliary information, for instance demographic characteristics, on the population is exploited to ensure that the results are representative of the population. However, the number of variables used for weighting is often quite limited, leaving out characteristics, for instance socio-demographic factors, which may have an association with non-response.

Information about non-response can be used to estimate the effect of non-response to the study outcomes by obtaining additional auxiliary information through record linkage to administrative registers. This individual-level data on both respondents and non-respondents provides an opportunity to use sophisticated methods for adjusting bias caused by non-response, such as multiple imputation (MI). To our knowledge, these methods have not been used so far with gambling-related outcomes and our study aims to fill this gap in knowledge. Therefore, the aim of this study is to evaluate the effect of non-response to the prevalences of gambling participation and problem gambling in the Finnish Gambling Harms Survey 2016 [26]. The crude prevalences are compered to non-response adjusted prevalences calculated using MI. In comparison, crude prevalences are compared to weighted prevalences.

Methods

Participants and procedure

The Finnish Institute for Health and Welfare was responsible for design and analysis of data of the population-based Finnish Gambling Harms Survey 2016, while Statistics Finland was responsible for the data collection. Originally, the aim of this longitudinal study was to evaluate gambling, gambling-related harm, and opinions on gambling marketing linked with the reform of the Finnish gambling monopoly. Finnish Gambling Harms Survey 2016 was conducted in three selected geographical regions (Uusimaa, Pirkanmaa, and Kymenlaakso) covering for around 42% of the total Finnish population using mixed-mode survey [26, 27]. The sample consisted of six age-region strata with two age groups (18–24-year-olds, and 25-year-olds or older) from each of three regions. The study sample was selected by strata using systematic random sampling from the population frame sorted by domicile code from the national population information system based on the following criteria: (1) 18-year-old or older, (2) the ability to understand Finnish or Swedish, and (3) excluding institutionalized persons, such as prisoners, mental health patients and the infirm. 18–24-year-olds were oversampled due to fact that problem gambling being most prevalent is that age group: 15% of 18–24-year-olds were sampled for the survey while they represent only 10% of the population [26].

The data collection mode included online and postal surveys, which were available in both Finnish and Swedish. The invitation letter for the invitees was sent to their home address retrieved from the population information system. Both the invitation letter and the first reminder included a link to the web questionnaire and a personal participation code. In this letter, information about the upcoming option to answer to the paper questionnaire was provided. The next two reminder letters also included the paper questionnaire and a prepaid return envelope. The data collection was conducted between January and March 2017.

After excluding overcoverage (N = 67), the study sample size was 19,933 persons. Ultimately, 7,186 participated in the study. Of the respondents, 71% (N = 5,084) participated using the online survey while 29% (N = 2,102) completed the postal survey. The study sample was linked to administrative socio-demographic data from Statistics Finland through personal identification code issued to every Finnish citizen to obtain information on both respondents and non-respondents. 192 individuals were excluded from the study sample due to missing household information with the reasons for missingness including living abroad, homelessness, living in uncommon or unusual large household, and the place of residence being unknown. The final analysis data consisted of 7,153 respondents and 12,588 non-respondents (N = 19,741) with a response rate of 36.2% (Fig. 1).

Fig. 1
figure 1

Flowchart of the exclusion of sample subjects from the gross sample to the analysis dataset

Measures

Socio-demographic variables

The administrative socio-demographic data from Statistics Finland for all individuals in the study sample was obtained for following variables (categories in parenthesis): sex (men, women), age (continuous), number of family members (continuous), number of under 18-year-old family members (continuous), number of household members (continuous), household’s disposable income (continuous), highest education degree (primary, secondary, and tertiary), employment status (unemployed, entrepreneur, white-collar, blue-collar, student, retired, and other or missing), residential area (rural, semi-urban, and urban) and native language (Finnish, Swedish, and other).

Household equivalised disposable income (HEDI) was derived by dividing the household’s disposable income by its equivalent size. The household’s equivalent size was calculated as the sum of weights in a household. A weight was assigned to all household members using a following rule: 1.0 to the first adult; 0.5 to the second and each subsequent 18-year-old or older family member, and to all non-family household members regardless of age; 0.3 to each under 18-year-old family member. There was no information available concerning the age of non-family household members. The rule was a modification to the definition by Eurostat Statistics Explained definition [28] where the cut-off age is 14 years and ages of all household members is known. HEDI was categorized into tertiles. Family structure was derived from number of family members and number of under 18-year-old family members resulting following categories: living alone, family with children (under 18-year-olds), and family with only adults (18-year-olds or older). Age was categorized into categories 18–24, 25–34, 35–44, 45–54, 55–64, 65–74, 75–84, and 85 + years, since the response rate decreased among respondents older than 85 years (Supplementary Fig. 1).

Gambling measures

Gambling participation and problem gambling were derived only for the respondents. In the questionnaire the time frame for both gambling participation and problem gambling was past year, i.e. the calendar year 2016. Gambling participation variable has two categories (0 = ‘not a past-year gambler’ and 1 = ‘past-year gambler’). A respondent was classified as a past-year gambler if he/she had gambled at least one game type (out of 18 listed game types) during the previous year, (i.e. year 2016) and non-past-year gambler otherwise. Gambling refers to playing games for example lottery games, slot machines, scratch cards, sports and horse games, betting games which are also available online.

Problem gambling variable has two categories (0 = no problem gambling’ and 1 = ‘problem gambling’). Past-year gambling severity was evaluated using the 14-item Problem and Pathological Gambling Measure (PPGM) [29]. The PPGM includes three categories: Problems (7 questions), Impaired Control (4 questions), and Other Issues (3 questions). PPGM scores were divided into four categories: recreational gambling, at-risk gambling, problem gambling and pathological gambling. In our study, PPGM questions we asked only from those, who had gambled at least once a month in 2016, and those who had gambled less frequently than monthly were defined recreational gamblers. Furthermore, an additional category was created for those respondents who did not gamble at all. A binary problem gambling variable was derived by combining problem gambling and pathological gambling to indicate problem gambling with all the remaining categories indicating non-problem gambling.

Statistical analysis

Socio-demographic factors sex, age, residential area, family structure, HEDI, highest education degree, employment status, and native language were included in the analyses. Crude and weighted prevalences were calculated using complete case analysis, i.e. exploiting only the respondents with non-missing gambling measures.

For weighted prevalences, Statistics Finland provided the survey weights for each respondent after the following process: First, the design weights were calculated using the six age-region strata. Then, design weights were calibrated to match the population distributions of sex, age, education degree, region, and urban–rural classification using SAS macro Calmar 2 for correcting the possible bias caused by non-response in the sample [26, 30].

MI with logistic regression method was utilized to calculate the adjusted prevalences using R-package mice [31, 32]. Ten copies of the analysis data were generated. For each copy, the missing values of gambling participation and problem gambling were imputed using separate imputation models with all socio-demographic factors included in both imputation models before the prevalences were calculated from the imputed data. Finally, the results of ten analysis were combined with the Wald confidence intervals (CIs) of prevalences calculated using R-package miceafter [33]. Differences of crude prevalences against weighted and adjusted prevalences were tested using t-test. The data were analysed using R software version 4.2.2 [34].

Results

Response rates in all subgroups of socio-demographic factors are presented in Fig. 2. Response rates varied from 16.8 to 51.4%. Response rates were higher compared to the overall response rate among women, 55–84-year-olds, those in families with only adults, those in second and third HEDI tertile, those with tertiary education, white-collars, and retired. Response rates were lower compared to the overall response rate among men, 18–54-year-olds, 85-year-olds or older, those living alone or with children, those in the first HEDI tertile, those with primary or secondary education, unemployed, entrepreneurs, blue-collars, those with employment status missing or other, and those with native language other than Finnish or Swedish.

Fig. 2
figure 2

Response rates by socio-demographic factors. Note: dashed line = total response rate of the study sample was 36.2%

Crude, weighted, and adjusted prevalences of gambling participation are presented in Table 1. There were differences in crude prevalences between subgroups in the following socio-demographic factors: sex, age, highest education degree, employment status, and native language. There was no difference between the crude overall prevalence of gambling participation (81.9% [95% CI 81.0–82.8%]) against either the weighted overall prevalence (83.2% [95% CI 82.3–84.0%]) (p-value 0.09), or the adjusted overall prevalence (82.3% [95% CI 81.6–83.0%]) (p-value 0.49). Weighted and adjusted subgroup-specific prevalences were similar compared to the crude prevalences.

Table 1 Crude, weighted, and adjusted prevalences for gambling participation

Crude, weighted, and adjusted prevalences of problem gambling are presented in Table 2. The crude overall prevalence of problem gambling was 1.9% (95% CI 1.6–2.3%). There was a difference between the crude and the adjusted overall prevalences (2.8% [95% CI 2.5–3.2%]) (p-value 0.002) while there was no difference between the crude and the weighted overall prevalences (2.2% [95% CI 1.9–2.7%]) (p-value 0.26). The weighted subgroup-specific prevalences were similar compared to the crude prevalences, while the adjusted prevalences for men, urban area, and Finnish as a native language were higher than the corresponding crude prevalences.

Table 2 Crude, weighted, and adjusted prevalences for problem gambling

Discussion

Our results indicate that there was no difference between crude and adjusted prevalences of gambling participation. However, the adjusted prevalence of problem gambling was higher (2.8%) compared to the crude prevalence (1.9%). These results are line with the fact that same socio-demographic factors which are associated with problem gambling are also associated with non-response, while the same does not apply for gambling participation [11, 13, 14, 19, 20, 22,23,24]. The inclusion of non-respondents of these underrepresented subgroups through MI may increase the prevalence of overall problem gambling. For gambling participation, the prevalence has been found to be higher among men, older age groups, higher socio-economic groups, and individuals living in rural areas [11, 13, 14, 19], which are subgroups with relatively high response rates. Therefore, MI of gambling participation has no effect on the prevalence estimate. There was no difference between crude and weighted prevalences. Survey weights were assigned to each respondent using factors sex, age, education degree, region, and urban–rural classification. Usually, survey weights are defined using factors associated with non-response to improve representability of the data. However, weights should be calculated using good predictors of the outcome instead of the non-response [35]. With more sophisticated methods such as MI, the adjustment for non-response can be executed case-by-case, depending on outcome. The development of both weighting and adjusting methods is needed to increase the reliability of results.

Our analyses are based on a combination of online and postal population-based surveys, and the study yielded the response rate of 36.2%. Response rates varied largely, from 16.8 to 51.4%, between different subgroups, confirming that certain sub-population are clearly more difficult to recruit than others. Such sub-groups included men, 18–54-year-olds, 85-year-olds or older, those living alone or with children, those in the first HEDI tertile, those with primary or secondary education, unemployed, entrepreneurs, blue-collars, those with employment status missing or other, and those with native language other than Finnish or Swedish. Correspondingly, response rates were higher than the overall response rate among women, 55–84-year-olds, those in families with only adults, those in second and third HEDI tertile, those with tertiary education, white-collars, and retired. These results are in line with previous research where non-response rate is usually higher among men, the young ones, the single ones and in lower socio-economic groups [6, 18, 21]. The response rate of our study was higher than the international average of 29.0% for online and postal surveys [5]. Moreover, our response rate is comparable to the response rates in recent Nordic population-based studies: 26.0% in Norway, 28.5% in Sweden, 36.9% in Finland, 37.6% in Denmark, and 42.0% in telephone interview in Iceland [11, 13, 14, 19, 20]. These studies had rather large variation in both methodology and response rates, even in data set collected in rather similar time points and cultural contexts.

These results reinforce the growing need for targeting socio-demographic subgroups with specific data collection measures. For young men, which are one of the most challenging subgroups, possibility to answer to short version first, possibility to select interesting themes from questionnaires, and increased comprehensibility and plain language have been identified as factors as preliminary ideas for increasing response rate [36]. Generally, response rates could be increased by introducing mixed-mode surveys, where potential respondents are contacted with survey mode maximizing their probability to respond. As a simple example, older age groups are more likely to respond to paper questionnaire than younger age groups [37]. Non-response analysis can be used for profiling the individuals in the study sample and designing the data collection based on each profile. In this study, the response rate was relatively high until 85-year-old respondents. Response rates for older age groups may remain high in the future, since the population is aging, and older age groups have better functioning and are more active than their predecessors [38]. Therefore, the population-based surveys should be conducted without upper age limit for study sample to prevent the exclusion of increasingly large part of the population and thus jeopardizing the representativeness of a study sample.

To our knowledge, this study is the first to analyse both non-response of a population study and the effect of non-response with gambling-related outcomes. Similar types of sophisticated analyses have been performed with several other health surveys focusing on smoking [39], alcohol and other drug use [40], and primary health care utilization [25]. In this paper, MI was utilized to calculate the adjusted prevalences of gambling participation and problem gambling by register-based variables: sex, age, residential area, family structure, HEDI, highest education degree, employment status, and native language. Obtaining additional information from administrative registers enables more thorough analysis of the structure of non-response and exploiting MI provides prevalence estimates when controlling for possible non-response bias. Overall, previous gambling-related non-response analyses are mainly based on reporting the subgroup-specific response rates [6, 13, 14, 21].

Despite the development of statistical methods reducing non-response bias, maintaining as high as possible response rates is essential. If the response rates continue to decrease, at some point statistical methods are unable correct the bias, although the decrease of response rate does not necessarily mean the increase in non-response bias [41, 42]. Personalized contact with potential respondents, using a more interesting topic, shorter and well-formatted questionnaires, and incentives are factors associated with higher response rate [43, 44]. Also, concerns about privacy in online surveys [45] must be addressed in a digitalizing society.

Individual-level data from administrative registers offers an excellent opportunity to exploit more sophisticated statistical methods in the analysis of non-response. Countries with register data should be encouraged to invest extra effort to further exploit register data. The incorporation of health information to non-response analysis would probably bring added value to non-response analysis, since there is association between health status and non-response [46,47,48,49].

Strengths and limitations

Our analyses are based on a combination of online and postal population-based survey with a response rate of 36.2%. In the basic report [26] and in other publications [50,51,52,53,54], the response rate was reported as 36.1%. This small difference in response rates is explained by the exclusion of 192 individuals, who had missing register data. In addition, those answering the paper questionnaires tended to be older than those choosing the online survey [26]. Furthermore, gambling participation, particularly online gambling, as well as at-risk gambling and problem gambling were more common among those who participated using the online survey compared with those using the postal version [26]. The results of the latest Finnish gambling population study comparing different survey modes indicates that respondents and their gambling behaviour differs between computer-assisted telephone interview (CATI), and online and postal survey [19]. The prevalence of online gambling was higher among online and postal respondents compared to CATI respondents, while the prevalence of only land-based gambling was higher among CATI respondents. Also, the prevalence of low-risk gambling was higher in online and postal survey than in CATI.

Furthermore, it is known that socially desirable responding cause response bias affecting the accuracy of self-assessments. This notion is important while studying sensitive topics, such as problem gambling. In the context of addictions research, controlling the effect of socially desirable responding on self-reported measures should be considered as a method to help reduce error and improve validity [55]. Our analyses are based on data collected in 2017 and there has been changes in Finnish gambling environment since due to policy changes as well as transnational gambling trends [56]. For instance, the prevalence of gambling non-monopoly online games has increased [19]. Thus, our conclusions might be outdated, and the analysis need to be repeated with newer data to address this.

Conclusions

Non-response had an effect of problem gambling prevalence in the Finnish Gambling Harms Survey 2016. The presence of non-response bias should be checked when analysing population surveys. Using administrative register data enables unique opportunities to increase the reliability of the results and to adjust the estimates for non-response.

Data availability

The Finnish Gambling Harms dataset is available from the Finnish Social Science Data Archive [57]. The register-based data was available in FIONA, the protected environment of Statistics Finland [58]. The latter required a separate contract with THL and Statistics. The results were transferred from FIONA to the authors through a screening process.

Abbreviations

CATI:

Computer-assisted telephone interview

CI:

Confidence interval

Fohm:

Folkhälsomyndigheten

HEDI:

Household equivalised disposable income

MI:

Multiple imputation

OECD:

Organisation for Economic Co-operation and Development

PPGM:

Problem and Pathological Gambling Measure

SCB:

Statistics Sweden

THL:

Finnish Institute for Health and Welfare

References

  1. Salonen A, Lind K, Hagfors H, Castrén S, Kontto J, Rahapelaaminen. peliongelmat ja rahapelaamiseen liittyvät asenteet ja mielipiteet vuosina 2007–2019. Suomalaisten rahapelaaminen 2019. [Gambling, problem gambling and attitudes and opinions towards gambling in 2007–2019. Finnish Gambling 2019]: Report 18/2020. Helsinki, Finland: National Institute for Health and Welfare (THL); 2020. https://urn.fi/URN:ISBN:978-952-343-594-0. Accessed 1 Feb 2025.

  2. Calado F, Griffiths MD. Problem gambling worldwide: an update and systematic review of empirical research (2000–2015). J Behav Addict. 2016. https://doiorg.publicaciones.saludcastillayleon.es/10.1556/2006.5.2016.073.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Gabellini E, Lucchini F, Gattoni ME. Prevalence of problem gambling: A Meta-analysis of recent empirical research (2016–2022). J Gambl Stud. 2023. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s10899-022-10180-0.

    Article  PubMed  Google Scholar 

  4. Tran LT, Wardle H, Colledge-Frisby S, Taylor S, Lynch M, Rehm J, et al. The prevalence of gambling and problematic gambling: a systematic review and meta-analysis. Lancet Public Health. 2024. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/S2468-2667(24)00126-9.

    Article  PubMed  PubMed Central  Google Scholar 

  5. Williams RJ, Volberg RA, Stevens RMG. The Population Prevalence of Problem Gambling: Methodological Influences, Standardized Rates, Jurisdictional Differences, and Worldwide Trends. Ontario, Canada: Ontario Problem Gambling Research Centre and the Ontario Ministry of Health and Long Term Care; 2012. https://hdl.handle.net/10133/3068. Accessed 1 Feb 2025.

  6. Christensen AI, Lau CJ, Kristensen PL, Johnsen SB, Wingstrand A, Friis K, et al. The Danish National health survey: study design, response rate and respondent characteristics in 2010, 2013 and 2017. Scand J Public Health. 2022. https://doiorg.publicaciones.saludcastillayleon.es/10.1177/1403494820966534.

    Article  PubMed  Google Scholar 

  7. Beullens K, Loosveldt G, Vandenplas C, Stoop I. Response rates in the European social survey: increasing, decreasing, or a matter of fieldwork efforts?? Surv Methods - Insights Field. 2018. https://doiorg.publicaciones.saludcastillayleon.es/10.13094/SMIF-2018-00003.

    Article  Google Scholar 

  8. Eggleston J. Frequent survey requests and declining response rates: evidence from the 2020 census and household surveys. J Surv Stat Methodol. 2024. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/jssam/smae022.

    Article  Google Scholar 

  9. Folkhälsomyndigheten (Fohm). Metodbeskrivning för Swelogs befolkningsundersökning om spel och hälsa 2015 [Methods used in the Swelogs national survey on gambling and health 2015]: Article No. 00145–2017. 2017. https://www.folkhalsomyndigheten.se/publikationer-och-material/publikationsarkiv/m/metodbeskrivning-for-swelogs-befolkningsundersokning-om-spel-och-halsa-2015/. Accessed 1 Feb 2025.

  10. Folkhälsomyndigheten (Fohm). Metodbeskrivning för Swelogs befolkningsundersökning om spel och hälsa 2018 [Methods used in the Swelogs national survey on gambling and health 2018]: Article No. 20031. 2020. https://www.folkhalsomyndigheten.se/publikationer-och-material/publikationsarkiv/m/metodbeskrivning-for-swelogs-befolkningsundersokning-om-spel-och-halsa-2018/. Accessed 1 Feb 2025.

  11. Folkhälsomyndigheten (Fohm). Tabellsammanställning för Swelogs prevalensundersökning 2021 [Table compilation for Swelogs prevalence survey 2021]: Article No. 22230. 2023. https://www.folkhalsomyndigheten.se/publikationer-och-material/publikationsarkiv/t/tabellsammanstallning-for-swelogs-prevalens-undersokning-2021/. Accessed 1 Feb 2025.

  12. Fridberg T, Birkelund JF. Pengespil og spilleproblemer i Danmark 2005–2016 [Gambling and gambling problems in Denmark 2005–2016]. Copenhagen, Denmark: SFI– Det Nationale Forskningscenter for Velfærd; 2016. https://www.vive.dk/da/udgivelser/pengespil-og-spilleproblemer-i-danmark-2005-2016-4z6p17xq/. Accessed 1 Feb 2025.

  13. Ólason DÞ. Spilahegðun og spilavandi fullorðinna Íslendinga á árinu 2023 [Gambling behavior and gambling problems among adult Icelandic people in 2023]. Iceland: Dómsmálaráðuneytið; 2024. https://www.stjornarradid.is/library/02-Rit--skyrslur-og-skrar/Sk%C3%BDrsla%20um%20spilaheg%C3%B0un%20og%20spilavanda.pdf. Accessed 1 Feb 2025.

  14. Pallesen S, Mentzoni RA, Syvertsen A, Kristensen JH, Erevik EK, Morken AM. Omfang av penge- og dataspillproblemer i Norge 2022 [Prevalence of gambling- and gaming problems in Norway, 2022]., Bergen. Norway: Universitetet i Bergen; 2023. https://lottstift.no/content/uploads/2023/06/Omfang-av-Penge-og-dataspillproblemer-i-Norge-2022.pdf. Accessed 1 Feb 2025.

  15. Pallesen S, Mentzoni RA, Torsheim T, Erevik E, Molde H, Morken AM. Omfang av penge-og dataspillproblemer i Norge 2019 [Prevalence of gambling- and gaming problems in Norway, 2019]. Bergen. Norway: Universitetet i Bergen; 2020. https://lottstift.no/content/uploads/2021/05/Omfang_av_penge-og_dataspillproblemer_i_Norge_2019-1.pdf. Accessed 1 Feb 2025.

  16. Scholes S, Wardle H, Sproston K, Erens B, Griffiths M, Orford J. Understanding non-response to the British gambling prevalence survey 2007. London, UK: The National Centre for Social Research (NatCen); 2008.

    Google Scholar 

  17. Salonen A, Raisamo S, Suomalaisten rahapelaaminen. 2015. Rahapelaaminen, rahapeliongelmat ja rahapelaamiseen liittyvät asenteet ja mielipiteet 15–74-vuotiailla. [Finnish Gambling 2015. Gambling, gambling problems, and attitudes and opinions on gambling among Finns aged 15–74]: Report 16/2015. Helsinki, Finland: National Institute for Health and Welfare (THL); 2015. https://urn.fi/URN:ISBN:978-952-302-559-2. Accessed 1 Feb 2025.

  18. Kontto J, Tolonen H, Salonen AH. What are we missing? The profile of non-respondents in the Finnish gambling 2015 survey. Scand J Public Health. 2020. https://doiorg.publicaciones.saludcastillayleon.es/10.1177/1403494819849283.

    Article  PubMed  Google Scholar 

  19. Finnish Institute for Health and Welfare (THL). Finnish Gambling 2023– Gambling has decreased, gambling problems have become more prevalent and attitudes towards gambling have changed: Statistical report No. 15/2024. 2024. https://urn.fi/URN:NBN:fi-fe2024040414663. Accessed 1 Feb 2025.

  20. Rambøll. Prævalensundersøgelse af pengespil og pengespilsproblemer i Danmark 2021 [Prevalence study of gambling and gambling problems in Denmark, 2021]. Denmark: Spillemyndigheden. 2022. https://www.spillemyndigheden.dk/uploads/2022-05/Pr%C3%A6valensunders%C3%B8gelse%20af%20pengespil%20og%20ludomani%20i%20Danmark%202021.pdf. Accessed 1 Feb 2015.

  21. Statistics Sweden (SCB). Teknisk rapport Spel och hälsa: Teknisk rapport– En beskrivning av genomförande och metoder [Technical report gambling and health– A description of implementation and methods]. 2021.

  22. Hing N, Russell A, Tolchard B, Nower L. Risk factors for gambling problems: an analysis by gender. J Gambl Stud. 2016. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s10899-015-9548-8.

    Article  PubMed  Google Scholar 

  23. Raybould JN, Larkin M, Tunney RJ. Is there a health inequality in gambling related harms? A systematic review. BMC Public Health. 2021. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12889-021-10337-3.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Sharman S, Butler K, Roberts A. Psychosocial risk factors in disordered gambling: A descriptive systematic overview of vulnerable populations. Addict Behav. 2019. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.addbeh.2019.106071.

    Article  PubMed  Google Scholar 

  25. Jensen HAR, Lau CJ, Davidsen M, Feveile HB, Christensen AI, Ekholm O. The impact of non-response weighting in health surveys for estimates on primary health care utilization. Eur J Public Health. 2022. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/eurpub/ckac032.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Salonen A, Latvala T, Castrén S, Selin J, Hellman M, Rahapelikysely. 2016. Rahapelaaminen, rahapelihaitat ja rahapelien markkinointiin liittyvät mielipiteet Uudellamaalla, Pirkanmaalla ja Kymenlaaksossa [Gambling Harms Survey 2016. Gambling, gambling-related harm, and opinions on gambling marketing in Uusimaa, Pirkanmaa and Kymenlaakso]: Report 9/2017. Helsinki, Finland: National Institute for Health and Welfare (THL); 2017. https://urn.fi/URN:ISBN:978-952-302-976-7. Accessed 1 Feb 2025.

  27. Salonen AH, Hellman M, Latvala T, Castrén S. Gambling participation, gambling habits, gambling-related harm, and opinions on gambling advertising in Finland in 2016. Nord Stud Alcohol Drugs. 2018. https://doiorg.publicaciones.saludcastillayleon.es/10.1177/1455072518765875.

    Article  Google Scholar 

  28. Eurostat, Glossary. Equivalised income. https://ec.europa.eu/eurostat/statistics-explained/index.php?title=Glossary:Equivalised_income. Accessed 1 Feb 2025.

  29. Williams RJ, Volberg RA. Best Practices in the Population Assessment of Problem Gambling. Guelph, Ontario, Canada: Ontario Problem Gambling Research Centre; 2010. https://hdl.handle.net/10133/1259. Accessed 1 Feb 2025.

  30. Sautory O. CALMAR 2: A New Version of the CALMAR Calibration Adjustment Program. Ottawa, Canada: In Proceedings of Statistics Canada International Symposium, Challenges in survey taking for the next decade; 2003. https://www150.statcan.gc.ca/n1/en/pub/11-522-x/2003001/session13/7713-eng.pdf?st=zXh37MkN. Accessed 8 April 2025.

  31. Rubin DB. Multiple imputation for nonresponse in surveys. New York: John Wiley & Sons Inc; 1987.

    Book  Google Scholar 

  32. van Buuren S, Groothuis-Oudshoorn K. Mice: multivariate imputation by chained equations in R. J Stat Softw. 2011. https://doiorg.publicaciones.saludcastillayleon.es/10.18637/jss.v045.i03.

    Article  Google Scholar 

  33. Heymans M. Feb. miceafter: Data and Statistical Analyses after Multiple Imputation. https://mwheymans.github.io/miceafter/. Accessed 1 2025.

  34. R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing. 2022. https://www.R-project.org/. Accessed 1 Feb 2025.

  35. Härkänen T, Karvanen J, Tolonen H, Lehtonen R, Djerf K, Juntunen T, et al. Systematic handling of missing data in complex study designs– experiences from the health 2000 and 2011 surveys. J Appl Stat. 2016. https://doiorg.publicaciones.saludcastillayleon.es/10.1080/02664763.2016.1144725.

    Article  Google Scholar 

  36. Kiviniemi M. Miten lisätä nuorten miesten vastausaktiivisuutta terveyttä ja hyvinvointia koskevissa kyselytutkimuksissa [How to Increase the Response Activity of Young Men in Health and Well-Being Surveys]: Master’s thesis. 2023. https://urn.fi/URN:NBN:fi:amk-2023111229184. Accessed 1 Feb 2025.

  37. Kelfve S, Kivi M, Johansson B, Lindwall M. Going web or staying paper? The use of web-surveys among older people. BMC Med Res Methodol. 2020. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12874-020-01138-0.

    Article  PubMed  PubMed Central  Google Scholar 

  38. Öhman HR, Karppinen H, Lehti TE, Knuutila MT, Tilvis R, Strandberg T, et al. Secular trends in functional abilities, health and psychological well-being among community-dwelling 75- to 95-year-old cohorts over three decades in Helsinki, Finland. Scand J Public Health. 2022. https://doiorg.publicaciones.saludcastillayleon.es/10.1177/14034948211007688.

    Article  PubMed  PubMed Central  Google Scholar 

  39. Kopra J, Härkänen T, Tolonen H, Karvanen J. Correcting for non-ignorable missingness in smoking trends. Stat. 2015. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/sta4.73.

    Article  Google Scholar 

  40. McCabe SE, West BT. Selective nonresponse bias in population-based survey estimates of drug use behaviors in the united States. Soc Psychiatry Psychiatr Epidemiol. 2016. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s00127-015-1122-2.

    Article  PubMed  Google Scholar 

  41. Mölenberg FJM, de Vries C, Burdorf A, van Lenthe FJ. A framework for exploring non-response patterns over time in health surveys. BMC Med Res Methodol. 2021. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12874-021-01221-0.

    Article  PubMed  PubMed Central  Google Scholar 

  42. Sturgis P, Williams J, Brunton-Smith I, Moore J, Fieldwork, Effort. Response rate, and the distribution of survey outcomes: A multilevel Meta-analysis. Public Opin Q. 2017. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/poq/nfw055.

    Article  Google Scholar 

  43. Edwards PJ, Roberts I, Clarke MJ, DiGuiseppi C, Woolf B, Perkins C. Methods to increase response to postal and electronic questionnaires (Review). Cochrane Database Syst Rev. 2023. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/14651858.MR000008.pub5.

    Article  PubMed  PubMed Central  Google Scholar 

  44. Shiyab W, Ferguson C, Rolls K, Halcomb E. Solutions to address low response rates in online surveys. Eur J Cardiovasc Nurs. 2023. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/eurjcn/zvad030.

    Article  PubMed  Google Scholar 

  45. Holtom B, Baruch Y, Aguinis H, Ballinger GA. Survey response rates: trends and a validity assessment framework. Hum Relat. 2022. https://doiorg.publicaciones.saludcastillayleon.es/10.1177/00187267211070769.

    Article  Google Scholar 

  46. Alkerwi A, Sauvageot N, Couffignal S, Albert A, Lair M-L, Guillaume M. Comparison of participants and non-participants to the ORISCAV-LUX population-based study on cardiovascular risk factors in Luxembourg. BMC Med Res Methodol. 2010. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/1471-2288-10-80.

    Article  PubMed  PubMed Central  Google Scholar 

  47. Guyot M, Pelgrims I, Aerts R, Keune H, Remmen R, De Clercq EM, et al. Non-response bias in the analysis of the association between mental health and the urban environment: a cross-sectional study in Brussels, Belgium. Arch Public Health. 2023. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13690-023-01118-y.

    Article  PubMed  PubMed Central  Google Scholar 

  48. Jousilahti P, Salomaa V, Kuulasmaa K, Niemelä M, Vartiainen E. Total and cause specific mortality among participants and non-participants of population based health surveys: a comprehensive follow up of 54 372 Finnish men and women. J Epidemiol Community Health. 2005. https://doiorg.publicaciones.saludcastillayleon.es/10.1136/jech.2004.024349.

    Article  PubMed  PubMed Central  Google Scholar 

  49. Larsen SB, Dalton SO, Schüz J, Christensen J, Overvad K, Tjønneland A, et al. Mortality among participants and non-participants in a prospective cohort study. Eur J Epidemiol. 2012. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s10654-012-9739-x.

    Article  PubMed  Google Scholar 

  50. Grönroos T, Kouvonen A, Kontto J, Salonen AH, Socio-Demographic, Factors. Gambling behaviour, and the level of gambling expenditure: A Population-Based study. J Gambl Stud. 2022. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s10899-021-10075-6.

    Article  PubMed  Google Scholar 

  51. Hagfors H, Castrén S, Salonen AH. How gambling motives are associated with socio-demographics and gambling behavior– A Finnish population study. J Behav Addict. 2022. https://doiorg.publicaciones.saludcastillayleon.es/10.1556/2006.2022.00003.

    Article  PubMed  PubMed Central  Google Scholar 

  52. Latvala T, Browne M, Rockloff M, Salonen AH. 18-item version of the short gambling harm screen (SGHS-18): validation of screen for assessing gambling-Related harm among Finnish population. Int J Environ Res Public Health. 2021. https://doiorg.publicaciones.saludcastillayleon.es/10.3390/ijerph182111552.

    Article  PubMed  PubMed Central  Google Scholar 

  53. Lind K, Hellman M, Obstbaum Y, Salonen AH. Associations between gambling severity and criminal convictions: implications for the welfare state. Addict Res Theory. 2021. https://doiorg.publicaciones.saludcastillayleon.es/10.1080/16066359.2021.1902995.

    Article  Google Scholar 

  54. Roukka T, Salonen AH. The winners and the losers: tax incidence of gambling in Finland. J Gambl Stud. 2020. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s10899-019-09899-0.

    Article  PubMed  Google Scholar 

  55. Schell C, Godinho A, Cunningham JA. Examining change in Self-Reported gambling measures over time as related to socially desirable responding Bias. J Gambl Stud. 2021. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s10899-020-09970-1.

    Article  PubMed  Google Scholar 

  56. Grönroos T, Kontto J, Young MM, Hodgins DC, Salonen AH. Trends in Lower-Risk gambling by age and net income among Finnish men and women in 2011, 2015, and 2019. J Gambl Stud. 2025. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s10899-024-10355-x.

    Article  PubMed  Google Scholar 

  57. Finnish Social Science Data Archive. https://www.fsd.tuni.fi/en/. Accessed 1 Feb 2025.

  58. Statistics Finland. FIONA remote access system. https://stat.fi/tup/tutkijapalvelut/fiona-etakayttojarjestelma_en.html. Accessed 1 Feb 2025.

Download references

Acknowledgements

Not applicable.

Funding

Open Access funding provided by Finnish Institute for Health and Welfare.

The survey was funded by the Ministry of Social Affairs and Health, Helsinki, Finland, based on Sect. 52 of the Appropriation of the Lotteries Act. Daily work of the authors AHS and JK at the Finnish Institute for Health and Welfare (THL) was also funded by the Ministry.

Author information

Authors and Affiliations

Authors

Contributions

JK, HT and AHS were responsible for the study conception and design. AHS and JK conducted literature searches and provided summaries of previous research studies. JK performed the analysis. JK, HT and AHS were responsible for the interpretation of the data. JK and AHS prepared the manuscript. All authors critically revised the manuscript and approved the final version.

Corresponding author

Correspondence to Jukka Kontto.

Ethics declarations

Ethics approval and consent to participate

The core principles of the research ethics, including giving written information about the study, and ensuring voluntary participation, as defined in the World Medical Association’s Declaration Helsinki 2004, were followed throughout the research process. The research protocol was approved by the Ethics Committee of the Finnish Institute for Health and Welfare (THL) (Statement THL/1390/6.02.01/2016). A separate contract was required between THL and Statistics Finland for the use of register-based data (user license number TK-53-1418-18).

Potential participants received written information about the study and the principles of voluntary participation. They were informed that the study protocol involved a register-linkage and non-response analysis. For non-respondents, their part of the study (non-response analysis) is considered as register based study where, according to the national regulations at that time of the study (Personal Data Act 523/1999), no separate consent from individuals was required. This is in line with the General Data Protection Regulation (GDPR), and its Article 6(1)(e) stating that processing the data is necessary for the performance of a task carried out in the public interest.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kontto, J., Tolonen, H. & Salonen, A.H. Using administrative register data for adjusting non-response bias in the finnish gambling harms survey. BMC Public Health 25, 1807 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12889-025-23016-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12889-025-23016-4

Keywords