Skip to content

Information for students, faculty and staff regarding COVID-19. (Updated: 18 January 2021)


Basic information about research on dietary data

A validated, self-reported semi-quantitative food frequency questionnaire (FFQ) is used in DietVIP and DietMON to collect information on habitual diet in the region of Northern Sweden. DietVIP was initiated 1985, and is still ongoing. DietMON contributes with data from the recurrent screenings performed 1986, 1990, 1994, 1999, 2004, 2009 and 2014. Starting 1992, the FFQ is optically readable, and the majority of the questionnaires collected between 1985-1992 have been entered manually.

Access to dietary data from NSDD

The design of the FFQ

The semi-quantitative FFQ contains three main sections covering:

- General portion size: estimated for three different food groups (potatoes/rice/pasta, meat/fish, and vegetables) using four photographs illustrations as shown in the figure.

- Frequencies of food intake: reported on a 9-point scale ranging from never, few times a year, 1-3 times/month, 1 time/week, 2-3 times/week, 4-6 times/week, 1 time/day, 2-3 times/day and >4 times/day.

- Diet supplements: the research participant can indicate supplementation with multivitamins, multiminerals, iron and selenium during the last 14 days or year.

Some general information is also collected related to meal habits and adherence to specific diet regimes.

Processing the FFQ

In addition to the information collected in DietVIP and DietMON on the frequency of intake of individual foods and food groups, estimates of energy- and nutrition intake have been derived (see "variable lists"). Energy- and nutrition intake are estimated by multiplying the daily frequency of intake (times/day) by the energy- and nutrition content reported by the National Food Administration (Bergström et al., 1991) using sex- and age-standardized portion sizes (Johansson et al., 2002).

Validation studies for NSDD

Different versions of the FFQ

Up until the latter half of the 1990s, the original version of the FFQ covering intake of 84 different foods was used in both the VIP and MONICA project. Thereafter a couple of reduced versions (questions about 64-66 different foods) have been used in VIP, all of which are optically readable. In the MONICA project the original long version has been used with the exception of the screening done in 1990, when a version containing 49 questions was used. Economic considerations have caused the reductions in the number of questions included in the FFQ. However, in all the FFQ versions the sections of food and food groups covered by the questions and the 9-point scale used to indicate intake frequency are identical.

Creating a uniform diet database

As a first step, the different versions of the FFQs were carefully compared and versions not sufficiently alike were excluded. The criteria for excluding a version of the FFQ were: (i) unmanageable deviations in intake frequency options, (ii) information on portion size is not included, and (iii) non-harmonizable food combinations in the questions. Based on these criteria, the following versions of the FFQ are included in the NSDD:

  • The original, longer FFQ (84 foods): the optically readable versions BASN, BASG and BAS6, the MONICA questionnaire, and the non-optically read apricot version
  • The shorter FFQ (64-66 foods): the optically readable versions AC4, AC5, AC6, AC00, AC03, AC05, and AC11

Requesting diet data from DietVIP and DietMON

From DietVIP, data can be requested that only includes those research participants that answered the longer version of the FFQ (with 84 foods), only include data from participants who answered the shorter versions of the FFQ (with 64-66 foods), or a combined dataset including all research participants where appropriate adjustments have been made to harmonize the obtained diet data across the different versions of the FFQ. It is thus very important to be aware of the assumptions made when the shorter and longer versions of the FFQs were combined. Please read carefully the section 'Important to consider before starting to work with NSDD data' below.

From DietMON, data can be requested individually from each of the seven MONICA screenings conducted (1986, 1990, 1994, 1999, 2004, 2009, and 2014), or as a combined dataset. Note that the FFQ used during the MONICA screening performed 1990 only contained 49 questions, and nutritional data has not been processed.

Dietary data can also be requested for nested case-control studies within the NSDD. These usually include diet data originating from both DietVIP (with longer and shorter versions of the FFQ) and DietMON, which is important to keep in mind in the process of matching cases and controls.

Note! If additional data on research participants is requested beyond that presented in the NSDD variable list, a request can be made to Åsa Ågren at the Biobank Research Unit, regarding participants from VIP, and the respective project lead regarding participants from the MONICA project (contact information can be provided by Åsa Ågren).

Important to consider before starting to work with NSDD data

No individuals have been excluded based on incomplete answers in the data material released to the researcher. However, variables have been constructed to indicate incomplete answers that can be used to exclude participants due unacceptable FFQ quality and/or biologically unreasonable values.

Some corrections of missing data have been performed. If an answer is missing or unreadable in sections with similar foods an answer (frequency of intake/day) is imputed by taking into consideration the other answers in the same section. For example, if in the section about fat intake, high frequency of intake has been indicated for "the 80% spreadable fat Bregott" but information about the alternative "40% spreadable fat" (lower-fat spreadable margarine), butter or regular margarine was missing or unreadable, then these answers will be set to "never", under the assumption that a person has a preference for a certain type of fat for spreading on sandwiches. No other imputations or estimations have been made in regard to the reported original frequencies of intake.

In the initial processing of the FFQ data, all the reported intakes are translated to the same scale (frequency of intake/day). A missing answer at this stage will be replaced by the median intake in the respective sex and 10-year age group as reported in 1999-2000[m1] . If the recoded frequencies are to be used, the researcher need to consider to what extent observations need to be excluded based on the current hypothesis to be investigated (see below).

The "exclude" variable, which was created to facilitate exclusion of potentially unreliable FFQs, is coded 0, 1, and 2. Zero indicates that both exclusion criteria (see below) were met, i.e. the information is correct and the individuals shall remain in the dataset; 1 indicates those individuals with >10% deficient information in the FFQ and; 2 indicates missing information related to portion size estimates and intake of energy and nutrients can thus not be calculated. Participants with both >10% deficient information in the FFQ and missing information on portion size, i.e. fulfilling the criteria for both code 1 and code 2, are coded as 1. The "exclude" variable can be used by the researcher to appropriately exclude subjects based on the research question.

In addition, the researcher need to consider whether participants reporting biologically improbable daily intake of energy or nutrients should be excluded. To help inform the decision, the variable "FIL" (Food Intake Level=estimated energy intake/basal metabolism) is created. It is up to the researcher to decide which cut-off is appropriate to apply based on the current research question. A recommendation is to exclude individuals with an "FIL" value under the lowest 5 percentile and above the highest 2.5 percentile based on the distribution in the entire population (see table in "statistics").

A number of individuals in the NSDD have participated in VIP and the MONICA project at more than one occasion. This is indicated by the variable "besok" , which is coded 1= first visit, 2 = second visit and 3= third visit. Note what is the individual-specific id and the visit-specific id in the variable list!

Selected references

Bergström L, Kylberg E, Hagman U, Eriksson H, Bruce Å. The food composition data base system (KOST-systemet)—its use for nutrient values. Vår Föda 1991;43:439-47.

Eriksson M, Stegmayr B, Lundberg V. MONICA quality assessments. Scand J Publ Hlth 2003;31(Suppl 61):25-30.

Hallmans G, Ågren Å, Johansson G, Johansson A, Stegmayr B, Jansson J-H, Lindahl B, Rolandsson O, Söderberg S, Nilsson M, Johansson I, Weinehall L. Cardiovascular disease and diabetes in the Northern Sweden Health and Disease Study Cohort—evaluation of risk factors and their interactions. Scand J Publ Hlth 2003;31(Suppl 61):18-24.

Johansson G, Wikman Å, Åhrén A-M, Hallmans G, Johansson I. Underreporting of energy intake in repeated 24-hour recalls related to gender, age, weight status, day of interview, educational level, reported food intake, smoking status and area of living. Publ Hlth Nutr 2001;4(4):919-927.

Johansson I, Hallmans G, Wikman Å, Biessy C, Riboli E, Kaaks R. Validation and calibration of food-frequency questionnaire measurements in the Northern Sweden Health and Disease cohort. Publ Hlth Nutr 2002;5(3):487-496.

Johansson I, Hallmans G, Ericsson S, Hagman U, Bruce Å, Wikman Å, Kaaks R, Riboli R. Evaluation of the accuracy of a dietary questionnaire aimed for the Västerbotten study. Scand J Nutr 38:50-55, 1994.

Lindahl B, Stegmayr B, Johansson I, Weinehall L, Hallmans G. Trends in lifestyle 1986-99 in a 25- to –64-year-old population of the Northern Sweden MONICA project. Scand J Publ Hlth 2003;31(Suppl 61):31-37.

Stegmayr B, Lundberg V, Asplund K. The events registration and survey procedures in the Northern Sweden MONICA Project. Scand J Publ Hlth 2003;31(Suppl 61):9-17.

Weinehall L. The emerging epidemic of cardiovascular disease. History and background of the Northern Sweden initiative on cardiovascular disease. Scand J Publ Hlth 2003;31(Suppl 61):5-8.

Weinehall L, Hallgren C-G, Westman G, Janlert U, Wall S. Reduction of selection bias in primary prevention of cardiovascular disease through involvement of primary health care. Scand J Prim Hlth Care 1998;16:171-176.

Wennberg M, Vessby B, Johansson I. Evaluation of relative intake of fatty acids according to the Northern Sweden FFQ with fatty acid levels in erythrocyte membranes as biomarkers. Public Health Nutr. 2009,15:1-8.

Winkvist, A, Hörnell, A, Hallmans G, Lindahl B, Weinehall L, Johansson I. More distinct food intake patterns among women than men in northern Sweden: a population-based survey. Nutr J 2009 Nutr J. 2009, ;8:12-17.