Skip to main content

Clusters of carbohydrate-rich foods and associations with type 2 diabetes incidence: a prospective cohort study

Abstract

Background

About one in ten adults are living with diabetes worldwide. Intake of carbohydrates and carbohydrate-rich foods are often identified as modifiable risk factors for incident type 2 diabetes. However, strong correlation between food variables can make it difficult to identify true associations. The purpose of this study was to identify clusters of carbohydrate-rich foods and analyse their associations with type 2 diabetes incidence in the Malmö Diet and Cancer Study cohort in southern Sweden.

Methods

Dietary intake of 26 622 participants was assessed using a validated three-part diet history method: a 7-day food diary, a 168-item food frequency questionnaire, and a 60-minute interview. K-means clustering analysis identified five clusters from 21 food variables. The Cox proportional hazard regression model was applied to calculate hazard ratios (HR) and 95% confidence intervals (CI) of the association between clusters and incident type 2 diabetes.

Results

The cluster analysis resulted in five clusters; high vegetables/low added sugar, high sugar-sweetened beverages, high juice, high fruit, and high refined carbohydrates/low fruit & vegetables (reference). During mean follow-up of 18 years, 4046 type 2 diabetes cases were identified. After adjustment for potential confounding (including lifestyle, body mass index, and diet), a high fruit cluster (HR 0.86; 95% CI 0.78, 0.94) was inversely associated with type 2 diabetes compared to the reference cluster. No other significant associations were identified.

Conclusions

A dietary pattern defined by a high intake of fruits was associated with a lower incidence of type 2 diabetes. The findings provide additional evidence of a potential protective effect from fruit intake in reducing type 2 diabetes risk. Future studies are needed to explore this association further.

Peer Review reports

Introduction

Globally, about 537 million adults were living with diabetes in 2021, and the prevalence is expected to reach 783 million by 2045 [1]. Type 2 diabetes is the most common type of diabetes, comprising over 90% of all cases. Diet constitutes a major modifiable risk factor for type 2 diabetes, affecting postprandial glucose levels, insulin resistance, body weight, and more [2]. Multiple studies have identified inverse associations between different dietary patterns and incidence of type 2 diabetes, including for the Mediterranean diet, low glycaemic diets, and various plant-based diets [3,4,5,6].

In the Malmo Diet and Cancer Study (MDCS), previous findings have concluded that a health conscious dietary pattern (high in fiber-rich bread, breakfast cereals, fruits, vegetables, fish, and low-fat yoghurt, and low in low-fiber bread) was associated with lower incidence of type 2 diabetes, while no significant associations were found for a dietary pattern in accordance with the Swedish nutrition recommendations [7, 8]. Furthermore, we have also identified associations between several carbohydrate-rich foods and type 2 diabetes risk, including a reduced risk from intake of fruits and an increased risk from sweets [9]. However, because of the moderate to strong correlation found between many of the dietary variables, the true associations with disease outcomes from individual foods or nutrients are difficult to differentiate. There is thus a need to study associations between patterns of carbohydrate intake and type 2 diabetes risk.

While previous studies have applied a priori approaches to study the carbohydrate quality of diets, such as the glycemic index or carbohydrate ratio indices, this is the first study, to our knowledge, to apply an a posteriori approach of cluster analysis to identify dietary patterns of carbohydrate-rich foods [10]. In contrast to the a priori approach, the a posteriori approach is not limited to the confirmation of known associations of diet and disease, and can thus be applied to explore and identify new dietary patterns and associations [11]. Few studies have applied cluster analysis to identify dietary patterns for the study of the association with type 2 diabetes risk, and an association has not always been identified [12,13,14]. Furthermore, these studies have often been limited to cross-sectional data, highlighting a need for more analyses of dietary patterns in prospective studies [15]. The aim of this study was to identify clusters of carbohydrate-rich food intake and analyse associations with incidence of type 2 diabetes in a large, prospective cohort in southern Sweden.

Subjects and methods

Population

Recruitment and baseline examinations of the Malmö Diet and Cancer Study were carried out between January 1991 and September 1996 in the municipality of Malmö in southern Sweden. A total of 30 446 participants were recruited from a source population of 74 138 individuals, encompassing all males born between 1923 and 1945, and all females born between 1923 and 1950, within the municipality. Recruitment was achieved through advertising and distribution of invitation letters. Individuals with limited Swedish proficiency or mental disability, hindering them from filling out the baseline questionnaire, were excluded from participation. Full description of the recruitment process has been published elsewhere [16]. For this specific study, we excluded participants who did not complete their dietary assessment (n = 2348), with prevalent diabetes at baseline (n = 1230), or with missing data on smoking habits, physical activity or level of education (n = 246). Excluded participants had a similar mean age as the study population, but a higher proportion of males and a higher mean BMI. The total population for this study was 26 622 participants (61% females) (Supplemental Fig. 1). The Malmö Diet and Cancer Study was granted ethical approval by the Regional Ethics Committee (LU 90 − 51). All participants provided their informed written consent.

Dietary assessment

Dietary assessment was conducted during baseline examinations, using a validated, modified diet history method consisting of three parts [17,18,19]. (1) A one-week food diary, recording intakes of prepared meals (commonly lunch and dinner), cold beverages, and supplements. (2) A 168-item food frequency questionnaire (FFQ), assessing portion-sizes and consumption frequencies of foods eaten regularly during the previous year. (3) A 60-minute interview, further assessing portion-sizes, food choices, and methods of preparation. The three-part method was designed to assess all aspects of the participant’s diet while ensuring that no overlapping data were collected. In September 1994, due to slight changes in the coding of dietary data, the interview was shortened from 60 to 45 min. The modification did not have a major effect on the ranking of participants [20].

All the FFQs were checked for missing values, and each participant’s FFQ was compared with the food diary (according to pre-defined instructions) to confirm that there was no overlapping in the registered food consumption. Recipes for the meals recorded in the food diary were sourced from the computer software KOSTSVAR (AIVO AB, Solna, Sweden), and could be altered during the interview if needed.

The modified diet history method has previously shown good ranking validity compared with a reference method of 18-day weighed food records [17, 18]. The energy-adjusted Pearson correlation coefficients for males and females were 0.65 and 0.53 for vegetable intake, 0.60 and 0.77 for fruit, 0.74 and 0.73 for cereals, 0.50 and 0.58 for bread, and 0.69 and 0.51 for potatoes [18].

Dietary data

Data on food intake from the food diary and the FFQ were registered as average intake in grams per day of individual foods. Energy and nutrient intakes were calculated using a food composition database with 1600 food items from the Swedish National Food Agency. Additional recipes and food items were added specifically for this cohort. Extreme and median values of total energy, nutrients and major food groups in the data, as well as extreme portion-sizes, were routinely checked for errors.

The 21 foods included in the cluster analysis were selected based on available data on foods with a high proportion of energy from carbohydrates and encompassed: boiled or baked potato, fried or deep-fried potato, fruits (including berries, canned fruits, and dried fruits), vegetables (including cooked vegetables), juice (from fruits and vegetables), sugar-sweetened beverages (all carbonated and non-carbonated beverages; except juices, dairy products and alcoholic beverages), pastries (all sweetened baked goods), chocolate, sweets (all sugar-sweetened candy, except chocolate), table sugar (all sugar products added to foods and drinks after preparation), ice cream, marmalade/honey/jam (including apple sauce), ketchup, flour, grains/cereals (< 10% fiber), grains/cereals (≥ 10% fiber), soft bread (< 4.5% fiber), soft bread (≥ 4.5% fiber), crisp bread (< 10% fiber), crisp bread (≥ 10% fiber), and rice/pasta. All values were expressed as intake in grams per 1000 kcal of energy intake.

Energy intake was expressed as kcal of total daily intake, while alcohol intake was expressed as percentage of total energy intake, and carbohydrate, protein, fat, monosaccharide, disaccharide, sucrose, and added sugar intakes were expressed as percentage of the non-alcohol energy intake. Added sugar was calculated by adding all intake of sucrose and monosaccharides, excluding sucrose and monosaccharides from fruits and vegetables (including juices) [21]. Intake of fiber (all types of dietary fiber), fish (excluding shellfish), red meat (including processed meat), and coffee were expressed as intake in grams per 1000 kcal of energy intake.

The EAT-Lancet index was recently developed and validated in our cohort and is based on a diet proposed by the EAT-Lancet Commission [22, 23]. The index is regarded as a healthy plant-based diet index with a range between 0 and 42 points. A higher index score indicates a greater adherence to the EAT-Lancet diet.

Covariate data

During baseline examinations, blood pressure, weight, height, waist circumference and percentage of body fat (with a bioelectrical impedance method) were measured, and 45 ml blood samples were collected. Participants were asked to fill out a self-administered questionnaire, providing information on their level of education, occupation, physical activity, smoking habits, alcohol consumption, social network, current health, medical history, current medication, and disease in close relatives.

Physical activity was assessed as the self-reported time spent per week on 17 leisure-time activities. The duration of each activity was multiplied with their respective metabolic equivalent intensity factor. The individuals were subsequently divided into five categories based on their metabolic equivalent of task hours (MET-hours) per week [24]. Alcohol habits was divided by six levels of reported intake: non-consumption and quintiles of consumption. Smoking habits was divided into three categories: current smoker, former smoker and never smoker. Education was divided into six categories corresponding to the highest achieved level of education. Season was divided into four categories, corresponding to the season when baseline assessments were carried out: winter (January–March), spring (April–June), summer (July–September) and fall (October–December). Diet method version was introduced following a minor adjustment of the coding routines of dietary data in September 1994, and was divided into two categories: old and new. Misreporters were participants who potentially misreported their energy intake, defined as having a ratio of reported energy intake to basal metabolic rate outside the 95% confidence interval (CI) of their calculated physical activity level [25]. Diet changers were defined as participants who reported that they had altered their eating habits in the past (due to disease or other reason) [26].

Among participants who entered the study from November 1991 and February 1994, a random 50% sample (n = 12,445) was invited to also participate in the Malmö Diet and Cancer – Cardiovascular Cohort; a sub-cohort of the MDCS [27]. In participants who accepted the invitation (n = 5540), additional blood analyses were carried out, including HbA1c, plasma glucose, and insulin. Homoeostatic model assessment of insulin resistance (HOMA-IR) was determined by using the formula: fasting plasma insulin (mU/L) × blood glucose (mmol/L)/22.5.

Endpoint data

Participants who developed diabetes during the follow-up period (until 31 December 2016) were identified through several different registers, as well as during rescreening of the cohort. A large proportion of cases were reported in more than one register. Both the Regional Diabetes 2000 Registry of Scania (15.2% of cases) and the Swedish National Diabetes Register (57.1% of cases) required a diabetes diagnosis by a physician, in accordance with the established diagnostic criteria (a fasting plasma glucose concentration of ≥ 7.0 mmol/L, or a fasting whole blood concentration of ≥ 6.1 mmol/L, on two different occasions) [28, 29]. Diabetes cases were also identified through the HbA1c Registry at Clinical Chemistry, Malmö (individuals with at least two HbA1c values ≥ 6.0%) (51.6% of cases), as well as through four national registries under the National Board of Health and Welfare: the Cause-of-Death Registry (8.8% of cases), the Swedish Hospital-based Outpatient Care (44.8% of cases), the Swedish National Inpatient Registry (ICD10 codes E10-E14 and O244-O249) (52.7% of cases), and the Swedish Prescribed Drug Registry (ATC code A10) (71.1% of cases) [30,31,32]. Additionally, diabetes cases were identified through the screening of sub-samples of participants in the Malmö Diet and Cancer Study cohort (1992–1994, 1997–2001 and 2007–2012) (2.8%, 7.8% and 11.1% of cases), as well as in the Malmö Preventive Project Cohort (2002–2006) (18.6% of cases). For this study, focusing on type 2 diabetes, we excluded cases who were registered as having developed type 1 diabetes, latent autoimmune diabetes in adults, secondary diabetes or other diabetes condition in the outcome variable. Participants registered with diabetes of unknown type were assumed to be of type 2 and were thus included.

Statistical analysis

Statistical analyses were performed using IBM SPSS Statistics (version 26.0; IBM Corp., Armonk, NY, USA). The main analyses were performed for all participants (n = 26 622). The K-means clustering analysis included 21 energy-adjusted, carbohydrate-rich food variables. The analysis was repeated with two to eight clusters to determine the optimal number of clusters. Baseline characteristics of clusters were analyzed by applying the general linear model for continuous variables (adjusted for age, sex, diet method version, season, and energy intake, when applicable) and cross tabulation for categorical variables.

The Cox proportional hazard regression model was used to calculate hazard ratios (HR) of the association between type 2 diabetes incidence and clusters of dietary intake. The cluster with the most participants, and least defined by one specific food, was used as the reference in our main analysis. Subsequent analyses applying each of the other clusters as the reference were also conducted. Years of follow-up was used as the time scale. The covariates included in the models were chosen based on known associations with type 2 diabetes from literature and were assessed during baseline examinations. In the first model, adjustments were made for sex, age, diet method version, season, and total energy intake. The second model included further adjustment for physical activity, alcohol habits, smoking habits, and education, and the third model included additional adjustment for body mass index (BMI). The fourth model included additional adjustment for two dietary variables: coffee, and red meat. Intake of red meat and processed meat has been associated with an increased risk of type 2 diabetes, while coffee intake has been associated with a reduced risk [2, 33]. Interaction for sex was examined by adding an interaction term in the fourth model, combining the cluster variable with the sex variable, and analyzing each cluster separately against the reference cluster. Potential mediating effects of total carbohydrate intake, fiber intake, and added sugar intake were considered but did not meet the assumptions of mediation [34]. Sensitivity analyses were conducted where potential misreporters of energy intake, diet changers, and participants who developed diabetes within two years from enrolment were excluded from the fourth model in separate analyses.

Results

Cluster characteristics

The best K number of clusters in our analysis was deemed to be five, as this resulted in clearly distinct clusters with a good distribution of participants in each cluster (n > 1000) (Supplemental Fig. 2). Four of these five clusters were clearly defined by a higher consumption of one or two specific foods, and thus aptly labelled after their defining variables: high vegetables/low added sugar, high sugar-sweetened beverages, high juice and high fruit (Table 1). The fifth and largest cluster was labelled the high refined carbohydrates/low fruit & vegetables cluster and encompassed roughly half of all participants (51.2%).

Table 1 Baseline table by clusters of carbohydrate-rich foods in participants in the Malmö Diet and Cancer cohort (n = 26 622)1,2

The high refined carbohydrates/low fruit & vegetables cluster was characterized by a high intake of pastries and low-fiber soft bread, and a low intake of fruit and vegetables (Table 1). The high vegetables/low added sugar cluster was characterized by a high intake of vegetables, and a low intake of sweet foods, including pastries, sugar-sweetened beverages, chocolate, sweets, and table sugar. The high sugar-sweetened beverages cluster was characterized by a high intake of sugar-sweetened beverages, while also having a low intake of fiber-rich soft bread and fiber-rich crisp bread. The high juice cluster was characterized by a high juice intake and the high fruit cluster by a high fruit intake.

For non-clustered variables, the high refined carbohydrates/low fruit & vegetables cluster had the highest intake of red meat, the highest proportion of males and current smokers, the lowest BMI, and the lowest proportion of potential energy misreporters and diet changers (Table 1). The high vegetables/low added sugar cluster had the highest intake of fiber and fish, the lowest proportion of males and high plasma glucose, the highest EAT-Lancet index score, and the highest proportion of university degrees, potential energy misreporters, and diet changers. The high sugar-sweetened beverages cluster had the lowest intake of fiber and fish, the highest waist circumference, the lowest EAT-Lancet index score, and the highest proportion of high plasma glucose and zero-consumers of alcohol. The high juice cluster had the lowest waist circumference. The high fruit cluster had the highest mean age and the lowest proportion of current smokers.

Associations between clusters and incidence of type 2 diabetes

In total, there were 4046 participants who developed type 2 diabetes during a mean follow-up of 18.4 years (SD 6.4 years) (Table 2). In the fully adjusted model, there was a significantly lower risk of developing type 2 diabetes in the high fruit cluster compared to the high refined carbohydrates/low fruit & vegetables cluster (HR 0.86; 95% CI 0.78, 0.94). No other significant associations were identified. In the high sugar-sweetened beverages cluster, there was a significant positive association with incident type 2 diabetes in the basic model only (HR 1.13; 95% CI 1.01, 1.27). No interaction with sex was identified for any of the clusters (Supplemental Table 1). Hence, associations were similar for males and for females.

Table 2 Hazard ratios (95% CI) of incident type 2 diabetes by clusters of carbohydrate-rich foods in participants in the Malmö Diet and Cancer cohort (n = 26 622)

Sensitivity analyses

Excluding potential energy misreporters (18% of the study sample), diet changers (22%), or participants who developed diabetes within two years (1.3%) did not alter the findings (Supplemental Tables 2, 3 & 4).

Discussion

In our fully-adjusted model, we identified a significant inverse association between a high fruit cluster and incidence of type 2 diabetes in a large cohort in southern Sweden. For the high sugar-sweetened beverages cluster, a significant positive association was identified in our basic model only. No associations could be identified for the high vegetables/low added sugar and high juice clusters.

The current findings provide valuable information in support of an inverse association between fruit intake and type 2 diabetes risk. While previous studies have identified inverse associations by studying fruits independently, or as part of dietary patterns that emphasize a higher fruit intake among other healthy food choices, this is the first study to our knowledge to have identified an inverse association with a dietary pattern that was primarily defined by a high fruit intake [2, 4].

The lower risk for the high fruit cluster confirmed previous findings in our cohort, where we identified a significant inverse association between fruit intake and incidence of type 2 diabetes [9]. We have previously also identified an association between a higher intake of monosaccharides, which was strongly associated with fruit intake, and lower risk of type 2 diabetes. In our current study, while the high fruit cluster did consume a higher energy percentage of monosaccharides compared to the reference cluster, the intake was not significantly higher than for all other clusters. In the previous study we also identified strong correlations between fruit intake and intake of other carbohydrate-rich foods, highlighting the need to conduct this study of associations between clusters of carbohydrate-rich foods and type 2 diabetes risk.

Both fruit and vegetable intake have been independently associated with type 2 diabetes risk in several previous studies. Zheng et al. [35] identified an inverse association with type 2 diabetes risk with higher fruit and vegetable intake already from levels below recommended daily intake, while a meta-analysis by Schwingschackl et al. [2] identified a dose-response inverse association with incident type 2 diabetes for daily fruit intake of up to 200–300 g, and for daily vegetable intake up to 300 g. In a previous study we concluded that a median daily fruit intake of 357 g per day in the highest quintile was associated with lower incidence of type 2 diabetes in the MDCS cohort, as well as, in males only, for a median daily vegetable intake of 301 g per day in the highest quintile [9]. However, no association was found for vegetable intake in females, and in our current study we did not find any associations for the high vegetables/low added sugar cluster with type 2 diabetes incidence.

Fruits and vegetables have low energy density and a high content of micronutrients, fiber and polyphenols, all of which may contribute to their potential protective effect. A meta-analysis by Rienks et al. [36] found an association with type 2 diabetes risk for intake of polyphenols, and for flavonoids in particular. However, more studies are needed to determine the potential role for polyphenols in the prevention of type 2 diabetes.

In comparison to the reference cluster, the high fruit cluster also had a higher intake of vegetables, juice, ice cream, flour, and high fiber soft bread and crisp bread, and a lower intake of potato, pastries, chocolate, sweets, table sugar, marmalade/honey/jam, ketchup, and low fiber soft bread and crisp bread. Thus, these foods may also have contributed to the significant difference between the clusters, and several of them have previously been associated with incident type 2 diabetes in the MDCS cohort [9].

Associations between dietary patterns and incidence type 2 diabetes have been studied previously in our cohort. Ericson et al. [7] applied principal component analysis to identify dietary patterns in the total diet of the MDCS cohort and found a significant inverse association between a health-conscious pattern and type 2 diabetes risk. One of the characteristics of this dietary pattern was a high fruit intake. In other studies, plant-based dietary patterns have been found to be associated with a reduced risk of type 2 diabetes, particularly when intake of healthy foods such as fruit and vegetables was emphasized [3]. This may be due a higher intake of nutrient-dense foods with low energy density while limiting or excluding the intake of red and processed meat. Similarly, diets with an emphasis on higher intake of fruit and vegetables, including the Mediterranean diet, the DASH diet, and the Alternative Healthy Eating Index, have all been inversely associated with type 2 diabetes risk [4]. Studies have also shown that the dietary glycemic index in the diet may play a role in the prevention of type 2 diabetes [6, 37, 38]. Many dietary patterns associated with a lower incidence of type 2 diabetes emphasize the intake of foods that are often low in glycemic index, including wholegrains, fruits and vegetables.

Few studies have applied cluster analysis to identify dietary patterns and studied the associations with incident type 2 diabetes in prospective cohorts, and they have shown inconsistent results. Villegas et al. [12] applied K-means cluster analysis to study associations with type 2 diabetes in middle-aged Chinese females. They found an inverse association for a dietary pattern defined by a high intake of dairy milk, and a low intake of staple foods, such as rice, and soy foods, compared to the reference cluster. Brunner et al. [13] used cluster analysis to examine dietary patterns of middle-aged British adults. In comparison with an unhealthy dietary pattern, a healthy pattern defined by a higher intake of fruits and nuts, vegetables, high-fiber bread, and low-fat dairy, and a lower intake of red meat and alcohol was associated with a lower risk of diabetes. A study by Hsiao et al. [14] explored dietary patterns in a small population of older American adults (75 years and older) using cluster analysis. The authors did not identify an increased risk of type 2 diabetes for a sweets & dairy and a Western dietary pattern, compared to a health-conscious dietary pattern.

In our study, we could not identify any significant associations for neither the high juice nor the high sugar-sweetened beverages cluster in our final model. The findings are similar to a previous study in our cohort where no associations for a higher intake of sugar-sweetened beverages or a higher juice intake with type 2 diabetes incidence could be identified [9]. Meanwhile, several other studies have identified significant positive associations for sugar-sweetened beverages and type 2 diabetes risk, while findings have been less clear for juice intake [2, 39, 40].

K-means cluster analysis is one of the most commonly applied clustering algorithms for identifying dietary patterns and has been found to be superior to other common methods of clustering [15, 41, 42]. However, there is no gold standard approach to decide the exact number of clusters to include in the analysis. Our decision was based on the size and distribution of participants in each cluster, and the distinguishable dietary patterns that emerged. There is also a subjective decision in determining how many and what variables to include in the analysis, and whether certain variables should be merged into a larger variable, such as foods into a larger food group. For this study we were specifically interested in studying clusters of carbohydrate-rich foods and their association with incidence of type 2 diabetes. Thus, we did not include other food variables potentially associated with type 2 diabetes risk in our cluster analysis. Different approaches can also be seen as for whether to energy-adjust and standardise the dietary variables. We decided to energy-adjust but not to standardize, by using e.g. Z-scores, as this may result in poor dietary patterns when foods eaten in smaller quantities are given disproportionate weights to foods eaten in larger quantities. Lastly, a decision also has to be made on what cluster to assign as the reference cluster in the analysis, and to interpret the findings accordingly. We applied the high refined carbohydrates/low fruit & vegetables cluster as the reference, as it encompassed more than half of the participants and was not primarily characterized by the intake of one specific food variable.

A major strength in our study is the large study population combined with the long follow-up time, resulting in a large number of type 2 diabetes cases. The large population size enabled the identification of five distinct dietary patterns of carbohydrate-rich foods, with a sizable number of individuals in each cluster, while the large number of cases increased the ability to identify associations between the derived clusters and type 2 diabetes risk. The prospective design of this study limited the risk of reverse causation and reduced selection bias. Another strength is the relatively high validity of the three-part diet method applied in this study. The comprehensive data on covariates provided the opportunity to make ample adjustment for possible confounders in our models. However, residual confounding due to unknown factors may still be present. This study also relied solely on self-reported dietary data that was only collected at baseline. Although past diet changers were excluded in one of the sensitivity analyses, we cannot rule out that there may be participants who have made major changes to their diets since baseline examinations. Additionally, relying on self-reported dietary data provides a well-known risk of measurement error due to misreporting, leading to a misclassification of exposure. Hence, we performed an additional sensitivity analysis where potential misreporters of energy intake were excluded, which did not alter our findings. Generalizability of the findings may be limited due to a less diverse population and different eating habits compared to present-day Malmo, and some limitations in representativity [16]. Meanwhile, the study population constituted a large, socio-demographically representative sample, and could be divided into five distinct and substantial clusters. Lastly, despite excluding participants with type 1 diabetes, latent autoimmune diabetes in adults, secondary diabetes or other diabetes condition from the study, about half of all included cases were registered as being of unknown type. However, considering the age of the participants and the exclusion of baseline diabetes cases it is likely to assume that the vast majority of these cases were of type 2. Removing these cases would have greatly reduced the power of the study.

In conclusion, we identified a reduced risk of incident type 2 diabetes with a dietary pattern primarily defined by a high fruit intake. To our knowledge, this is the first study that has investigated the associations between clusters of carbohydrate-rich foods and incidence of type 2 diabetes. The findings provide additional evidence that a dietary pattern high in fruits may lower the risk of developing type 2 diabetes. Future studies are needed to explore this association further.

Data availability

The data are available after application to the MDC Steering Committee (http://malmo-kohorter.lu.se/malmo-cohorts).

References

  1. Federation ID. IDF Diabetes Atlas. 10th ed. Brussels: International Diabetes Federation; 2021.

    Google Scholar 

  2. Schwingshackl L, Hoffmann G, Lampousi AM, Knuppel S, Iqbal K, Schwedhelm C, Bechthold A, Schlesinger S, Boeing H. Food groups and risk of type 2 Diabetes Mellitus: a systematic review and meta-analysis of prospective studies. Eur J Epidemiol. 2017;32:363–75. https://0-doi-org.brum.beds.ac.uk/10.1007/s10654-017-0246-y.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Qian F, Liu G, Hu FB, Bhupathiraju SN, Sun Q. Association between Plant-based dietary patterns and risk of type 2 Diabetes: a systematic review and Meta-analysis. JAMA Intern Med. 2019. https://0-doi-org.brum.beds.ac.uk/10.1001/jamainternmed.2019.2195.

    Article  PubMed  PubMed Central  Google Scholar 

  4. Jannasch F, Kroger J, Schulze MB. Dietary patterns and type 2 Diabetes: a systematic literature review and Meta-analysis of prospective studies. J Nutr. 2017;147:1174–82. https://0-doi-org.brum.beds.ac.uk/10.3945/jn.116.242552.

    Article  PubMed  CAS  Google Scholar 

  5. Uusitupa M, Khan TA, Viguiliouk E, Kahleova H, Rivellese AA, Hermansen K, Pfeiffer A, Thanopoulou A, Salas-Salvado J, Schwab U et al. (2019) Prevention of type 2 Diabetes by Lifestyle changes: a systematic review and Meta-analysis. Nutrients 11. https://0-doi-org.brum.beds.ac.uk/10.3390/nu11112611.

  6. Ley SH, Hamdy O, Mohan V, Hu FB. Prevention and management of type 2 Diabetes: dietary components and nutritional strategies. Lancet. 2014;383:1999–2007. https://0-doi-org.brum.beds.ac.uk/10.1016/s0140-6736(14)60613-9.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  7. Ericson U, Brunkwall L, Alves Dias J, Drake I, Hellstrand S, Gullberg B, Sonestedt E, Nilsson PM, Wirfalt E, Orho-Melander M. Food patterns in relation to weight change and incidence of type 2 Diabetes, coronary events and Stroke in the Malmo Diet and Cancer cohort. Eur J Nutr. 2019;58:1801–14. https://0-doi-org.brum.beds.ac.uk/10.1007/s00394-018-1727-9.

    Article  PubMed  Google Scholar 

  8. Mandalazi E, Drake I, Wirfalt E, Orho-Melander M, Sonestedt E. A high Diet Quality based on dietary recommendations is not Associated with Lower incidence of type 2 Diabetes in the Malmo Diet and Cancer Cohort. Int J Mol Sci. 2016;17. https://0-doi-org.brum.beds.ac.uk/10.3390/ijms17060901.

  9. Olsson K, Ramne S, Gonzalez-Padilla E, Ericson U, Sonestedt E. Associations of carbohydrates and carbohydrate-rich foods with incidence of type 2 Diabetes. Br J Nutr. 2020;1–11. https://0-doi-org.brum.beds.ac.uk/10.1017/S0007114520005140.

  10. Liu J, Rehm CD, Shi P, McKeown NM, Mozaffarian D, Micha R. A comparison of different practical indices for assessing carbohydrate quality among carbohydrate-rich processed products in the US. PLoS ONE. 2020;15:e0231572. https://0-doi-org.brum.beds.ac.uk/10.1371/journal.pone.0231572.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  11. Hu FB. Dietary pattern analysis: a new direction in nutritional epidemiology. Curr Opin Lipidol. 2002;13:3–9. https://0-doi-org.brum.beds.ac.uk/10.1097/00041433-200202000-00002.

    Article  PubMed  CAS  Google Scholar 

  12. Villegas R, Yang G, Gao YT, Cai H, Li H, Zheng W, Shu XO. Dietary patterns are associated with lower incidence of type 2 Diabetes in middle-aged women: the Shanghai women’s Health Study. Int J Epidemiol. 2010;39:889–99. https://0-doi-org.brum.beds.ac.uk/10.1093/ije/dyq008.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Brunner EJ, Mosdol A, Witte DR, Martikainen P, Stafford M, Shipley MJ, Marmot MG. Dietary patterns and 15-y risks of major coronary events, Diabetes, and mortality. Am J Clin Nutr. 2008;87:1414–21. https://0-doi-org.brum.beds.ac.uk/10.1093/ajcn/87.5.1414.

    Article  PubMed  CAS  Google Scholar 

  14. Hsiao PY, Mitchell DC, Coffman DL, Craig Wood G, Hartman TJ, Still C, Jensen GL. Dietary patterns and relationship to obesity-related health outcomes and mortality in adults 75 years of age or greater. J Nutr Health Aging. 2013;17:566–72. https://0-doi-org.brum.beds.ac.uk/10.1007/s12603-013-0014-y.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  15. Devlin UM, McNulty BA, Nugent AP, Gibney MJ. The use of cluster analysis to derive dietary patterns: methodological considerations, reproducibility, validity and the effect of energy mis-reporting. Proc Nutr Soc. 2012;71:599–609. https://0-doi-org.brum.beds.ac.uk/10.1017/S0029665112000729.

    Article  PubMed  Google Scholar 

  16. Manjer J, Carlsson S, Elmstahl S, Gullberg B, Janzon L, Lindstrom M, Mattisson I, Berglund G. The Malmo Diet and Cancer Study: representativity, cancer incidence and mortality in participants and non-participants. Eur J Cancer Prev. 2001;10:489–99. https://0-doi-org.brum.beds.ac.uk/10.1097/00008469-200112000-00003.

    Article  PubMed  CAS  Google Scholar 

  17. Riboli E, Elmstahl S, Saracci R, Gullberg B, Lindgarde F. The Malmo Food Study: validity of two dietary assessment methods for measuring nutrient intake. Int J Epidemiol. 1997;26(Suppl 1):161–73. https://0-doi-org.brum.beds.ac.uk/10.1093/ije/26.suppl_1.s161.

    Article  Google Scholar 

  18. Elmstahl S, Riboli E, Lindgarde F, Gullberg B, Saracci R. The Malmo Food Study: the relative validity of a modified diet history method and an extensive food frequency questionnaire for measuring food intake. Eur J Clin Nutr. 1996;50:143–51.

    PubMed  CAS  Google Scholar 

  19. Berglund G, Elmstahl S, Janzon L, Larsson SA. The Malmo Diet and Cancer Study. Design and feasibility. J Intern Med. 1993;233:45–51. https://0-doi-org.brum.beds.ac.uk/10.1111/j.1365-2796.1993.tb00647.x.

    Article  PubMed  CAS  Google Scholar 

  20. Wirfalt E, Mattisson I, Johansson U, Gullberg B, Wallstrom P, Berglund G. A methodological report from the Malmo Diet and Cancer study: development and evaluation of altered routines in dietary data processing. Nutr J. 2002;1:3. https://0-doi-org.brum.beds.ac.uk/10.1186/1475-2891-1-3.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Ramne S, Alves Dias J, Gonzalez-Padilla E, Olsson K, Lindahl B, Engstrom G, Ericson U, Johansson I, Sonestedt E. Association between added sugar intake and mortality is nonlinear and dependent on sugar source in 2 Swedish population-based prospective cohorts. Am J Clin Nutr. 2019;109:411–23. https://0-doi-org.brum.beds.ac.uk/10.1093/ajcn/nqy268.

    Article  PubMed  Google Scholar 

  22. Stubbendorff A, Sonestedt E, Ramne S, Drake I, Hallstrom E, Ericson U. Development of an EAT-Lancet index and its relation to mortality in a Swedish population. Am J Clin Nutr. 2022;115:705–16. https://0-doi-org.brum.beds.ac.uk/10.1093/ajcn/nqab369.

    Article  PubMed  Google Scholar 

  23. Willett W, Rockstrom J, Loken B, Springmann M, Lang T, Vermeulen S, Garnett T, Tilman D, DeClerck F, Wood A, et al. Food in the Anthropocene: the EAT-Lancet Commission on healthy diets from sustainable food systems. Lancet. 2019;393:447–92. https://0-doi-org.brum.beds.ac.uk/10.1016/S0140-6736(18)31788-4.

    Article  PubMed  Google Scholar 

  24. Mutie PM, Drake I, Ericson U, Teleka S, Schulz CA, Stocks T, Sonestedt E. Different domains of self-reported physical activity and risk of type 2 Diabetes in a population-based Swedish cohort: the Malmo diet and Cancer study. BMC Public Health. 2020;20:261. https://0-doi-org.brum.beds.ac.uk/10.1186/s12889-020-8344-2.

    Article  PubMed  PubMed Central  Google Scholar 

  25. Mattisson I, Wirfalt E, Aronsson CA, Wallstrom P, Sonestedt E, Gullberg B, Berglund G. Misreporting of energy: prevalence, characteristics of misreporters and influence on observed risk estimates in the Malmo Diet and Cancer cohort. Br J Nutr. 2005;94:832–42. https://0-doi-org.brum.beds.ac.uk/10.1079/bjn20051573.

    Article  PubMed  CAS  Google Scholar 

  26. Sonestedt E, Wirfalt E, Gullberg B, Berglund G. Past food habit change is related to obesity, lifestyle and socio-economic factors in the Malmo Diet and Cancer Cohort. Public Health Nutr. 2005;8:876–85. https://0-doi-org.brum.beds.ac.uk/10.1079/phn2005736.

    Article  PubMed  Google Scholar 

  27. Hedblad B, Nilsson P, Janzon L, Berglund G. Relation between insulin resistance and carotid intima-media thickness and stenosis in non-diabetic subjects. Results from a cross-sectional study in Malmo, Sweden. Diabet Med. 2000;17:299–307. https://0-doi-org.brum.beds.ac.uk/10.1046/j.1464-5491.2000.00280.x.

    Article  PubMed  CAS  Google Scholar 

  28. SND. Scania Diabetes Registry Gothenburg. : University of Gothenburg; Swedish National Data Service. https://snd.gu.se/en/catalogue/study/ext0074. Accessed 16 June 2022.

  29. NDR. The Swedish National Diabetes Register Gothenburg. Registercentrum Västra Götaland. https://www.ndr.nu/#/english. Accessed 16 June 2022.

  30. Socialstyrelsen. Statistical Databases Stockholm: National Board of Health and Welfare. https://www.socialstyrelsen.se/en/statistics-and-data/statistics/statistical-databases/. Accessed 16 June 2022.

  31. Socialstyrelsen, The National Patient Register Stockholm. National Board of Health and Welfare. https://www.socialstyrelsen.se/en/statistics-and-data/registers/register-information/the-national-patient-register/. Accessed 16 June 2022.

  32. Socialstyrelsen. Läkemedelsregistret Stockholm: National Board of Health and Welfare. https://www.socialstyrelsen.se/statistik-och-data/register/alla-register/lakemedelsregistret/. Accessed 16 June 2022.

  33. Carlstrom M, Larsson SC. Coffee consumption and reduced risk of developing type 2 Diabetes: a systematic review with meta-analysis. Nutr Rev. 2018;76:395–417. https://0-doi-org.brum.beds.ac.uk/10.1093/nutrit/nuy014.

    Article  PubMed  Google Scholar 

  34. Baron RM, Kenny DA. The moderator-mediator variable distinction in social psychological research: conceptual, strategic, and statistical considerations. J Pers Soc Psychol. 1986;51:1173–82. https://0-doi-org.brum.beds.ac.uk/10.1037//0022-3514.51.6.1173.

    Article  PubMed  CAS  Google Scholar 

  35. Zheng JS, Sharp SJ, Imamura F, Chowdhury R, Gundersen TE, Steur M, Sluijs I, van der Schouw YT, Agudo A, Aune D, et al. Association of plasma biomarkers of fruit and vegetable intake with incident type 2 Diabetes: EPIC-InterAct case-cohort study in eight European countries. BMJ. 2020;370:m2194. https://0-doi-org.brum.beds.ac.uk/10.1136/bmj.m2194.

    Article  PubMed  PubMed Central  Google Scholar 

  36. Rienks J, Barbaresko J, Oluwagbemigun K, Schmid M, Nothlings U. Polyphenol exposure and risk of type 2 Diabetes: dose-response meta-analyses and systematic review of prospective cohort studies. Am J Clin Nutr. 2018;108:49–61. https://0-doi-org.brum.beds.ac.uk/10.1093/ajcn/nqy083.

    Article  PubMed  Google Scholar 

  37. Livesey G, Taylor R, Livesey HF, Buyken AE, Jenkins DJA, Augustin LSA, Sievenpiper JL, Barclay AW, Liu S, Wolever TMS et al. (2019) Dietary glycemic index and load and the risk of type 2 Diabetes: a systematic review and updated Meta-analyses of prospective cohort studies. Nutrients 11. https://0-doi-org.brum.beds.ac.uk/10.3390/nu11061280.

  38. Livesey G, Taylor R, Livesey HF, Buyken AE, Jenkins DJA, Augustin LSA, Sievenpiper JL, Barclay AW, Liu S, Wolever TMS et al. (2019) Dietary glycemic index and load and the risk of type 2 Diabetes: Assessment of Causal relations. Nutrients 11. https://0-doi-org.brum.beds.ac.uk/10.3390/nu11061436.

  39. Imamura F, O’Connor L, Ye Z, Mursu J, Hayashino Y, Bhupathiraju SN, Forouhi NG. Consumption of sugar sweetened beverages, artificially sweetened beverages, and fruit juice and incidence of type 2 Diabetes: systematic review, meta-analysis, and estimation of population attributable fraction. BMJ. 2015;351:h3576. https://0-doi-org.brum.beds.ac.uk/10.1136/bmj.h3576.

    Article  PubMed  PubMed Central  Google Scholar 

  40. Ma L, Hu Y, Alperet DJ, Liu G, Malik V, Manson JE, Rimm EB, Hu FB, Sun Q. Beverage consumption and mortality among adults with type 2 Diabetes: prospective cohort study. BMJ. 2023;381:e073406. https://0-doi-org.brum.beds.ac.uk/10.1136/bmj-2022-073406.

    Article  PubMed  PubMed Central  Google Scholar 

  41. Fransen HP, May AM, Stricker MD, Boer JM, Hennig C, Rosseel Y, Ocke MC, Peeters PH, Beulens JW. A posteriori dietary patterns: how many patterns to retain? J Nutr. 2014;144:1274–82. https://0-doi-org.brum.beds.ac.uk/10.3945/jn.113.188680.

    Article  PubMed  CAS  Google Scholar 

  42. Sauvageot N, Schritz A, Leite S, Alkerwi A, Stranges S, Zannad F, Streel S, Hoge A, Donneau AF, Albert A, et al. Stability-based validation of dietary patterns obtained by cluster analysis. Nutr J. 2017;16:4. https://0-doi-org.brum.beds.ac.uk/10.1186/s12937-017-0226-9.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

The authors would like to thank all participants in the Malmö Diet and Cancer Study. We would also like to acknowledge the information provided by the National Diabetes Register of Sweden, the Malmö HbA1c Register, the Regional Diabetes 2000 Register and the registers provided by the Swedish Board of Health and Welfare.

Funding

The study was funded by the Swedish Research Council (2016 − 01501, 2020 − 01412), the Swedish Heart-Lung Foundation (2016 − 0267, 2019 − 0555), the Albert Påhlsson Foundation, and Medical Training and Research Agreement (ALF) grants. Support was also provided by the Swedish Foundation for Strategic Research (IRC15-0067).

Open access funding provided by Lund University.

Author information

Authors and Affiliations

Authors

Contributions

KO and ES designed research; KO performed statistical analysis and wrote the paper; ES, EGP, SJ, AS, YB, SR, and UE contributed to the interpretation of the findings, and critically reviewed, commented on, read, edited, and approved the final manuscript; KO and ES had primary responsibility for final content.

Corresponding author

Correspondence to Kjell Olsson.

Ethics declarations

Ethics approval and consent to participate

The Malmö Diet and Cancer Study was granted ethical approval by the Regional Ethics Committee (LU 90 − 51).

Consent to publish

Not applicable.

Competing interests

The authors declare that there are no competing interests. All authors certify that they have no affiliations with or involvement in any organization or entity with financial interest or non-financial interest in the subject matter or materials discussed in this manuscript.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Olsson, K., González-Padilla, E., Janzi, S. et al. Clusters of carbohydrate-rich foods and associations with type 2 diabetes incidence: a prospective cohort study. Nutr J 22, 71 (2023). https://0-doi-org.brum.beds.ac.uk/10.1186/s12937-023-00906-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://0-doi-org.brum.beds.ac.uk/10.1186/s12937-023-00906-0

Keywords