Neighborhood Influences on the Geography of Type 2 Diabetes in Malaysia: A Geospatial Modelling Study ===================================================================================================== * Kurubaran Ganasegeran * Mohd Rizal Abdul Manaf * Nazarudin Safian * Lance A. Waller * Feisul Idzwan Mustapha * Khairul Nizam Abdul Maulud * Muhammad Faid Mohd Rizal ## Abstract Type 2 diabetes (T2D) often exhibits long-standing disparities across populations. Spatial regression models can identify areas of epidemiological conformity and transitions between local neighborhoods to inform timely, localized public health interventions. We identified areal-level distributions of T2D rates across Malaysia and synthesized prediction models to estimate local effects and interactions of different neighborhood covariates affecting local T2D burden. We obtained aggregated counts of national level T2D cases data by administrative-districts between 2016-2020 and computed district-wise crude rates to correlate with district-level neighborhood demographic, socio-economic, safety, fitness, access to built-environments, and urban growth indicators from various national sources and census data. We applied simultaneous spatial autoregressive (SAR) models coupled with two-way interaction analyses to account for spatial autocorrelation and estimate risk factors for district-level T2D rates in Malaysia. The variation in spatial lag estimates of T2D rates by districts was influenced by the proportion of households living below 50% of the median income (β = 0.009, *p* = 0.002) and national poverty line (β = - 0.012, *p* = 0.001), income inequalities (β = - 2.005, *p* = 0.004), CCTV coverage per 1000 population (β = 0.070, *p* = 0.023), average property crime index per 1000 population (β = 0.014, *p* = 0.033), access to bowling centers (β = - 0.003, *p* = 0.019), and parks (β = 0.007, *p* = 0.001). Areal-level district-wise crude T2D rate estimates were influenced by neighborhood socio-economic vulnerabilities, neighborhood safety, and neighborhood access to fitness facilities, after accounting for residual spatial correlation via SAR models. Keywords * type 2 diabetes * neighborhood * geospatial * geography * population * public health policy ## Introduction Neighborhoods are places with social and cultural meaning to local residents, defining where they typically reside, work, play, or have a sense of belonging.1,2 Neighborhoods are built based on people’s living needs, expectations, and circumstances, alongside with urbanization processes offering basic physical and landscape features (e.g., parks, sidewalks, or bicycle lanes), resulting from local township planning, maintenance, or restructuring.2 The ultimate target of such interventions is to achieve sustainable neighborhood outcomes for livability, equity, and viability.3 More generally, neighborhoods reflect geographical entities of smaller confined residential areas or human settlements, nested within larger units like cities, states, or regions.2 With respect to neighborhood operations and type 2 diabetes (T2D), local risks often are based on local upstream social, political, or commercial drivers influencing local community health throughout the life course4 – e.g., as measured by the rise of type 2 diabetes (T2D) or obesity burdens with evidence of longstanding disparities between sociodemographic sub-populations.5,6 Theoretical frameworks postulate contemporary neighborhoods being susceptible to T2D risk owing to limited access to healthy food, and the influence of proximity or density metrics to fitness facilities or green spaces – and when coupled with poverty, local crimes, social delinquency, local colonialism, or globalization processes, these neighborhood environments cause potential psychosocial stress to local residents, prompting a decline of lifestyle behaviors, and resulting greater susceptibility to T2D risks or other chronic conditions.6,7 Socio-economic vulnerabilities such as poverty, income inequalities, and unemployment rates have been positively linked to various lifestyle and chronic health conditions.7-11 These established linkages were mostly interpreted at the individual level across the literature without considering the specific geographical scales capable of providing a snapshot to local population risks for public health interventions. Areal-level economic markers such as the Gini coefficient offer measures of income inequalities (i.e., the extent of household income distributed unevenly) captured by national population censuses that are useful for understanding local landscapes of human behavior and living circumstances across communities - where the rich are often clustered in areas of decent living, while the poor typically struggle to survive within the social and commercial environmental hazards – often predisposing them to greater health needs within chaotic neighborhoods.3,12,13 The rural-urban matrix coupled with each neighborhood’s demography (i.e., ethnic/racial disparities or minorities; women; middle-to-older aged people) and neighborhood’s socio-economic vulnerabilities often reveal significant correlations with high neighborhood-level obesity and T2D rates.8,14 These vulnerable groups are often segregated or pressured in deprived neighborhoods, accelerating their neighborhood safety concerns due to higher incidence of social delinquency, local crime, or drug addiction – a phenomenal tendency towards increasing psychological stress and T2D risk.15,16 Local safety concerns may decrease physical activity - a cornerstone for T2D risk management - among residents dwelling within deprived neighborhoods, likely due to reduced walking, cycling, or access to nearby residential built-environments for exercise or recreational activities (e.g., neighborhood parks, sports, or fitness facilities).14,16 In addition, physical inactivity often is connected to people’s demographic, socio-economic characteristics, and health outcomes, where ethnic/racial minorities or women from lower income groups in urban areas often exhibit greater risks for poorer health, especially in rural areas.17,18 Some characteristics of human settlements, such the percentage living in rural areas are known to lower rates of physical inactivity as compared to their urban counterparts,19 thus accelerating risk for chronic diseases among local residents.16,20 Consistent with the regional atlas estimates illustrating that Malaysia topped T2D prevalence among countries in the Western Pacific Region as of 2021,21 the National Health and Morbidity Survey (NHMS) 2019 Report claimed that approximately 1 in 5 adults in Malaysia was afflicted with T2D, totalling to approximately 3.9 million people above the age of 18 years.22 Countrywide point prevalence of T2D increased from 11.2% in 2011 to 18.3% in 2019.22 The burden of T2D escalated alongside demographic, societal, and cultural changes within Malaysian communities. Common individual-level risk factors for T2D generally include physical inactivity, poor dietary behaviours, and demographic profiles such as age, ethnicity, socio-economic status, or type of occupations.23-26 But these attributes often associate with neighborhood communities,27 where local social determinants of health can catalyse poor health amongst different residential areas.28 Local variations in local health are greatly influenced by social stratifications, causing disparities in T2D prevalence rates and associated risk factors, all of which can be influenced by *“place.”* Here, we explore the distribution of T2D crude-rates by administrative districts in Malaysia, and subsequently determine the influence of local demography, socio-economic vulnerabilities, safety, urban growth indicators, neighborhood fitness, and neighborhood access to built-environments on the geography of T2D. As neighborhoods are comprised of multiple structural and social determinants shaping the landscape of human living conditions across geography, we further examine potential interaction effects between selected neighborhood covariates and the spatial variation of T2D rates. Our study assessed a wide range of neighborhood indicators complementing the attributes outlined within the Sustainable Development Goals (SDG) framework for healthy and dignified living. ## Methods ### Study Design, Setting, and Population For this ecological study, we assembled and linked district-level population data (n = 144), countrywide, for Malaysia, using multiple data sources from the National Diabetes Register (NDR),29 administrative shapefiles (level 1 for states; level 2 for administrative districts),30 and population wellbeing surveys and social indicators integrated within the Malaysian Population Census.31 Data on 271,553 active T2D adults aged ≥ 20 years from 2016 to 2020 by administrative-districts were retrieved from the National Diabetes Register (NDR) of Malaysia.29 Active T2D cases diagnosed based on local clinical practice guidelines32 with at least one follow-up visit to the registered primary health clinic in a year were captured.33 In line with universal healthcare coverage, primary health services in Malaysia serve a catchment area within five kilometers of community neighborhoods, which are typically nested within local administrative-districts, as principally captured within the NDR. ### Measures Our main outcome measure was district-wise crude rates of T2D computed from aggregated cases reported to the NDR. We estimated associations between local T2D rates and covariates consisting of neighborhood demography (e.g., proportions of ethnic minorities), neighborhood socio-economic vulnerability (e.g., income inequalities), urban growth, neighborhood safety (e.g, property crime indices), and proximity to built-environment opportunities for fitness activities (e.g., access to parks or sports facilities). ### Metrics and Data Sources Covariates retrieved from different data sources are described as follows: 1. *Diabetes rates* **-** We computed T2D crude prevalence rates per 100,000 population based on the proportion of cumulative cases from the years 2016-2020 to the total adult population aged ≥ 20 years for each district in Malaysia [total population of adults aged ≥ 20 years approximated to 19,697,300 people countrywide, consisting of 144 districts31, a common measure allowing comparisons across different regions or areas irrespective of population size and interpretating coefficients synthesized from regression models34. 2. *Neighborhood’s demography* **-** In line with global trends35-39, the Malaysian National Health and Morbidity Survey (NHMS 2019) reported that T2D was highly prevalent in women, ethnic minority (Indians) followed by ethnic Bumiputera, and advancing age22. At the district-level, we operationalized neighborhood’s demography of adults aged ≥ 20 years as continuous variables based on the proportion of women and men, minority ethnic composition (proportion of Indians), proportion of Bumiputera (i.e., Bumiputera Malay, Bumiputera Others), proportion of Chinese, and the proportion of adults aged 20-34 years, 35-49 years, 50-64 years, and ≥65 years, retrieved from the Malaysian population census31. 3. *Neighborhood’s socio-economic vulnerability* **-** Neighborhood’s socio-economic vulnerability was measured through three domain indicators of poverty, income inequalities, and unemployment. Poverty was measured based on the United Nations Sustainable Development Goals (Goal 1: No Poverty) 2019 indicator that reports the proportion of households living below the national poverty line (poverty line income for Malaysia in 2019 was MYR 2208 per month; conversion based on 2019 average exchange rate was USD 533.10) by administrative districts from the Malaysian population census31. Additionally, poverty was also measured based on the proportion of people living below fifty percent of median income by administrative districts from the Malaysian population census31. This metric is defined as the percentage of people in the population by administrative districts who live in households whose per capita income or consumption is below half of the median income (median household income for Malaysia in 2019 was MYR 5873 per month; conversion based on 2019 average exchange rate was USD 1417.98; below 50% of median income equals to MYR 2937.50 or less per month; conversion based on 2019 average exchange rate was USD 709.54) or consumption per capita. The median was measured based on the 2017 Purchasing Power Parity (PPP) using the Poverty and Inequality Platform from the World Bank40. While the poverty line defines a measure defining a state of adequate requirement of income by an individual to fulfill the basic needs of livelihood (as determined by national policy), the current study added the “below 50% of median income” as second, alternative measure of local poverty based on local concentrations of individuals in the lower quarter of the national income distribution. This covariate is a relative measure of poverty which could change considerably overtime based on country’s economic performance, as poverty thresholds could rise or decline rapidly during periods of economic growth or downturn, affecting growth or development of neighborhoods social structure within urbanization processes. The two measures (included separately in our regression models) represent different aspects of local poverty and comparisons of results can provide more nuanced insight into the types of associations between T2D and income41. Local income *inequality* was conceptualized based on the United Nations Sustainable Development Goals (Goal 10: Reducing Inequalities) 2019 indicator as measured by local Gini coefficients in districts from the Malaysian population census31. A local Gini coefficient of 0 indicates perfect income equality within the subregion and 1 (or 100) reflects maximal income inequality40. Unemployment rates by administrative districts was measured with reference to the United Nations Sustainable Development Goals (Goal 8: Decent Work and Economic Growth) indicator, retrieved from the Malaysian population census31. 4. *Urban growth indicator -* Urbanization processes would forcibly displace persons through internal migration, resulting in wider urban-rural gap of human settlements that influences township planning or restructuring. We measured urbanization processes that affect neighborhoods based on urban growth indicator. Urban growth rate was measured as the difference of urban population shift between the years of 2010 and 2020 in Malaysia retrieved from the Malaysian population census31. 5. *Neighborhood’s safety -* Neighborhood safety was conceptualized based on the United Nations Sustainable Development Goals (Goal 16: Peace, Justice, and Strong Institutions) indicator; defined as the proportion of population that feel safe walking alone around the neighborhood they live, while motivating local communities to access nearby public area facilities for engagement in fitness, physical, or sports activities31. Neighborhood safety was measured based on a proxy variable sourced from the 2021 Ministry of Housing and Local Government Authority social population statistics that computes the number of closed-circuit television (CCTV) cameras installed in a local authority area by administrative districts in Malaysia31. We further computed neighborhood safety as the number of CCTV coverage per 1000 population and acknowledged the effectiveness of the coverage for enhancing safety in deprived neighborhoods42. We computed average property crime index per 1000 population between the years 2018 and 2020 by administrative districts, sourced from the Royal Malaysia Police data, available through Malaysian population census31. Property crime index was defined according to the Standing Order of the Inspector General of Police (PTKPN) D203 that include frequently reported cases with sufficient significance to be considered as an important indicator; these include crimes related to house break-ins and thefts, vehicle thefts (e.g., van, lorry, motorcar, motorcycle), and other thefts (e.g., pick pocket, bicycle theft, public property theft)31. We calculated average drug addicts per 1000 population between the years 2018 and 2020 by administrative districts, sourced from the National Anti-Drugs Agency Malaysia data, available through the Malaysian population census31. Drug addicts were contextualized as per case data (i.e., those who have one or more offences in the current year). It refers to the psychoactive chemicals used (excluding alcohol, tobacco, and inhalants) not for medical purposes where their usage is prohibited, and that these substances lead to increased physical and psychological dependence and tolerance causing adverse effects on health, self, family, and society31. These crimes or addiction activities are vulnerable to cause societal chaos, vandalism, public area property dysfunction, or safety issues within disordered neighborhoods43,44. 6. *Neighborhood fitness indicator and built-environment proximity -* Neighborhood fitness indicator by administrative-districts was assessed with a continuous variable based on the proportion of adult population aged ≥ 20 years not engaged in fitness, exercise, or sports activities, retrieved from the Population Wellbeing Fitness Survey31. The coverage of built-environments within neighborhoods were assessed through the availability of public facilities for communities’ fitness, recreational, exercise, or sports activities. This indicator was measured as the percentage of neighborhood areal access to recreational parks, jogging tracks, cycling tracks, mini stadiums, football fields, gymnasium, and bowling centers within five kilometers of proximity from resident’s home, which serves as a proxy for fitness, exercise, or physical activities available to local neighborhood residents. The rationale for the inclusion of different covariates on the types of sports, exercise, fitness, or recreational facilities in our model was based on the compositional structure of neighborhoods demography, as human settlements are ultimately composed of communities with different age groups. As a result, different sports or fitness activities attract residents of different age motivated to engage in different activity levels within their local neighborhoods. For example, members of older age groups often are likely to be engaged in light intensity physical activity such as walking, thus would seek access to recreational parks; whereas younger to middle aged groups are likely to be engaged in moderate to vigorous intensity physical activity such as running or jogging, bowling, running on a treadmill, soccer, or cycling, thus would likely access jogging tracks, cycling tracks, mini stadiums, football fields, gymnasium, or bowling centers45. Neighborhood sports and recreational facilities infrastructure development falls primarily under the jurisdiction of local municipalities. They are structured based on the fundamentals of “sports place theory,” a subset of the “central place theory” that correlates the provision of sports or recreational facilities with the level or urbanization processes within neighborhoods (i.e. greater urbanization have higher land use for the population, thus catalyze greater development of sports or fitness infrastructures for healthy urbanism)46-48. The national level cut-off points metrics of accessibility to sports and recreational facilities of at least five kilometers from residence home retrieved for this study represents a standard measure used in township development by local municipalities, a measure that also accounts for neighborhood demographics, urbanization processes, and neighborhood socioeconomic statuses for different types of sports infrastructure, as reported in previous work49,50. Our data for access to sports, fitness, or recreational facilities were sourced from the Population Wellbeing Fitness Survey31. ### Data Analysis We conducted a comprehensive analysis involving the associations between T2D rates and neighborhood indicators across administrative districts. Following a summary of descriptive statistics for all variables used in our analysis, we executed exploratory spatial data analyses (ESDA), non-spatial correlations, spatial autocorrelations, and spatial regression modelling to evaluate the relationships between neighborhood-level attributes and T2D rates. We evaluated the distribution of crude rates via Q-Q plots if they were normally distributed, and if not, they were log-transformed to meet the assumption of a Gaussian distribution for spatial model fitting. As crude rates were subjected to variance instability that leads to spurious outliers, we performed rate smoothing via Empirical Bayes approach via GeoDA version 1.18 software (Center for Spatial Data Science University of Chicago, IL, USA) to visualize T2D distribution via a smoothed rate map. ### Cartographic Visualizations We performed Geographic Information System (GIS) linkage through overlay of state boundaries and administrative districts through shapefiles, and spatially joined attribute data of various data sources from multiple agencies. ESDA involved cartographic development of a quantile map using Quantum GIS (QGIS), version 3.22 Bialowieza. This enabled geo-visualization of the distribution of T2D by administrative districts in Malaysia. Bivariate choropleth maps were built (via plugins ‘Bivariate Legend’) in QGIS using *n2* classes (three categories for each covariate yielding a total of nine categories) to map the combinations of statistically significant covariates associated with diabetes rates across geographies from the spatial regression models. The results were expressed as tertile groups, meaning 33% of the districts were classified in each of the “low,” “medium,” and “high” categories. ### Non-spatial Correlations and Spatial Autocorrelations We conducted non-spatial analysis using Pearson correlations (*r*) between logged T2D rates and neighborhood indicators measured at the same area (within districts) using SPSS software version 23.0 (IBM). We then computed Global Moran’s *I* indexes to inspect the spatial autocorrelation of the outcome and of the covariates being tested, using first-order Queen’s contiguity spatial weights matrix that define neighbors as administrative-districts sharing a common border or corner using GeoDA version 1.18 software (Center for Spatial Data Science University of Chicago, IL, USA). We determined Queen’s contiguity as the best spatial weight matrix for our work after running a series of twenty matrices for local clustering of T2D rates (i.e., Queen contiguity first and second order; Rook contiguity first and second order; distance matrix 150, 200, 250, 300 kilometers; inverse distance 150, 200, 250, 300 kilometers; k-nearest neighbor where k=2, 3, 4, 5; and k-nearest neighbor inverse where k=2, 3, 4, 5). The Moran’s *I* value ranges approximately between -1 (negative spatial autocorrelation, showing neighborhood areas have dissimilar pattern) and 1 (positive spatial autocorrelation, suggesting neighborhood areas clustering with similar high or low values). A Moran’s *I* value near zero suggests complete spatial randomness. Moran’s *I* is expressed as follows51,52: ![Formula][1] where *Yi* denotes outcome of interest (i.e., T2D rates) in area *i,* ![Graphic][2] is the mean of covariate of interest, and *wij* is an *n × n* spatial weight matrix that measures the “closeness” between area *i* and its neighbor *j*. The adjacency-based spatial weight matrix is defined as51,52: ![Formula][3] ### Spatial Simultaneous Autoregressive Modelling We first fit an ordinary least squares (OLS) regression model at the univariate and multivariate level for a log-linear distribution of T2D crude rates. The OLS regression equation is expressed as follows: ![Formula][4] where *y* is a vector of the main outcome variable (i.e., the natural log of T2D rates), *X* is a matrix of observations of the covariates, *β* is a vector coefficient for the covariates, and *ε* is the vector of independent and identically distributed random error terms51,53. We next considered spatial econometric modelling approaches through spatial simultaneous autoregressive (SAR) models (i.e., spatial lag and spatial error models) if the OLS regression residuals exhibited statistically significant spatial autocorrelation, via a Global Moran’s *I* statistic and Lagrange Multiplier tests, using a first-order Queen’s contiguity matrix. A spatial lag model (SLM) examines how logged T2D rates in a district is influenced by the disease burden in adjacent districts. The interpretation of the spatial lag parameter (*ρ*) refers how the average logged T2D rates in neighboring districts is associated with the logged T2D rates of a focal district. The equation is expressed as follows: ![Formula][5] where *y* is the outcome variable (i.e., logged T2D rates), *Wy* is the spatially lagged main outcome variable for y (i.e., the weighted average of outcome variable of neighboring areas), *W* is the spatial weight matrix, *X* is the matrix of observations of the covariates, *β* is a vector coefficient for the covariates, *ε* is the vector of independent and identically distributed random error terms, *ρ* is the spatial autoregressive coefficient for the lagged variable *Wy* and the values range between -1 and 1 [*ρ* > 1 indicates neighborhoods surrounding each other have similar values relative to the outcome (i.e., high or low logged T2D rates); *ρ* < 0 indicates high T2D rates neighborhoods being surrounded by low logged T2D rates neighborhoods and vice versa; *ρ* = 0 indicates no spatial dependence)]51,53. In contrast, a spatial error model (SEM) estimates the extent to which the OLS residual of a district is correlated with its adjacent districts. The spatial error parameter (*λ*) measures the strength of the relationship between the average residuals or errors in neighboring districts and the residual or error of a given district. The SEM equation is expressed as follows: ![Formula][6] where *y* is the outcome variable (i.e., logged T2D rates), *Wu* is the spatially lagged error term for the main outcome variable *y*, *W* is the spatial weight matrix, *λ* is the spatial autoregressive coefficient for the error terms, *X* is the matrix of observations of the covariates, *β* is a vector coefficient for the covariates, *ε* is the vector of independent and identically distributed random error terms. ### Covariate Selection and Interactions We note the saturation of our neighborhood covariates that were hypothetically plausible to influence T2D rates by administrative districts. With multiple covariates in the model, there were tendencies for non-spatial and spatial correlations/regression models to be influenced by confounders and interaction effects. We therefore implemented a strategy for covariate selection in the multivariate OLS and spatial models: 1. We first visualized all covariates correlations through scatterplot matrices and assessed these correlations for statistical significance. 2. We then determined covariates of significant clustering at the univariate spatial autocorrelation level, with positive Global Moran’s *I* values (presence of clustering) as fit to be included in the multivariate OLS and spatial regression models. 3. We subsequently fit regression models via ordinary least squares (OLS) and assessed potential collinearity between covariates via variation inflation factor (VIF) values. Here, covariates with statistically significant association with the outcome and VIF values of ≤ 5 were determined to be suitable for inclusion in our final multivariate OLS model. 4. In addition, covariates evidenced to have biological plausibility (i.e. even if the covariate was not statistically significant at the univariate OLS level but importantly being a risk factor for diabetes as evidenced by the national country report, or the literature theoretically, or are biologically influencing predictors like gender, age, or ethnicity) were included in the full multivariate models. Our approach for regression model building was based on the conceptualization that covariates selected in the models were based on analytical-knowledge thinking (i.e. we considered each covariate alone within the classes then seek to build a set of covariates within each domain of neighborhood covariates, finding the most significant and the least correlated covariates within that domain (visualized in the scatterplot matrices followed by the subsequent steps listed above) Given the grouping of covariates into domains addressing similar types of measures, we avoided generic, automated variable selection techniques (e.g. stepwise approach) that would forcibly select or deselect neighborhood attributes based on statistical significance alone which has a tendency to yield high R-squared values for biased estimates, or cause collinearity within covariates54. We further performed a two-way interaction analysis for the statistically significant attributes from the SAR models to distinguish and quantify the main and interaction effects within the spatial attributes tested in our analysis, an advancement in neighborhood analysis not previously reported elsewhere. All spatial autocorrelations, spatial regressions, and interaction analyses were conducted using GeoDa version 1.18 (Center for Spatial Data Science University of Chicago, IL, USA). ### Interpretations of Statistical Significance and Model Performance The statistic with the highest value (and lowest *p*-value) will indicate the proper specification for the data. We took a more nuanced approach as suggested recently in the interpretation of statistical significance for models dealing with spatial dependence53; that if *p*-value is less than 0.001 (\***|), between 0.001 and 0.01 (**), between 0.01 and 0.05 (*), between 0.05 and 0.10, or greater than 0.10 to consider the evidence linking main outcome and neighborhood attributes (i.e., against the null hypotheses) to be very strong, strong, moderate, weak, or none, respectively. If spatial models were necessary, the OLS and SAR models were compared using the Akaike Information Criterion (AIC), whereby a lower AIC value indicates a better model fit. ## Results ### Neighborhood Characteristics Table (1) enumerates neighborhood characteristics by administrative districts in Malaysia. The average proportion of women in a district was 50% while the average proportion of ethnic minorities (Indians) in a district was 4%, with the maximum of 20% Indians being populated in the district of Port Dickson. The average proportion of the majority ethnic group (Bumiputera) in a district was 81%. Regarding age composition, on average, 30% were aged 35-49 years, 21% were aged 50-64 years, and 11% were aged 65 years or older. In an average Malaysian district, the proportion of households living below fifty percent of median income was 16.78%, the proportion of households living below the national poverty line was 10.96%, while the average income inequality and unemployment rates by districts was 0.36 and 5.43% respectively. With rapid urbanization across Malaysia, on average, urban growth rate was 3.18%. For neighborhood safety, the average CCTV coverage per 1000 population was 0.22, with the maximum coverage of 7.85 CCTV per 1000 population in the city of Putrajaya. The average proportion of property crimes per 1000 population was 2.60, while an average of 1.42 per 1000 population using illegal drugs in any neighborhood. The average proportion of population not engaged in fitness, exercise, or sports activities was 44%. The average percentage of areas with access to parks was 68.31%, while the average percentages with access to jogging track, cycling track, mini stadium, football field, gymnasium, and bowling center were 63.92%, 57.74%, 51.57%, 83.64%, 59.8%, and 26.8% respectively. View this table: [Table 1.](http://medrxiv.org/content/early/2024/10/27/2024.10.26.24316183/T1) Table 1. Neighborhood characteristics (n = 144) ### Distribution of Type 2 Diabetes Because QQ-plots suggests that the distribution of T2D rates was not Gaussian, we log transformed this variable to meet the assumption of normal error distribution for fitting linear regression, spatial autocorrelations, and spatial simultaneous autoregressive (SAR) models (Supplementary Figure 1). ![Supplementary Figure 1](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/10/27/2024.10.26.24316183/F3.medium.gif) [Supplementary Figure 1](http://medrxiv.org/content/early/2024/10/27/2024.10.26.24316183/F3) Supplementary Figure 1 Results of QQ-plots showing non-Gaussian distribution of type 2 diabetes crude rates (top panel) and Gaussian distribution of logged type 2 diabetes crude rates (bottom panel) ### Logged Transformed Distribution Figure (1) shows a quantile map of the distribution of logged T2D crude-rates among adults aged ≥20 years by administrative districts in Malaysia. Higher rates of T2D were clustered along the districts nested within the Southern, Central, Northern, and parts of the East Coast regions in the country. For East Malaysia (Borneo) states, T2D rates were highly clustered along the Northwest districts of Sarawak. Lower rates of T2D were mostly clustered in the East Coast region, Northeast region in the state of Sarawak, and most districts in the state of Sabah. Quantile map of the untransformed crude rates is available in Supplementary Figure 2. ![Supplementary Figure 2](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/10/27/2024.10.26.24316183/F4.medium.gif) [Supplementary Figure 2](http://medrxiv.org/content/early/2024/10/27/2024.10.26.24316183/F4) Supplementary Figure 2 Results of quantile map showing untransformed crude rates of type 2 diabetes per 100,000 population by administrative districts in Malaysia ![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/10/27/2024.10.26.24316183/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2024/10/27/2024.10.26.24316183/F1) Figure 1. Distribution of T2D rates (logged) by administrative districts in Malaysia ### Geostatistical Interpretation of T2D Distribution The box-map revealed three upper outliers, namely the districts of Meradong, Sarikei, and Simunjan in the state of Sarawak, but no unusual low-rate outliers. The population mean [standard deviation (SD)] of T2D rates was 1704.61 (1010.70) per 100,000 people, and the rates ranged between 163.70 to 6392.31 per 100,000 population throughout districts in Malaysia. The median [interquartile range (IQR)] of T2D rates distribution at the countrywide level was 1593.92 (1243.45) per 100,000 people. Up to a quarter of districts had less than 972.15 per 100,000 people with T2D; these lower-rate regions were clustered around the North of Kelantan, parts of Selangor, Kedah, North-West Sarawak, and almost all districts in Sabah. A quarter up to half of the districts reported T2D rates to range between 972.15 per 100,000 people and less than 1593.92 per 100,000 people; most of these moderate-rate districts were dense in the state of Pahang, East and West of Sarawak. From half up to three quarter of the districts had T2D rates ranged between 1593.92 per 100,000 people to less than 2215.59 per 100,000 people; these districts were distributed across the Southern, Central, and Northern regions, with some scattered in the East Coast of Malaysia. Over three quarters of the districts had T2D rates ranging between 2215.59 per 100,000 people to less than 4080.76 per 100,000 people; these areas mostly included districts in the Southern, Central, Northern and East-Coast regions. The districts in the North, South, North-West, and South-West areas of Sarawak observed similar crude rates. The visualization of the geographic distribution of crude rates appears in Supplementary Figure 3. ![Supplementary Figure 3](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/10/27/2024.10.26.24316183/F5.medium.gif) [Supplementary Figure 3](http://medrxiv.org/content/early/2024/10/27/2024.10.26.24316183/F5) Supplementary Figure 3 Results showing a box map – boxplot geostatistical summary of the distribution of type 2 diabetes crude rates per 100,000 population among adults by administrative districts in Malaysia. ### Rate Smoothing Since local rates are based on different population sizes, local rate estimates can vary in precision. Empirical Bayes smoothing approaches allow us to pool information from neighboring districts to improve precision for rate estimates from low-population-size districts52. The Empirical Bayes smoothed rate map for district-level T2D rates appears in Supplementary Figure 4. In this application, the observed distribution of smoothed rates does not vary much from that of the logged transformed crude rates, due to the relatively high rate of T2D compared to the rarer health outcomes (e.g., cancer) often featured in small area estimates of local rates. ![Supplementary Figure 4](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/10/27/2024.10.26.24316183/F6.medium.gif) [Supplementary Figure 4](http://medrxiv.org/content/early/2024/10/27/2024.10.26.24316183/F6) Supplementary Figure 4 Results showing an Empirical Bayes smoothed rates map of type 2 diabetes distribution by administrative districts in Malaysia. ### Non-Spatial and Spatial Correlations of Diabetes Rates The Global Moran’s *I* (spatial autocorrelation) for logged T2D rates was 0.370 (*p* = 0.001), indicating a significant positive spatial autocorrelation. The Local Indicators of Spatial Autocorrelation (LISA), i.e., the local Moran’s index, identified spatial clusters where high rates occurred near other high rates across districts within the state of Negeri Sembilan, Kedah, and parts of Sarawak, whereas clusters of low rates near other low rates were concentrated across rural districts in Sabah and small areal districts in Selangor (Figure 2A). Figure (2B) enumerates the LISA significance of these clusters. ![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/10/27/2024.10.26.24316183/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2024/10/27/2024.10.26.24316183/F2) Figure 2. [A] Univariate LISA cluster map; [B] Univariate LISA significance map of spatial clustering and outliers The Global Moran’s *I* for neighborhood characteristics appears in Table (2). The spatial autocorrelation analysis (*I*) had eleven additional statistically significant coefficients in contrast to the statistically significant coefficients available in the non-spatial conventional linear correlations (*r*). We found that the proportion of women, the proportion of Bumiputera, the proportion of Indians, the proportion of adults aged 35-49 years, 50-64 years, and 65 years or older, proportion of households living below 50% of median income, proportion of households living below national poverty line, income inequalities, drug users per 1000 population, proportion of population not engaged in fitness/exercise or sports activities, percentage of areas with access to parks, jogging track, cycling track, football field, gymnasium, and bowling center within five kilometers of proximity from neighborhoods showed spatial autocorrelation coefficients larger than conventional linear correlation coefficients (*I* > *r*), suggesting that these correlations were highly determined by geographic locations. View this table: [Table 2.](http://medrxiv.org/content/early/2024/10/27/2024.10.26.24316183/T2) Table 2. Non-spatial correlation with logged T2D rates and spatial autocorrelations for covariates reported from small areas (n = 144) ### Baseline Regressions We synthesized a baseline univariate ordinary least squares (OLS) regression with VIF values to determine covariates to be included in the multivariate OLS and SAR models (Supplementary Table 1). All covariates with statistical significance, VIF values ≤ 5, significant clustering, and biologically plausible were included in the full multivariate OLS model (Table 3). Subsequently we evaluated the diagnostic performance of the model if further subjected for spatially weighted spatial lag or error models. View this table: [Supplementary Table 1](http://medrxiv.org/content/early/2024/10/27/2024.10.26.24316183/T5) Supplementary Table 1 Results of univariate ordinary least squares regression with variation inflation factor (VIF) values for variable selection into the multivariate OLS model. View this table: [Table 3.](http://medrxiv.org/content/early/2024/10/27/2024.10.26.24316183/T3) Table 3. Full multivariate OLS model estimation on the association between neighborhood indicators and local logged diabetes rates (n = 144) ### Full Multivariate Ordinary Least Squares (OLS) Model Estimation on the Association between Neighborhood Indicators and Logged T2D Rates Table (3) exhibits the findings of the full multivariate ordinary least squares (OLS) model estimation on the associations between neighborhood indicators and T2D rates (logged). On the influence of neighborhood’s socio-economic vulnerability, there was strong evidence that T2D rates had a positive relationship with the proportion of households living below fifty percent of the median income; each one percent increase of households living below fifty percent of the median income was associated with 0.009 increase of T2D rates (β = 0.009, *p* = 0.006). However, there was strong evidence that T2D rates had a negative relationship with the proportion of households living below the national poverty line; each one percent increase of households living below the national poverty line was associated with 0.012 decline of T2D rates (β = 0.009, *p* = 0.006). There was moderate evidence that T2D rates had a negative relationship with income inequalities; each one unit rise in income inequalities was associated with 2.028 decline of T2D rates (β = -2.028, *p* = 0.010). For neighborhood’s accessibility to fitness facilities, there was strong evidence that T2D rates had a positive relationship with areal proximity to parks; each additional one percent of areal proximity to a park was associated with 0.007 percent rise of T2D rates (β = 0.007, *p* = 0.003). In contrast, there was moderate evidence that T2D rates had a negative relationship areal proximity to a bowling center; each additional one percent of areal proximity to a bowling center was associated with a 0.003 decline of T2D rates (β = - 0.003, *p* = 0.022) (Table 3). The OLS model also weakly adjusted for one neighborhood safety covariate (i.e., property crimes per 1000 population, β = 0.013, *p* = 0.067) and one neighborhood accessibility to fitness facility covariate (i.e., access to mini stadium, β = 0.003, *p* = 0.097). A standard deviation map of residuals from the OLS regression is exhibited in Supplementary Figure 5. The p-value for the Global Moran’s *I* residuals of the OLS model was statistically significant (*p* = 0.035). Similarly, the p-values for the Lagrange Multiplier tests (lag value) were statistically significant, indicating that a subsequent execution of the spatial lag model (SLM) was necessary (Table 3). The model accounted for 45.6% of the total variance for the relationships between T2D rates and neighborhood attributes. ![Supplementary Figure 5](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/10/27/2024.10.26.24316183/F7.medium.gif) [Supplementary Figure 5](http://medrxiv.org/content/early/2024/10/27/2024.10.26.24316183/F7) Supplementary Figure 5 Results showing a standard deviation map of the Ordinary Least Squares Regression (OLS) residuals by administrative districts in Malaysia. ### Full Multivariate Spatial Lag Model (SLM) Estimation on the Association between Neighborhood Indicators and Logged T2D Rates Table (4) shows the results of the spatial lag model (SLM) estimation on the associations between neighborhood indicators and T2D rates (logged). Overall, the coefficients of the covariates in the SLM model were further adjusted than those yielded in the OLS model. The statistical significance of the coefficients was stronger in the SLM model as compared to the OLS model, with an addition of two covariates: CCTV coverage and property crimes having moderate strength of significance in the model (*p* = 0.023 and *p* = 0.033 respectively). View this table: [Table 4.](http://medrxiv.org/content/early/2024/10/27/2024.10.26.24316183/T4) Table 4. Full multivariate SLM model on the association between neighborhood indicators and logged diabetes rates (n = 144) With reference to neighborhood socio-economic vulnerability, each additional one percent of households living below fifty percent of the median income was associated with 0.009 percent rise of logged T2D rates, with all other covariates held constant (β = 0.009, *p* = 0.002). In contrast, each additional one percent of households living below the national poverty line was associated with 0.012 percent decline of logged T2D rates, with all other covariates held constant (β = - 0.012, *p* = 0.001). Each additional one unit rise of income inequality was associated with 2.005 percent decline of logged T2D rates, with all other covariates held constant (β = - 2.005, *p* = 0.004) (Table 4). On the influence of neighborhood safety, we found that each additional one unit rise of CCTV coverage was associated with 0.070 percent increase of logged T2D rates, with all other covariates held constant (β = 0.070, *p* = 0.023). Each additional one unit rise in property crimes was associated with 0.014 percent increase of T2D rates, with all other covariates held constant (β = 0.014, *p* = 0.033) (Table 4). For neighborhood’s accessibility of built-environments, we found that each additional one percent rise of areal proximity to a park was associated with a 0.007 percent increase of logged T2D rates (β = 0.007, *p* = 0.001), and each additional one percent rise of areal proximity to a bowling center was associated with a 0.003 percent decline of logged T2D rates ((β = - 0.003, *p* = 0.019) (Table 4). The SLM model also weakly adjusted for one neighborhood demography covariate (i.e., proportion of adults aged 35-49 years, *p* = 0.056) and one neighborhood accessibility to fitness facility covariate (i.e., mini stadium, *p* = 0.051) (Table 4). The model accounted for 47.2% of the total variance for the relationships between logged T2D rates and neighborhood attributes. Residuals are reported in Supplementary Figure 6. ![Supplementary Figure 6](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/10/27/2024.10.26.24316183/F8.medium.gif) [Supplementary Figure 6](http://medrxiv.org/content/early/2024/10/27/2024.10.26.24316183/F8) Supplementary Figure 6 Results showing a standard deviation map of the Spatial Lag Regression (SLM) model residuals by administrative districts in Malaysia. ### Evaluating Two-Way Interactions between Spatial Attributes The geo-visualization of bivariate quantile maps for the statistically significant covariates in the SLM model that influenced the distribution of T2D rates by administrative districts is available in Supplementary Figures 7-13. ![Supplementary Figure 7](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/10/27/2024.10.26.24316183/F9.medium.gif) [Supplementary Figure 7](http://medrxiv.org/content/early/2024/10/27/2024.10.26.24316183/F9) Supplementary Figure 7 Supplementary figure 7 demarcates a bivariate choropleth map on the associations between diabetes rates and the proportion of households living below 50% of median income by administrative districts in Malaysia. Districts within the states of Johor (Kota Tinggi, Pontian, Kluang, Segamat, Mersing), Negeri Sembilan (Kuala Pilah, Rembau), Selangor (Sabak Bernam, Ulu Selangor), Kedah (Yan), Terengganu (Hulu Terengganu), Sabah (Beaufort, Kuala Penyu), and Sarawak (Kapit, Tatau, Daro, Sarikei, Meradong, Betong, Sri Aman, Lubok Antu, Simunjan, Serian) fell into the high-proportion/high-rates group. One district in Sabah (Penampang), three in North Sarawak (Miri, Bintulu, Tatau), some scattered districts in the Central, Northern and East-Coast regions fell into the low-proportion/high-rates diabetes group. ![Supplementary Figure 8](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/10/27/2024.10.26.24316183/F10.medium.gif) [Supplementary Figure 8](http://medrxiv.org/content/early/2024/10/27/2024.10.26.24316183/F10) Supplementary Figure 8 Supplementary figure 8 shows a bivariate choropleth map on the associations between diabetes rates and the proportion of households living below national poverty line by administrative districts in Malaysia. Districts within the states of Johor (Mersing), Kedah (Yan), Sabah (Kunak), and Sarawak (Lawas, Selangau, Daro, Meradong, Betong, Simunjan, Serian) fell into the high-proportion/high-rates group. Relatively sparse districts within the Southern, Central, and Northern regions plus four districts in Sarawak (Miri, Bintulu, Tatau, Kapit) and one in Sabah (Penampang) fell into the low-proportion/high-rates diabetes group. ![Supplementary Figure 9](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/10/27/2024.10.26.24316183/F11.medium.gif) [Supplementary Figure 9](http://medrxiv.org/content/early/2024/10/27/2024.10.26.24316183/F11) Supplementary Figure 9 Supplementary figure 9 exhibits a bivariate choropleth map on the associations between diabetes rates and income inequalities (measured by Gini coefficient) by administrative districts in Malaysia. Districts within the states of Negeri Sembilan (Tampin), Melaka (Jasin), Selangor (Sabak Bernam), Sabah (Kunak), and Sarawak (Selangau) fell into the high-inequalities (third tertile)/high-rates group. Most districts within the Southern, Central, East Coast, and Northern regions plus districts in the North, East, and South of Sarawak and one in Sabah (Penampang) fell into the low-inequalities (first tertile)/high-rates diabetes group. ![Supplementary Figure 10](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/10/27/2024.10.26.24316183/F12.medium.gif) [Supplementary Figure 10](http://medrxiv.org/content/early/2024/10/27/2024.10.26.24316183/F12) Supplementary Figure 10 Supplementary figure 10 shows a bivariate choropleth map on the associations between diabetes rates and CCTV coverage per 1000 population by administrative districts in Malaysia. High CCTV coverage/high-diabetes rates were mainly focused in two major points, namely the districts of Cameron Highlands (Pahang) and Timur Laut (Pulau Pinang). Diabetes rates were relatively high across most districts in Peninsular Malaysia and the state of Sarawak, coherently these districts had relatively low CCTV coverage per 1000 population. ![Supplementary Figure 11](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/10/27/2024.10.26.24316183/F13.medium.gif) [Supplementary Figure 11](http://medrxiv.org/content/early/2024/10/27/2024.10.26.24316183/F13) Supplementary Figure 11 Supplementary figure 11 shows a bivariate choropleth map on the associations between diabetes rates and property crimes per 1000 population by administrative districts in Malaysia. Districts with high property crimes/high-diabetes rates were found in Johor (Kota Tinggi), Negeri Sembilan (Tampin, Kuala Pilah, Rembau), Pahang (Maran), Selangor (Sabak Bernam), Terengganu (Besut), Kedah (Kubang Pasu, Kota Setar, Pendang, Kuala Muda), Sabah (Penampang), and Sarawak (Miri, Bintulu, Sarikei, Meradong). Consistently, diabetes rates were relatively high across districts in Kerian (Perak), Bandar Baharu (Kedah), Cameron Highlands (Pahang), Marang (Terengganu), and Lawas, Kapit, Tatau, Selangau, Daro, Betong, Lubok Antu, Simunjan, Serian (Sarawak) where property crimes were relatively low. ![Supplementary Figure 12](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/10/27/2024.10.26.24316183/F14.medium.gif) [Supplementary Figure 12](http://medrxiv.org/content/early/2024/10/27/2024.10.26.24316183/F14) Supplementary Figure 12 Supplementary figure 12 exhibits a bivariate choropleth map on the associations between diabetes rates and neighbourhood areal access to recreational parks by administrative districts in Malaysia. Districts with high percentage access/high-diabetes rates were found in Johor (Kota Tinggi, Mersing, Kluang, Segamat), Melaka (Jasin), Negeri Sembilan (Jelebu), Pahang (Cameron Highlands, Jerantut), Selangor (Ulu Selangor, Sabak Bernam), Perak (Kampar), Sabah (Kunak), and Sarawak (Tatau). Diabetes rates were relatively high across districts in Rembau (Negeri Sembilan), Maran (Pahang), Besut and Hulu Terengganu (Terengganu), Kubang Pasu, Kota Setar, Yan, and Pendang (Kedah), and Lawas, Kapit, Selangau, Daro, Betong, Lubok Antu, Simunjan, Serian, and Sarikei (Sarawak) where access to recreational parks were low. ![Supplementary Figure 13](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/10/27/2024.10.26.24316183/F15.medium.gif) [Supplementary Figure 13](http://medrxiv.org/content/early/2024/10/27/2024.10.26.24316183/F15) Supplementary Figure 13 Supplementary figure 13 exhibits a bivariate choropleth map on the associations between diabetes rates and neighbourhood areal access to bowling centers by administrative districts in Malaysia. Districts with high percentage access/high-diabetes rates were found in Johor (Kota Tinggi, Pontian, Kluang, Segamat), Negeri Sembilan (Jempol, Tampin), Pahang (Temerloh), Perak (Batang Padang), Kedah (Kuala Muda, Kota Setar, Kubang Pasu), Sabah (Kunak), and Sarawak (Bintulu). Diabetes rates were relatively high across districts in Jelebu and Kuala Pilah (Negeri Sembilan), Sabak Bernam (Selangor), Cameron Highlands and Jerantut (Pahang), Yan and Pendang (Kedah), Marang and Hulu Terengganu (Terengganu), Lawas, Kapit, Selangau, Daro, Betong, Lubok Antu, Simunjan, Serian, Tatau, Meradong, Sri Aman, and Sarikei (Sarawak) where access to bowling centers were low. Next, we ran 31 two-way interaction modelling analyses contextualized within the framework of statistical significance from the full multivariate SLM model to investigate the interactions between (1) access to fitness facilities with neighborhood poverty and inequity; (2) access to fitness facilities with neighborhood safety and inequity; and (3) access to fitness facilities with neighborhood poverty and safety on the influence T2D. These models allow logical interpretations within the SLM model exploring attenuating or accelerating influences of interaction effects influencing neighborhood T2D rates in Malaysia. In the SLM two-way interaction modelling analysis that evaluated access to fitness facilities with neighborhood poverty and inequity, we found significant negative association with local income inequality (β = -6.952, p = 0.002) but no significant association with the proportion of households living below 50% of median income (β = 0.004, p = 0.134). The effect of local income inequality was moderated by the presence of parks as indicated by the significant interaction for (parks*income inequality) (β = 0.063, p-value for interaction = 0.037) (Model 3, Supplementary Table 2). View this table: [Supplementary Table 2](http://medrxiv.org/content/early/2024/10/27/2024.10.26.24316183/T6) Supplementary Table 2 Results of two-way interaction analyses adjusted for spatial correlation between access to fitness facilities with neighborhood poverty and inequity indicators from spatial lag model (SLM). In assessing two-way interactions between access to fitness facilities with neighborhood safety and income inequity covariates, the SLM model showed a relatively weak significance for interaction terms in the SLM model: (parks*income inequalities) (β = 0.059, p-value = 0.047), and no significant interaction with local CCTV coverage per 1000 population (β = 0.048, p = 0.069) (Model 4, Supplementary Table 3). View this table: [Supplementary Table 3](http://medrxiv.org/content/early/2024/10/27/2024.10.26.24316183/T7) Supplementary Table 3 Results of two-way interaction analyses adjusted for spatial correlation between access to fitness facilities with neighborhood safety and inequity indicators from spatial lag model (SLM). Finally, we assessed the two-way interactions between access to fitness facilities with neighborhood poverty and safety. We found significant positive associations with the main effects, property crimes per 1000 population (β = 0.060, p = 0.005) and access to parks (β = 0.005, p = 0.003), but negative associations with the proportion of population living below poverty line (β = -0.011, p<0.001) or access to bowling center (β = -0.005, p<0.001). However, the interaction effect (parks*property crimes) revealed significant negatively association with T2D distribution (β = - 0.001, p-value = 0.010) (Model 5, Supplementary Table 4). View this table: [Supplementary Table 4](http://medrxiv.org/content/early/2024/10/27/2024.10.26.24316183/T8) Supplementary Table 4 Results of two-way interaction analyses adjusted for spatial correlation between access to fitness facilities with neighborhood poverty and safety indicators from spatial lag model (SLM). Taken together, we find significant local associations between local measures of income inequality, property crimes per 1000 population, and access to parks. However, these variables, and other measures of local physical activity locations, also reveal moderated effects on T2D in the presence of other local measures of income, access to physical activity, and safety. These interactions suggest further research exploring the specific neighborhood compositions of these features to provide insight on local prevention measures. ## Discussion We found that areas with high T2D rates were predominantly concentrated along the West of Peninsular Malaysia, across districts in the Southern, Central, and Northern regions, and parts of the areal districts in the North, South, and West of Sarawak state. We observed significant spatial autocorrelations on T2D rates and neighborhood indicators, suggesting that these areal-level indicators had fully or partly influenced the formation of T2D clusters across local geographies in Malaysia. As noted above, our final SLM model revealed a triad of interactions between neighborhood socio-economic vulnerabilities, neighborhood safety features, the influence of local residential access to neighborhood built-environments fitness activities, and T2D burden. These associations were consistent with previous geospatial investigations from the USA14-16,55, Saudi Arabia56, Australia57, Bangladesh58, and Brazil59. We note that these studies originate from both developed and developing nations with different geo-political, social, and cultural norms; alongside differences of spatial attributes within the rural-urban matrix. The applications of place-based correlations and regression modelling at different spatial scales in those studies, either at the regional, state, or county levels yielded different directions of relationships of the effect sizes when linking with local neighborhood risks, a condition susceptible to the modifiable areal unit problem (MAUP) and provide insight into the spatial scale of main effects and their interactions on local risk. Our study explored neighborhood effects at the district level in Malaysia, a boundary used by the national population census estimates that allow smaller area estimates for policy making. Income and poverty are strong population health indicators with higher income typically indicative for better health. Studies have found positive correlations between these indicators and T2D60,61, but our study found otherwise. A unit rise in the proportion of households living below the national poverty line was related to a decrease in T2D rates, but a unit rise in the proportion of households living below fifty percent of the median household income increased the risk of T2D across neighborhoods. We note that the metrics of spatial economic covariates through spatially adjusted econometric approaches with ecological linkages to T2D rates have different cut-offs and that the income distributions in Malaysia have local associations with the urban/rural geography of the country. Specifically, the proportion of households living below the national poverty line (i.e., extreme poor with monthly income cut-off of USD 533.10) were mostly distributed across the rural districts in the state of Sabah, and parts of the rural districts within the East Coast region of the country – these areas had low T2D rates (see joint distributions in Supplementary Figures 7-13). In contrast, the proportion of households living below fifty percent of median household income (i.e., households with income cut-off of USD 709.54) had positive relationships with T2D rates. Although not within the extremes of poverty, this income group exists within the upper end of the bottom 40% (B40) lower income group classification in Malaysia that were densely distributed in most rural districts, with some clusters observed across the urban districts in the country. The distribution of T2D rates within this income group was mostly dense across the districts in East Malaysia and Southern regions, with some clusters around the Northern and Central metropolitans, and townships within the East Coast regions (see joint distributions in Supplementary Figures 7-13), consistent with the effect direction of our regression models. Inconsistencies of spatial distribution between different economic covariates linking to T2D or obesity rates persisted in previous works8,62-67, suggesting that local interactions of income with other risk factors may moderate observed associations. Other plausible explanations for differing associations could be that extreme poor communities in rural areas have limited health literacy or access to health facilities for T2D screening, whereas the urban poor had better access to subsidized and affordable health services for T2D screening, again, suggesting interactions between risk factors at the neighborhood level. While poverty accelerates disease risk and poor health outcomes, the associations between income inequality as measured by the Gini coefficient exhibited inconsistencies throughout the spatial epidemiology literature8,68,69. We found that a unit rise in income inequality decreased T2D rates, when principally it was anticipated to be high, a condition most likely attributable to the “Swiss paradox” wherein increased income inequality that supposedly cause negative health impacts would inversely show better population health outcomes when measured on a finer geographical scale of aggregation68. These areas mostly comprised of districts within the rural regions of similar population distributions residing below the national poverty line as described earlier. As income inequality is a contextual variable specific to geographical scales influenced by local social or commercial drivers, we note that inconsistencies between the association of income inequality and chronic conditions can be influenced by the different spatial measurement units (e.g., state, district, or county levels) or the scales used (i.e., a single measure through Gini coefficient or composite measures of neighborhood deprivation scores), and, as suggested by our interaction analysis, potential mediators or confounders that change the degree of associations where areal level attributes within neighborhood characteristics interact with local health outcomes6,8. Inequalities and poverty collectively pressure local communities within deprived neighborhoods, causing poor health outcomes. We postulate plausible psychosocial and neo-materialist theories whereby different social gradients within community hierarchy pose different levels of stress within neighborhood living circumstances, in addition to poor communities paying less attention to health needs, practicing healthy lifestyles, or having easy accessibility to health or fitness facilities as compared to richer ones70-72. Deprived neighborhoods are often compromised with neighborhood safety features as consequence of social delinquency, accelerated local crime rates, or drug addiction. Our findings showed that apart from poverty attributes, neighborhood safety features interacted with access to fitness facilities (i.e., bowling centers) and parks, likely due to fear and psychosocial stress depriving local communities from actively participating in physical fitness, a circumstance that accelerate incidence of obesity and T2D rates in local neighborhoods, consistent with previous literature6,8,15,16. ## Strengths and Limitations Our countrywide results provide baseline comprehensive evidence for defining local neighborhood policy interventions to control the T2D burden at the district level in Malaysia. We note areal disparities of different neighborhood attributes and clusters of T2D, thereby suggesting local behavioral health interventions, health promotions, or local townships restructure or sustainability efforts based on specific targets and areas rather than the practice of implementing generic interventions nationwide. As policies are implemented at an aggregate level rather than being individualized, the ecological regression approach promptly pointed out areas of high, low, or epidemiologically transitioning areas of local diabetes burden with their associated place-based characteristics, guiding planning, implementation, and ongoing evaluation of local public health responses. The data used in this study are reliable, accurate, and generalizable, as countrywide disease registries are official health data captured through proper diagnostics by health personnels using validated clinical practice guidelines, thereby tackling many limitations from more commonly used cross-sectional surveys that are subject to non-generalizability, social-desirability bias, compromised accuracies due to measurements used, and potential residual spatial correlation. Our use of spatial modelling approaches accounts for spatial autocorrelations and spillover effects73-76, thus strengthening evidence and interpretation of effect associations. We also acknowledge limitations of the study. While the data were aggregated to the district level in the NDR database, we note that T2D cases were captured based on the primary care clinic’s location in that district’s established catchment area (i.e., principally within five kilometers of residence from the clinic). Therefore, we were unable to establish spatial-analytical connections to the smallest spatial unit (i.e. sub-district or mukim levels) for more robust estimates as they may share local borders or corners that may contaminate the exact location of analysis within the anonymized residence address of the NDR data. ## Conclusion Our study established that district-level T2D rates in Malaysia are influenced by neighborhood-level socio-economic vulnerability, safety, and access to fitness facilities. The interactions between these covariates provides insight into the patterns of these risk factors, and their interacting effects, at this geographic scale, allowing for localized planning of public health interventions to reduce risk. ## Competing Interests None declared. ## Patient Consent for Publication Not applicable. ## Ethics Statement Ethical approval was granted from the ethics committee of Universiti Kebangsaan Malaysia (UKM), Ministry of Higher Education Malaysia (JEP-2022-445) and the Medical Research Ethics Committee (MREC), Ministry of Health Malaysia [NMRR ID-22-01264-EE7 (IIR)]. ## Author Contributions K.G. and M.R.A.M. designed the study. M.R.A.M., N.S., F.I.M. and M.F.M.R. had access and collected data. N.S. and M.F.M.R. managed the data. K.G. and L.A.W. conducted the data analysis. K.G., M.R.A.M., N.S., L.A.W., K.N.A.M. and F.I.M. contributed to the interpretation of the data. K.G. drafted the first version of the manuscript. M.R.A.M., L.A.W. and KNAM critically revised the manuscript for important intellectual content. All authors read and approved the final manuscript. ## Funding This work was supported by the Ministry of Higher Education (MOHE) Malaysia Fundamental Research Grant Scheme (FRGS/1/2022/SKK04/UKM/01/1). ## Data Availability Statement The data that support the findings of this study are available from the authors but restrictions apply to the availability of these data, which were used under license from the relevant agencies for the current study, and so are not publicly available. Data are, however, available from the authors upon reasonable request and with permission from the relevant agencies or ministries. ## Acknowledgement We thank the various government agencies and ministries in support of this work. We acknowledge the use of Royal Malaysia Police force and National Anti-Drug Agency data by the Ministry of Home Affairs of Malaysia, social population statistics of CCTV coverage data by the Ministry of Housing and Local Government of Malaysia, population census (MyCensus 2020) and Population Wellbeing Fitness Survey data, all of which were sourced through the Department of Statistics Malaysia (DOSM) under the Ministry of Economy Malaysia. We acknowledge the support by the Ministry of Higher Education Malaysia and the Universiti Kebangsaan Malaysia for funding this work and the resources provided to execute this countrywide study. We also thank the Non-Communicable Disease Section, Public Health Division of the Ministry of Health Malaysia, and the Office of Director General of Public Health for providing the National Diabetes Register (NDR) data and the support of this study to be in line with Malaysia’s Public Health Priorities target. ## Footnotes * E-mail addresses: NS: nazarudin{at}ppukm.ukm.edu.my LAW: lwaller{at}emory.edu FIM: dr.feisul{at}moh.gov.my KNAM: knam{at}ukm.edu.my MFMR: faidrizal{at}yahoo.com * Received October 26, 2024. * Revision received October 26, 2024. * Accepted October 27, 2024. * © 2024, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution 4.0 International), CC BY 4.0, as described at [http://creativecommons.org/licenses/by/4.0/](http://creativecommons.org/licenses/by/4.0/) ## References 1. 1.Bird, E., Ige, J., Burgess-Allen, J., Pinto, A. & Pilkington, P. Public health and wellbeing research group, healthy people healthy places evidence tool: evidence and practical linkage for design, planning and health. Technical Report. University of the West of England, Bristol [http://eprints.uwe.ac.uk/31390](http://eprints.uwe.ac.uk/31390) (2017). 2. 2.Duncan, D.T. & Kawachi, I. Neighborhoods and Health. New York, Oxford University Press (2018). 3. 3.Khatibi, M., Khaidzir, K.A.M. & Syed Mahdzar, S.S. Measuring the sustainability of neighborhoods: a systematic literature review. iScience 26:105951 (2023). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=36866036&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 4. 4.Ige-Elegbede, J. et al. Designing healthier neighbourhoods: a systematic review of the impact of the neighbourhood design on health and wellbeing. Cities Health 6(5):1004–1019 (2020). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=36618774&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 5. 5.Duncan, D.T., Kawachi, I., White, K. & Williams, D.R. The geography of recreational open space: influence of neighborhood racial composition and neighborhood poverty. J. Urban Health 90(4): 618–631 (2013). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s11524-012-9770-y&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23099625&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 6. 6.Mujahid, M.S., Maddali, S.R., Gao, X., Oo, K.H., Benjamin, L.A. & Lewis, T.T. The impact of neighborhoods on diabetes risk and outcomes: centering health equity. Diabetes Care 46(9):1609–1618 (2023). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=37354326&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 7. 7.Manderson, L. & Jewett, S. Risk, lifestyle and non-communicable diseases of poverty. Global Health 19:13 (2023). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=36864476&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 8. 8.Kim, D., Wang, F. & Arcan, C. Geographic association between income inequality and obesity among adults in New York state. Prev. Chronic Dis, 15:180217 (2018). 9. 9.Jayathilaka, R., Joachim, S., Mallikarachchi, V., Perera, N. & Ranawaka, D. Do chronic illnesses and poverty go hand in hand? Plos One 15(10): je0241232 (2020). 10. 10.Chen, Y., Zhou, X., Bullard, K.M., Zhang, P., Imperatore, G. & Rolka, D.B. Income-related inequalities in diagnosed diabetes prevalence among US adults, 2001-2018. Plos One 18: e0283450 (2023). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=37053158&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 11. 11.Sidahmed, S., Geyer, S. & Beller, J. Socioeconomic inequalities in diabetes prevalence: the case of South Africa between 2003 and 2016. BMC Public Health 23:324 (2023). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=36788553&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 12. 12.Chetty, R., et al. The association between income and life expectancy in the United States, 2001-2014. JAMA 315(16):1750-1766 (2016). Erratum in: *JAMA* 317(1):90 (2017). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.2016.4226&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27063997&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 13. 13.Ganasegeran, K. et al. How socio-economic inequalities cluster people with diabetes in Malaysia: geographic evaluation of area disparities using a non-parameterized unsupervised learning method. J. Epidemiol. Glob. Health 14: 169–183 (2024). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=38315406&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 14. 14.Hipp, J.A. & Chalise, N. Spatial analysis and correlates of county-level diabetes prevalence, 2009-2010. Prev. Chronic Dis. 12: E08 (2015). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25611797&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 15. 15.Tamayo, A. et al. Police-recorded crime and perceived stress among patients with type 2 diabetes: the Diabetes Study of Northern California (DISTANCE). J. Urban Health 93(5): 745–757 (2016). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27613180&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 16. 16.Hanigan, M., Heisler, M. & Choi, H. Relationship between county-level crime and diabetes: mediating effect of physical inactivity. Prev. Med. Rep. 20:101220 (2020). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33088677&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 17. 17.Marshall, S.J., Jones, D.A., Ainsworth, B.E., Reis, J.P., Levy, S.S. & Macera, C.A. Race/ethnicity, social class, and leisure-time physical inactivity. Med. Sci. Sports Exerc. 39(1):44–51 (2007). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1249/01.mss.0000239401.16381.37&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17218883&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000243574800009&link_type=ISI) 18. 18.Bennett, G.G., McNeill, L.H., Wolin, K.Y., Duncan, D.T., Puleo, E. & Emmons, K.M. Safe to walk? Neighborhood safety and physical activity among public housing residents. Plos. Med. 4(10):1599–1607 (2007). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17958465&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000251113600010&link_type=ISI) 19. 19.Patterson, P.D., Moore, C.G., Probst, J.C. & Shinogle, J.A. Obesity and physical inactivity in rural America. J. Rural Health 20(2):151–159 (2004). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1748-0361.2004.tb00022.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15085629&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000232015800008&link_type=ISI) 20. 20.Shrestha, S. & Spatial, S. Clusters of county-level diagnosed diabetes and associated risk factors in the United States. Open Diabetes J. 5: 29–37 (2012). 21. 21.International Diabetes Federation. Diabetes in the Western Pacific – 2021. IDF Diabetes Atlas [https://www.idf.org/our-network/regions-members/western-pacific/members/western-pacific-members.html](https://www.idf.org/our-network/regions-members/western-pacific/members/western-pacific-members.html) (2021). 22. 22.Institute for Public Health. National Health and Morbidity Survey (NHMS) 2019: Non-Communicable Diseases, Healthcare Demand and Health Literacy—Key Findings. Ministry of Health: Setia Alam, Malaysia 2020. 23. 23.Cox, M., Boyle, P.J., Davey, P.G., Feng, Z. & Morris, A.D. Locality deprivation and type 2 diabetes incidence: a local test of relative inequalities. Soc. Sci. Med. 65(9): 1953–1964 (2007). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.socscimed.2007.05.043&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17719709&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000250640900011&link_type=ISI) 24. 24.Agardh, E., Allebeck, P., Hallqvist, J., Moradi, T. & Sidorchuk, A. Type 2 diabetes incidence and socio-economic position: a systematic review and meta-analysis. Int. J. Epidemiol. 40(3): 804–818 (2011). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ije/dyr029&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21335614&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000293618300033&link_type=ISI) 25. 25.Laraia, B.A., Karter, A.J., Warton, E.M., Schillinger, D., Moffet, H.H., & Adler, N. Place matters: neighborhood deprivation and cardiometabolic risk factors in the Diabetes Study of Northern California (DISTANCE). Soc. Sci. Med. 74(7):1082–1090 (2012). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.socscimed.2011.11.036&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22373821&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 26. 26.Scobie, I.N. & Samaras, K. Fast facts: diabetes mellitus. Fifth ed. Abingdon, Oxford: Health Press (2014). 27. 27.Cities Changing Diabetes (CCD). Urban diabetes understanding the challenges and opportunities [http://www.citieschangingdiabetes.com](http://www.citieschangingdiabetes.com) (2015). 28. 28.Rayner, M., Wickramasinghe, K., Williams, J., McColl, K. & Mendis, S. An Introduction to Population-Level Prevention of Non-Communicable Diseases. United Kingdom: Oxford University Press (2017). 29. 29.Ministry of Health Malaysia. National Diabetes Registry System [http://ndr.moh.gov.my/account/login?return=/dashboard/index.php](http://ndr.moh.gov.my/account/login?return=/dashboard/index.php) (2023). 30. 30.United Nations Office for Coordination of Humanitarian Affairs. Administrative Shapefiles Malaysia [www.un.org/en/our-work/deliver-humanitarian-aid](https://www.un.org/en/our-work/deliver-humanitarian-aid) (2021). 31. 31.Department of Statistics Malaysia. Malaysia Population Census (My Census) (2020). 32. 32.Ministry of Health Malaysia. Malaysian Clinical Practice Guidelines Diabetes. Management of Type 2 Diabetes Mellitus (6th Edition). [https://www.moh.gov.my/moh/resources/Penerbitan/CPG/Endocrine/CPG\_T2DM\_6th\_Edition\_2020\_13042021.pdf](https://www.moh.gov.my/moh/resources/Penerbitan/CPG/Endocrine/CPG\_T2DM\_6th_Edition_2020_13042021.pdf) (2020). 33. 33.Ministry of Health Malaysia. National Diabetes Registry Report. [https://www.moh.gov.my/index.php/pages/view/1905](https://www.moh.gov.my/index.php/pages/view/1905) (2020). 34. 34.Rosenbaum, P.R. & Rubin, D.B. Difficulties with regression analyses of age-adjusted rates. Biometrics 40(2): 437–443 (1984). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2307/2531396&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=6487727&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1984TL29400015&link_type=ISI) 35. 35.Pham, T.M., Carpenter, J.R., Morris, T.P., Sharma, M. & Petersen, I. Ethnic differences in the prevalence of type 2 diabetes diagnoses in the UK: Cross-sectional analysis of the Health Improvement Network Primary Care Database. Clin. Epidemiol. 11:1081–1088 (2019). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2147/CLEP.S227621&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 36. 36.Pradeepa, R. & Mohan, V. Epidemiology of type 2 diabetes in India. Indian J. Ophthalmol. 69: 2932–2938 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access\_num=10.4103/ijo.IJO_1627_21&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=34708726&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 37. 37.Bai, A., Tao, J., Tao, L. & Liu, J. Prevalence and risk factors of diabetes among adults aged 45 years or older in China: A national cross-sectional study. *Endocrinol*. Diabetes Metab. 24: e00265 (2021). 38. 38.Kautzky-Willer, A., Leutner, M. & Harreiter, J. Sex differences in type 2 diabetes. Diabetologia 66:986–1002 (2023). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00125-023-05891-x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=36897358&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 39. 39.Yan, Z., Cai, M., Han, X., Chen, Q. & Lu, H. The interaction between age and risk factors for diabetes and prediabetes: a community-based cross-sectional study. Diabetes Metab. Syndr. Obes. 16:85–93 (2023). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=36760587&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 40. 40.The World Bank. Indicators for Malaysia [https://www.worldbank.org/en/home](https://www.worldbank.org/en/home) (2022). 41. 41.Haveman, R. & Mullikin, M. 2001. Alternative Measures of National Poverty: Perspectives and Assessment. In Ethics, Poverty and Inequality and Reform in Social Security. ed. Erik Schokkaert. London: Ashgate Publishing Ltd (2001). 42. 42.Gerell, M. CCTV in deprived neighbourhoods – a short-time follow-up of effects on crime and crime clearance. Nord. J. Criminol. 22:221–39 (2021). 43. 43.Gorman, D.M., Gruenewald, P.J. & Waller, L.A. Linking places to problems: geospatial theories of neighborhoods, alcohol and crime. GeoJournal 78: 417–428 (2013). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23750067&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 44. 44.Duncan, D.T., Palamar, J.J. & Williams, J.H. Perceived neighborhood illicit drug selling, peer illicit drug disapproval and illicit drug use among U.S. high school seniors. Subst. Abuse Treat. Prev. Policy 9:35 (2014). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25182042&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 45. 45.Gomes, C.S. et al. Spatial analysis of leisure-time physical activity in an urban area. Rev. Bras. Epidemiol. 24(suppl 1): e210012 (2021). 46. 46.Bale, J. Sports geography. Routledge (2002). 47. 47.Hoekman, R., Breedveld, K. & Kraaykamp, G. A landscape of sport facilities in the Netherlands. International Journal of Sport Policy and Politics 8:305–320 (2016). 48. 48.Wang, J., Li, J. & Cheng, J. Spatial disparity of sports infrastructure development and urbanization determinants in China: Evidence from the sixth national sports venues census. Applied Spatial Analysis and Policy 17:573–598 (2024). 49. 49.Cereijo, L. et al. Access to and availability of exercise facilities in Madrid: an equity perspective. Int. J. Health Geogr. 18(1):15 (2019). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31266518&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 50. 50.Vehkakoski, K. & Norra, J. The situation of local sports facilities in Finland. In Neighborhood Sport Facility. Eds. Suomi, K. & Kotthaus, D. LIKES Research Reports on Physical Activity and Health 327. Grano Oy Jyvaskyla: Finland (2017). 51. 51.Gebreab, S.Y. Statistical methods in spatial epidemiology. In: Duncan DT and Kawachi I (Eds); Neighborhoods and Health. New York, Oxford University Press (2018). 52. 52.Waller, L.A. & Gotway, C.A. Applied spatial statistics for public health data. Hoboken, NJ: Wiley (2019). 53. 53.Chi, G. & Zhu, J. Models dealing with both spatial dependence and spatial heterogeneity. In Spatial Regression Models for the Social Sciences pp. 139–154. SAGE Publications, Inc., doi:10.4135/9781544302096 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.4135/9781544302096&link_type=DOI) 54. 54.Harrel, F.E. Regression modelling strategies, with applications to linear models, logistic and ordinal regression, and survival analysis. Second edition. New York: Springer (2015). 55. 55.Park, S., Zachary, W.W., Gittelsohn, J., Quinn, C.C. & Surkan, P.J. Neighborhood influences on physical activity among low-income African American adults with type 2 diabetes mellitus. Diabetes Educ. 46(2):181–190 (2020). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32100614&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 56. 56.Al-Hanawi, M.K., Chirwa, G.C. & Pulok, M.H. Socio-economic inequalities in diabetes prevalence in the Kingdom of Saudi Arabia. Int. J. Health Plann. Manage. 35:233–246 (2020). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31460681&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 57. 57.Walsan, R., Feng, X., Mayne, D.J., Pai, N. & Bonney, A. Neighborhood environment and type 2 diabetes comorbidity in serious mental illness. J. Prim. Care Community Health 11:2150132720924989 (2020). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32450744&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 58. 58.Khan, J.R., Sultana, A., Islam, M.M. & Biswas, R.K. A negative association between prevalence of diabetes and urban residential area greenness detected in nationwide assessment of urban Bangladesh. Sci. Rep. 11(1):19513 (2021). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=34593885&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 59. 59.Gaspar, R.S., Rossi, L., Hone, T. & Dornelles, A.Z. Income inequality and non-communicable disease mortality and morbidity in Brazil States: a longitudinal analysis 2002-2017. Lancet Reg. Health Am. 2:100042 (2021). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=36779037&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 60. 60.Oliveira, F.L.P. et al. Spatial clusters of diabetes: individual and neighborhood characteristics in the ELSA-Brasil cohort study. Cad. Saude Publica 39(5): e00138822 (2023). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=37162112&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 61. 61.Menke, A., Casagrande, S., Geiss, L. & Cowie, C.C. Prevalence of and trends in diabetes among adults in the United States, 1988-2012. JAMA 314(10):1021-1029 (2015). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.2015.10029&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26348752&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 62. 62.Mueller, G. & Berger, K. The influence of neighbourhood deprivation on the prevalence of diabetes in 25- to 74-year-old individuals: first results from the Dortmund Health Study. Diabet. Med. 29:831–833 (2012). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22132907&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 63. 63.Muller, G. et al. Regional and neighborhood disparities in the odds of type 2 diabetes: results from 5 population-based studies in Germany (DIAB-CORE consortium). Am. J. Epidemiol. 178(2): 221–230 (2013). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/aje/kws466&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23648804&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 64. 64.Garcia, L., Lee, A., Zeki Al Hazzouri, A., Neuhaus, J., Epstein, M. & Haan, M. The impact of neighborhood socioeconomic position on prevalence of diabetes and prediabetes in older Latinos: The Sacramento Area Latino Study on Aging. Hisp. Health Care Int. 13(2):77–85 (2015). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26078026&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 65. 65.Rachele, J.N., Kavanagh, A.M., Brown, W.J., Healy, A.M. & Turrell, G. Neighborhood disadvantage and body mass index: a study of residential relocation. Am. J. Epidemiol. 187(8):1696–1703 (2018). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29351569&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 66. 66.Lord, J., Roberson, S. & Odoi, A. Investigation of geographic disparities of pre-diabetes and diabetes in Florida. BMC Public Health 20:1226 (2020). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32787830&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 67. 67.Krishnamoorthy, Y., Rajaa, S., Verma, M., Kakkar, R. & Kalra, S. Spatial patterns and determinants of diabetes mellitus in Indian adult population: a secondary data analysis from nationally representative surveys. Diabetes Ther. 14(1):63–75 (2023). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=36329233&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 68. 68.Cohen, S.A., Greaney, M.L. & Klassen, A.C. A "Swiss paradox" in the United States? Level of spatial aggregation changes the association between income inequality and morbidity for older Americans. Int. J. Health Geogr. 18(1):28 (2019). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31775750&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 69. 69.Ting, S., Zang, W., Chen, C. & Chen, D. Income distribution and health: What do we know from Chinese data? Plos One 17(1): e0263008 (2022). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=35073367&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) 70. 70.Kawachi, I., Kennedy, B.P., Lochner, K., & Prothrow-Stith, D. Social capital, income inequality, and mortality. Am. J. Public Health 87(9):1491–1498 (1997). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2105/AJPH.87.9.1491&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=9314802&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F27%2F2024.10.26.24316183.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1997XY23500016&link_type=ISI) 71. 71.Lynch, J.W., Smith, G.D., Kaplan, G.A. & House, J.S. Income inequality and mortality: importance to health of individual income, psychosocial environment, or material conditions. BMJ 320(7243):1200-1204 (2000). [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiRlVMTCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiYm1qIjtzOjU6InJlc2lkIjtzOjEzOiIzMjAvNzI0My8xMjAwIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMTAvMjcvMjAyNC4xMC4yNi4yNDMxNjE4My5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 72. 72.Marmot, M. & Wilkinson, R.G. Psychosocial and material pathways in the relation between income and health: a response to Lynch et al. BMJ 322:1233–1236 (2001). [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiRlVMTCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiYm1qIjtzOjU6InJlc2lkIjtzOjEzOiIzMjIvNzI5Ni8xMjMzIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMTAvMjcvMjAyNC4xMC4yNi4yNDMxNjE4My5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 73. 73.Anselin, L. Spatial econometrics: methods and models. Dordrecht: Kluwer Academic (1988). 74. 74.Anselin, L. Under the hood: Issues in the specification and interpretation of spatial regression models. Agricultural Economics 27: 247–267 (2002). 75. 75.Anselin, L. Spatial externalities, spatial multipliers, and spatial econometrics. International Regional Science Review 26(2), 153–166 (2003). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1177/0160017602250972&link_type=DOI) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000181958400002&link_type=ISI) 76. 76.Anselin, L. Spatial regression. In A.S. Fotheringham & P.A. Rogerson (Eds.), The SAGE Handbook of Spatial Analysis pp. 254-270. London: SAGE Publications (2009). [1]: /embed/graphic-1.gif [2]: /embed/inline-graphic-1.gif [3]: /embed/graphic-2.gif [4]: /embed/graphic-3.gif [5]: /embed/graphic-4.gif [6]: /embed/graphic-5.gif