-
Background:
The process of geocoding produces output coordinates of varying degrees of quality. Previous studies have revealed that simply excluding records with low-quality geocodes from analysis can introduce significant bias, but depending on the number and severity of the inaccuracies, their inclusion may also lead to bias. Little quantitative research has been presented on the cost and/or effectiveness of correcting geocodes through manual interactive processes, so the most cost effective methods for improving geocoded data are unclear. The present work investigates the time and effort required to correct geocodes contained in five health-related datasets that represent examples of data commonly used in Health GIS.
Results:
Geocode correction was attempted on five health-related datasets containing a total of 22,317 records. The complete processing of these data took 11.4 weeks (427 hours), averaging 69 seconds of processing time per record. Overall, the geocodes associated with 12,280 (55%) of records were successfully improved, taking 95 seconds of processing time per corrected record on average across all five datasets. Geocode correction improved the overall match rate (the number of successful matches out of the total attempted) from 79.3 to 95%. The spatial shift between the location of original successfully matched geocodes and their corrected improved counterparts averaged 9.9 km per corrected record. After geocode correction the number of city and USPS ZIP code accuracy geocodes were reduced from 10,959 and 1,031 to 6,284 and 200, respectively, while the number of building centroid accuracy geocodes increased from 0 to 2,261.
Conclusions:
The results indicate that manual geocode correction using a web-based interactive approach is a feasible and cost effective method for improving the quality of geocoded data. The level of effort required varies depending on the type of data geocoded. These results can be used to choose between data improvement options (e.g., manual intervention, pseudocoding/geo-imputation, field GPS readings).
-
Background:
An increasing number of studies suggest that characteristics of context, or the attributes of the places within which we live, work and socialize, are associated with variations in health-related behaviours and outcomes. The challenge for health research is to ensure that these places are accurately represented spatially, and to identify those aspects of context that are related to variations in health and amenable to modification. This study focuses on the design of a wearable global positioning system (GPS) data logger for the purpose of objectively measuring the temporal and spatial features of human activities. Person-specific GPS data provides a useful source of information to operationalize the concept of place.
Results:
We designed and tested a lightweight, wearable GPS receiver, capable of logging location information for up to 70 hours continuously before recharging. The device is accurate to within 7 m in typical urban environments and performs well across a range of static and dynamic conditions.DiscussionRather than rely on static areal units as proxies for places, wearable GPS devices can be used to derive a more complete picture of the different places that influence an individual's wellbeing. The measures are objective and are less subject to biases associated with recall of location or misclassification of contextual attributes. This is important for two reasons. First, it brings a dynamic perspective to place and health research. The influence of place on health is dynamic in that certain places are more or less relevant to wellbeing as determined by the length of time in any location and by the frequency of activity in the location. Second, GPS data can be used to assess whether the characteristics of places at specific times are useful to explaining variations in health and wellbeing.
-
Background:
Rural-urban disparities in health and healthcare are often attributed to differences in geographic access to care and health seeking behavior. Less is known about the differences between rural locations in health care seeking and outcomes. This study examines how commuting patterns in different rural areas are associated with perforated appendicitis.
Results:
Controlling for age, sex, insurance type, comorbid conditions, socioeconomic status, appendectomy rates, hospital type, and hospital location, we found that patient residence in a rural ZIP code with significant levels of commuting to metropolitan areas was associated with higher risk of perforation compared to residence in rural areas with commuting to smaller urban clusters. The former group was more likely to seek care in an urbanized area, and was more likely to receive care in a Children's Hospital.
Conclusion:
To our knowledge, this is the first study to differentiate rural dwellers with respect to outcomes associated with appendicitis as opposed to simply comparing "rural" to "urban". Risk of perforated appendicitis associated with commuting patterns is larger than that posed by several individual indicators including some age-sex cohort effects. Future studies linking the activity spaces of rural dwellers to individual patterns of seeking care will further our understanding of perforated appendicitis and ambulatory care sensitive conditions in general.
-
Background:
A method that assesses bacterial spread based on spatially explicit (latitude and longitude) and non-spatially explicit data was explored. It measures microbial genotypes (defined by electrophoretic patterns or EP), host, location (farm), interfarm Euclidean distance, and time. Its proof of concept (construct and internal validity) was evaluated using a dataset that included 113 Staphylococcus aureus EPs isolated from 1126 bovine milk samples (collected in 23 farms between 1998 and 2005).
Results:
Construct validity was assessed by comparing results between the interfarm Euclidean distance (a spatially explicit measure) and the (non-spatial) interfarm number of isolates/EP (a measure assumed to be the gold standard of this evaluation). Internal validity was estimated by comparing results obtained with different versions of the same indices (which accounted for various potential sources of bias). The distance associated with EP spread correlated with the interfarm number of isolates/EP (r=.59, P=0.01). Concordance was observed between: (a) EP distance (estimated microbial dispersal over time) and EP speed (distance/year, r=.72, P=0.002), and (b) the interfarm number of isolates/EP (when measured on the basis of non-repeated cow testing) and the same measure when based on repeated tests (r=.88, P
-
Background:
Kulldorff's spatial scan statistic and its software implementation - SaTScan - are widely used for detecting and evaluating geographic clusters. However, two issues make using the method and interpreting its results non-trivial: (1) the method lacks cartographic support for understanding the clusters in geographic context and (2) results from the method are sensitive to parameter choices related to cluster scaling (abbreviated as scaling parameters), but the method provides no direct support for making these choices. We employ both established and novel geovisual analytics methods to address these issues and to enhance the interpretation of SaTScan results. We demonstrate our geovisual analytics approach in a case study analysis of cervical cancer mortality in the U.S.
Results:
We address the first issue by providing an interactive visual interface to support the interpretation of SaTScan results. Our research to address the second issue prompted a broader discussion about the sensitivity of SaTScan results to parameter choices. Sensitivity has two components: (1) the method can identify clusters that, while being statistically significant, have heterogeneous contents that are composed of both high-risk and low-risk locations and (2) the method can identify clusters that are unstable in location and size as the spatial scan scaling parameter is varied. To investigate cluster result stability, we conducted multiple SaTScan runs with systematically selected parameters. The results, when scanning a large spatial dataset (e.g., U.S. data aggregated by county), demonstrate that no single spatial scan scaling value is known to be optimal to identify clusters that exist at different scales; instead, multiple scans that vary the parameters are necessary. We introduce a novel method of measuring and visualizing reliability that facilitates identification of homogeneous clusters that are stable across analysis scales. Finally, we propose a logical approach to proceed through the analysis of SaTScan results.
Conclusions:
The geovisual analytics approach described in this paper facilitates the interpretation of spatial cluster detection methods by providing cartographic representation of SaTScan results and by providing visualization methods and tools that support selection of SaTScan parameters. Our methods distinguish between heterogeneous and homogeneous clusters and assess the stability of clusters across analytic scales.MethodWe analyzed the cervical cancer mortality data for the United States aggregated by county between 2000 and 2004. We ran SaTScan on the dataset fifty times with different parameter choices. Our geovisual analytics approach couples SaTScan with our visual analytic platform, allowing users to interactively explore and compare SaTScan results produced by different parameter choices. The Standardized Mortality Ratio and reliability scores are visualized for all the counties to identify stable, homogeneous clusters. We evaluated our analysis result by comparing it to that produced by other independent techniques including the Empirical Bayes Smoothing and Kafadar spatial smoother methods. The geovisual analytics approach introduced here is developed and implemented in our Java-based Visual Inquiry Toolkit.
-
Background:
Rural-urban disparities in health and healthcare are often attributed to differences in geographic access to care and health seeking behavior. Less is known about the differences between rural locations in health care seeking and outcomes. This study examines how commuting patterns in diverse rural areas are associated with differential experience of perforated appendicitis.
Results:
Controlling for age, sex, insurance type, comorbid conditions, socioeconomic status, appendectomy rates, hospital type, and hospital location, we found that patient residence in a rural ZIP code with significant levels of commuting to metropolitan areas was associated with higher risk of perforation compared to residence in rural areas with commuting to smaller urban clusters. The former group was more likely to seek care in an urbanized area, and was more likely to receive care in a children's hospital.
Conclusion:
To our knowledge, this is the first study to differentiate rural dwellers with respect to outcomes associated with appendicitis as opposed to simply comparing "rural" to "urban". Risk of perforated appendicitis associated with commuting patterns is larger than that posed by several individual indicators including some age-sex cohort effects. Future studies linking the activity spaces of rural dwellers to individual patterns of seeking care will further our understanding of perforated appendicitis and ambulatory care sensitive conditions in general.
-
Background:
Malaria constitutes a major cause of morbidity in the Brazilian Amazon where an estimated 6 million people are considered at high risk of transmission. Indigenous peoples in the Amazon are particularly vulnerable to potentially epidemic disease such as malaria; notwithstanding, very little is known about the epidemiology of malaria in Indian reservations of the region. The aim of this paper is to present a spatial analysis of malaria cases over a four-year time period (2003–2006) among indigenous peoples of the Brazilian State of Rondônia, southwestern Amazon, by using passive morbidity data (results from Giemsa-stained thick blood smears) gathered from the National Malaria Epidemiologic Surveillance System databank.
Results:
A total of 4,160 cases of malaria were recorded in 14 Indian reserves in the State of Rondônia between 2003 and 2006. In six reservations no cases of malaria were reported in the period. Overall, P. vivax accounted for 76.18 of malaria cases reported in the indigenous population of Rondônia. The P. vivax/P. falciparum ratio for the period was 3.78. Two reserves accounted for over half of the cases reported for the total indigenous population in the period – Roosevelt and Pacaas Novas – with a total of 1,646 (39.57%) and 1,145 (27.52%) cases, respectively. Kernel mapping of malaria mean Annual Parasite Index – API according to indigenous reserves and environmental zones revealed a heterogeneous pattern of disease distribution, with one clear area of high risk of transmission comprising reservations of west Rondônia along the Guaporé-Madeira River basins, and another high risk area to the east, on the Roosevelt reserve.
Conclusion:
By means of kernel mapping, it was shown that malaria risk varies widely between Indian reserves and environmental zones defined on the basis of predominant ecologic characteristics and land use patterns observed in the southwestern Brazilian Amazon. The geographical approach in this paper helped to determine where the greatest needs lie for more intensively focused malaria control activities in Indian reserves in the region. It also provided a reference to assess the effectiveness of control measures that have been put in place by Brazilian public health authorities.
-
Background:
Plague is a rapidly progressing, serious illness in humans that is likely to be fatal if not treated. It remains a public health threat, especially in sub-Saharan Africa. In spite of plague's highly focal nature, a thorough ecological understanding of the general distribution pattern of plague across sub-Saharan Africa has not been established to date. In this study, we used human plague data from sub-Saharan Africa for 1970–2007 in an ecological niche modeling framework to explore the potential geographic distribution of plague and its ecological requirements across Africa.
Results:
We predict a broad potential distributional area of plague occurrences across sub-Saharan Africa. General tests of model's transferability suggest that our model can anticipate the potential distribution of plague occurrences in Madagascar and northern Africa. However, generality and predictive ability tests using regional subsets of occurrence points demonstrate the models to be unable to predict independent occurrence points outside the training region accurately. Visualizations show plague to occur in diverse landscapes under wide ranges of environmental conditions.
Conclusion:
We conclude that the typical focality of plague, observed in sub-Saharan Africa, is not related to fragmented and insular environmental conditions manifested at a coarse continental scale. However, our approach provides a foundation for testing hypotheses concerning focal distribution areas of plague and their links with historical and environmental factors.
-
Background:
Reliable access to basic services can improve a community's resilience to HIV/AIDS. Accordingly, work is being done to upgrade the physical infrastructure in affected areas, often employing a strategy of decentralised service provision. Spatial characteristics are one of the major determinants in implementing services, even in the smaller municipal areas, and good quality spatial information is needed to inform decision making processes. However, limited funds, technical infrastructure and human resource capacity result in little or no access to spatial information for crucial infrastructure development decisions at local level.This research investigated whether it would be possible to develop a GIS for basic infrastructure planning and management at local level. Given the resource constraints of the local government context, particularly in small municipalities, it was decided that open source software should be used for the prototype system.
Results:
The design and development of a prototype system illustrated that it is possible to develop an open source GIS system that can be used within the context of local information management. Usability tests show a high degree of usability for the system, which is important considering the heavy workload and high staff turnover that characterises local government in South Africa. Local infrastructure management stakeholders interviewed in a case study of a South African municipality see the potential for the use of GIS as a communication tool and are generally positive about the use of GIS for these purposes. They note security issues that may arise through the sharing of information, lack of skills and resource constraints as the major barriers to adoption.
Conclusion:
The case study shows that spatial information is an identified need at local level. Open source GIS software can be used to develop a system to provide local-level stakeholders with spatial information. However, the suitability of the technology is only a part of the system – there are wider information and management issues which need to be addressed before the implementation of a local-level GIS for infrastructure management can be successful.
-
Background:
Climate may exert a strong influence on health, in particular on vector-borne infectious diseases whose vectors are intrinsically dependent on their environment. Although critical, linking climate variability to health outcomes is a difficult task. For some diseases in some areas, spatially and temporally explicit surveillance data are available, but comparable climate data usually are not. We utilize spatial models and limited weather observations in Puerto Rico to predict weather throughout the island on a scale compatible with the local dengue surveillance system.
Results:
We predicted monthly mean maximum temperature, mean minimum temperature, and cumulative precipitation at a resolution of 1,000 meters. Average root mean squared error in cross-validation was 1.24°C for maximum temperature, 1.69°C for minimum temperature, and 62.2 millimeters for precipitation.
Conclusion:
We present a methodology for efficient extrapolation of minimal weather observation data to a more meaningful geographical scale. This analysis will feed downstream studies of climatic effects on dengue transmission in Puerto Rico. Additionally, we utilize conditional simulation so that model error may be robustly passed to future analyses.
-
Background:
With limited resources available, injury prevention efforts need to be targeted both geographically and to specific populations. As part of a pediatric injury prevention project, data was obtained on all pediatric medical and injury incidents in a fire district to evaluate geographical clustering of pediatric injuries. This will be the first step in attempting to prevent these injuries with specific interventions depending on locations and mechanisms.
Results:
There were a total of 4803 incidents involving patients less than 15 years of age that the fire district responded to during 2001–2005 of which 1997 were categorized as injuries and 2806 as medical calls. The two cohorts (injured versus medical) differed in age distribution (7.7 ± 4.4 years versus 5.4 ± 4.8 years, p
-
Background:
A sharp rise in the malaria mortality rate has been observed recently in western Kenya. Malaria is transmitted by mosquito vectors. Malaria control strategies can be more successful if the distribution and abundance of mosquito vectors is predicted. However, how mosquito vectors are distributed in space remain poor understood, and this question is rarely studied using spatial methods. This study aims to provide a better understanding of the distribution and abundance of mosquito vectors. To achieve this objective, spatial and non-spatial methods were employed. The data on the distribution of adult mosquitoes, and mosquito breeding habitats in a study area in western Kenya, and environmental variables were analyzed.
Results:
The models developed using spatial methods outperformed the models developed using non-spatial methods. Houses close to locations where mosquito breeding habitats were repeatedly observed had more abundant adult female mosquitoes. Distance to high-order streams was identified as an effective predictor for the distribution of adult mosquitoes.
Conclusion:
The spatial method is more effective in modeling the distribution of adult mosquitoes than the non-spatial method. The results of this study can be used to facilitate decision-making related to mosquito surveillance and malaria prevention.
-
Background:
The creation of successful health policy and location of resources increasingly relies on evidence-based decision-making. The development of intuitive, accessible tools to analyse, display and disseminate spatial data potentially provides the basis for sound policy and resource allocation decisions. As health services are rationalized, the development of tools such graphical user interfaces (GUIs) is especially valuable at they assist decision makers in allocating resources such that the maximum number of people are served. GIS can used to develop GUIs that enable spatial decision making.
Results:
We have created a Web-based GUI (wGUI) to assist health policy makers and administrators in the Canadian province of British Columbia make well-informed decisions about the location and allocation of time-sensitive service capacities in rural regions of the province. This tool integrates datasets for existing hospitals and services, regional populations and road networks to allow users to ascertain the percentage of population in any given service catchment who are served by a specific health service, or baskets of linked services. The wGUI allows policy makers to map trauma and obstetric services against rural populations within pre-specified travel distances, illustrating service capacity by region.
Conclusion:
The wGUI can be used by health policy makers and administrators with little or no formal GIS training to visualize multiple health resource allocation scenarios. The GUI is poised to become a critical decision-making tool especially as evidence is increasingly required for distribution of health services.
-
Background:
In health and place research, definitions of areas, area characteristics, and health outcomes should ideally be coherent with one another. Yet current approaches for delimiting areas mostly rely on spatial units "of convenience" such as census tracts. These areas may be homogeneous along socioeconomic conditions but heterogeneous along other environmental characteristics. This heterogeneity can lead to biased measurement of environment characteristics and misestimation of area effects on health. The objective of this study was to assess the soundness of census tracts as units of analysis for measuring the active living potential of environments, hypothesised to be associated with walking.
Results:
Starting with data at the smallest census area level available, zones homogeneous along three indicators of active living potential, i.e. population density, land use mix, and accessibility to services were designed. Delimitation of zones ensued from statistical clustering of the smallest areas into seven clusters or "types of environment". Mapping of clusters into a GIS led to the delineation of 898 zones characterised by one of seven types of environment, corresponding to different levels of active living potential. Homogeneity of census tracts along indicators of active living potential varied. A greater proportion (83%) of variation in accessibility to services was attributable to differences between census tracts suggesting within-tract homogeneity along this variable. However, census tracts were heterogeneous with respect to population density and land use mix where a greater proportion of the variation was attributable to within-tract differences. About 55% of tracts were characterised by a combination of three or more "types of environment" suggesting substantial within-tract heterogeneity in the active living potential of environments.
Conclusion:
Soundness of census tracts for measuring active living potential may be limited. Measuring active living potential with error may lead to misestimation of associations with walking, therefore limiting the correctness of inference about area effects on walking. Future studies should aim to determine homogeneity of spatial units "of convenience" along environment characteristics of interest prior to examining their association with health. Further evidence is needed to assess the extent of this methodological issue with other indicators of environment context relevant to other health indicators.
-
Background:
Cholera has claimed many lives throughout history and it continues to be a global threat, especially in countries in Africa. The disease is listed as one of three internationally quarantinable diseases by the World Health organization, along with plague and yellow fever. Between 1999 and 2005, Africa alone accounted for about 90% of over 1 million reported cholera cases worldwide. In Ghana, there have been over 27000 reported cases since 1999. In one of the affected regions in Ghana, Ashanti region, massive outbreaks and high incidences of cholera have predominated in urban and overcrowded communities.
Results:
A GIS based spatial analysis and statistical analysis, carried out to determine clustering of cholera, showed that high cholera rates are clustered around Kumasi Metropolis (the central part of the region), with Moran's Index = 0.271 and P
-
Background:
Historical, social and economic reasons can lead to major differences in the allocation of health system resources and research funding. These differences might endanger the progress in diagnostic and therapeutic approaches of socio-economic important diseases. The present study aimed to assess different benchmarking approaches that might be used to analyse these disproportions. Research in two categories was analysed for various output parameters and compared to input parameters. Germany was used as a high income model country. For the areas of cardiovascular and respiratory medicine density equalizing mapping procedures visualized major geographical differences in both input and output markers.
Results:
An imbalance in the state financial input was present with 36 cardiovascular versus 8 respiratory medicine state-financed full clinical university departments at the C4/W3 salary level. The imbalance in financial input is paralleled by an imbalance in overall quantitative output figures: The 36 cardiology chairs published 2708 articles in comparison to 453 articles published by the 8 respiratory medicine chairs in the period between 2002 and 2006. This is a ratio of 75.2 articles per cardiology chair and 56.63 articles per respiratory medicine chair. A similar trend is also present in the qualitative measures. Here, the 2708 cardiology publications were cited 48337 times (7290 times for respiratory medicine) which is an average citation of 17.85 per publication vs. 16.09 for respiratory medicine. The average number of citations per cardiology chair was 1342.69 in contrast to 911.25 citations per respiratory medicine chair. Further comparison of the contribution of the 16 different German states revealed major geographical differences concerning numbers of chairs, published items, total number of citations and average citations.
Conclusion:
Despite similar significances of cardiovascular and respiratory diseases for the global burden of disease, large input and output imbalances have been revealed in the present study which point to a need for changes in funding policies. The present study supplies data that could be used for decision making in the field of health systems funding.
-
Background:
An epidemic may exhibit different spatial patterns with a change in geographic scale, with each scale having different conduits and impediments to disease spread. Mapping disease at each of these scales often reveals different cluster patterns. This paper will consider this change of geographic scale in an analysis of yellow fever deaths for New Orleans in 1878. Global clustering for the whole city, will be followed by a focus on the French Quarter, then clusters of that area, and finally street-level patterns of a single cluster. The three-dimensional visualization capabilities of a GIS will be used as part of a cluster creation process that incorporates physical buildings in calculating mortality-to-mortality distance. Including nativity of the deceased will also capture cultural connection.
Results:
Twenty-two yellow fever clusters were identified for the French Quarter. These generally mirror the results of other global cluster and density surfaces created for the entire epidemic in New Orleans. However, the addition of building-distance, and disease specific time frame between deaths reveal that disease spread contains a cultural component. Same nativity mortality clusters emerge in a similar time frame irrespective of proximity. Italian nativity mortalities were far more densely grouped than any of the other cohorts. A final examination of mortalities for one of the nativity clusters reveals that further sub-division is present, and that this pattern would only be revealed at this scale (street level) of investigation.
Conclusion:
Disease spread in an epidemic is complex resulting from a combination of geographic distance, geographic distance with specific connection to the built environment, disease-specific time frame between deaths, impediments such as herd immunity, and social or cultural connection. This research has shown that the importance of cultural connection may be more important than simple proximity, which in turn might mean traditional quarantine measures should be re-evaluated.
-
Background:
Seasonal patterns in cardiac disease in the northern hemisphere are well described in the literature. More recently age and gender differences in cardiac mortality and to a lesser extent morbidity have been presented. To date spatial differences between the seasonal patterns of cardiac disease has not been presented. Literature relating to seasonal patterns in cardiac disease in the southern hemisphere and in Australia in particular is scarce. The aim of this paper is to describe the seasonal, age, gender, and spatial patterns of cardiac disease in Melbourne Australia by using acute myocardial infarction admissions to hospital as a marker of cardiac disease.
Results:
There were 33,165 Acute Myocardial Infarction (AMI) admissions over 2186 consecutive days. There is a seasonal pattern in AMI admissions with increased rates during the colder months. The peak month is July. The admissions rate is greater for males than for females, although this difference decreases with advancing age. The maximal AMI season for males extends from April to November. The difference between months of peak and minimum admissions was 33.7%. Increased female AMI admissions occur from May to November, with a variation between peak and minimum of 23.1%. Maps of seasonal AMI admissions demonstrate spatial differences. Analysis using Global and Local Moran's I showed increased spatial clustering during the warmer months. The Bivariate Moran's I statistic indicated a weaker relationship between AMI and age during the warmer months.
Conclusion:
There are two distinct seasons with increased admissions during the colder part of the year. Males present a stronger seasonal pattern than females. There are spatial differences in AMI admissions throughout the year that cannot be explained by the age structure of the population. The seasonal difference in AMI admissions warrants further investigation. This includes detailing the prevalence of cardiac disease in the community and examining issues of social and environmental justice.
-
IntroductionThe reasons for elevated breast cancer rates in the upper Cape Cod area of Massachusetts remain unknown despite several epidemiological studies that investigated possible environmental risk factors. Data from two of these population-based case-control studies provide geocoded residential histories and information on confounders, creating an invaluable dataset for spatial-temporal analysis of participants' residency over five decades.
Methods:
The combination of statistical modeling and mapping is a powerful tool for visualizing disease risk in a spatial-temporal analysis. Advances in geographic information systems (GIS) enable spatial analytic techniques in public health studies previously not feasible. Generalized additive models (GAMs) are an effective approach for modeling spatial and temporal distributions of data, combining a number of desirable features including smoothing of geographical location, residency duration, or calendar years; the ability to estimate odds ratios (ORs) while adjusting for confounders; selection of optimum degree of smoothing (span size); hypothesis testing; and use of standard software.We conducted a spatial-temporal analysis of breast cancer case-control data using GAMs and GIS to determine the association between participants' residential history during 1947–1993 and the risk of breast cancer diagnosis during 1983–1993. We considered geographic location alone in a two-dimensional space-only analysis. Calendar year, represented by the earliest year a participant lived in the study area, and residency duration in the study area were modeled individually in one-dimensional time-only analyses, and together in a two-dimensional time-only analysis. We also analyzed space and time together by applying a two-dimensional GAM for location to datasets of overlapping calendar years. The resulting series of maps created a movie which allowed us to visualize changes in magnitude, geographic size, and location of elevated breast cancer risk for the 40 years of residential history that was smoothed over space and time.
Results:
The space-only analysis showed statistically significant increased areas of breast cancer risk in the northern part of upper Cape Cod and decreased areas of breast cancer risk in the southern part (p-value = 0.04; ORs: 0.90–1.40). There was also a significant association between breast cancer risk and calendar year (p-value = 0.05; ORs: 0.53–1.38), with earlier calendar years resulting in higher risk. The results of the one-dimensional analysis of residency duration and the two-dimensional analysis of calendar year and duration showed that the risk of breast cancer increased with increasing residency duration, but results were not statistically significant. When we considered space and time together, the maps showed a large area of statistically significant elevated risk for breast cancer near the Massachusetts Military Reservation (p-value range:0.02–0.05; ORs range: 0.25–2.5). This increased risk began with residences in the late 1940s and remained consistent in size and location through the late 1950s.
Conclusion:
Spatial-temporal analysis of the breast cancer data may help identify new exposure hypotheses that warrant future epidemiologic investigations with detailed exposure models. Our methods allow us to visualize breast cancer risk, adjust for known confounders including age at diagnosis or index year, family history of breast cancer, parity and age at first live- or stillbirth, and test for the statistical significance of location and time. Despite the advantages of GAMs, analyses are for exploratory purposes and there are still methodological issues that warrant further research. This paper illustrates that GAM methods are a suitable alternative to widely-used cluster detection methods and may be preferable when residential histories from existing epidemiological studies are available.
-
Background:
Knowledge of the geographical locations of individuals is fundamental to the practice of spatial epidemiology. One approach to preserving the privacy of individual-level addresses in a data set is to de-identify the data using a non-deterministic blurring algorithm that shifts the geocoded values. We investigate a vulnerability in this approach which enables an adversary to re-identify individuals using multiple anonymized versions of the original data set. If several such versions are available, each can be used to incrementally refine estimates of the original geocoded location.
Results:
We produce multiple anonymized data sets using a single set of addresses and then progressively average the anonymized results related to each address, characterizing the steep decline in distance from the re-identified point to the original location, (and the reduction in privacy). With ten anonymized copies of an original data set, we find a substantial decrease in average distance from 0.7 km to 0.2 km between the estimated, re-identified address and the original address. With fifty anonymized copies of an original data set, we find a decrease in average distance from 0.7 km to 0.1 km.
Conclusion:
We demonstrate that multiple versions of the same data, each anonymized by non-deterministic Gaussian skew, can be used to ascertain original geographic locations. We explore solutions to this problem that include infrastructure to support the safe disclosure of anonymized medical data to prevent inference or re-identification of original address data, and the use of a Markov-process based algorithm to mitigate this risk.
-
Background:
Cholera has claimed many lives throughout history and it continues to be a global threat, especially in countries in Africa. The disease is listed as one of three internationally quarantinable diseases by the World Health organization, along with plague and yellow fever. Between 1999 and 2005, Africa alone accounted for about 90% of over 1 million reported cholera cases worldwide. In Ghana, there have been over 27000 reported cases since 1999. In one of the affected regions in Ghana, Ashanti region, massive outbreaks and high incidences of cholera have predominated in urban and overcrowded communities.
Results:
A GIS based spatial analysis and statistical analysis, carried out to determine clustering of cholera, showed that high cholera rates are clustered around Kumasi Metropolis ( the central part of the region) , with Moran's Index = 0.271 and P
-
Background:
Chronic exposure to traffic-related air pollution is associated with a variety of health impacts in adults and recent studies show that exposure varies spatially, with some residents in a community more exposed than others. A spatial exposure simulation model (SESM) which incorporates six microenvironments (home indoor, work indoor, other indoor, outdoor, in-vehicle to work and in-vehicle other) is described and used to explore spatial variability in estimates of exposure to traffic-related nitrogen dioxide (not including indoor sources) for working people. The study models spatial variability in estimated exposure aggregated at the census tracts level for 382 census tracts in the Greater Vancouver Regional District of British Columbia, Canada. Summary statistics relating to the distributions of the estimated exposures are compared visually through mapping. Observed variations are explored through analyses of model inputs.
Results:
Two sources of spatial variability in exposure to traffic-related nitrogen dioxide were identified. Median estimates of total exposure ranged from 8 μg/m3 to 35 μg/m3 of annual average hourly NO2 for workers in different census tracts in the study area. Exposure estimates are highest where ambient pollution levels are highest. This reflects the regional gradient of pollution in the study area and the relatively high percentage of time spent at home locations. However, for workers within the same census tract, variations were observed in the partial exposure estimates associated with time spent outside the residential census tract. Simulation modeling shows that some workers may have exposures 1.3 times higher than other workers residing in the same census tract because of time spent away from the residential census tract, and that time spent in work census tracts contributes most to the differences in exposure. Exposure estimates associated with the activity of commuting by vehicle to work were negligible, based on the relatively short amount of time spent in this microenvironment compared to other locations. We recognize that this may not be the case for pollutants other than NO2. These results represent the first time spatially disaggregated variations in exposure to traffic-related air pollution within a community have been estimated and reported.
Conclusion:
The results suggest that while time spent in the home indoor microenvironment contributes most to between-census tract variation in estimates of annual average exposures to traffic-related NO2, time spent in the work indoor microenvironment contributes most to within-census tract variation, and time spent in transit by vehicle makes a negligible contribution. The SESM has potential as a policy evaluation tool, given input data that reflect changes in pollution levels or work flow patterns due to traffic demand management and land use development policy.
-
Background:
Every year, West Africa is afflicted with Meningococcal Meningitis (MCM) disease outbreaks. Although the seasonal and spatial patterns of disease cases have been shown to be linked to climate, the mechanisms responsible for these patterns are still not well identified.
Results:
A statistical analysis of annual incidence of MCM and climatic variables has been performed to highlight the relationships between climate and MCM for two highly afflicted countries: Niger and Burkina Faso. We found that disease resurgence in Niger and in Burkina Faso is likely to be partly controlled by the winter climate through enhanced Harmattan winds. Statistical models based only on climate indexes work well in Niger showing that 25% of the disease variance from year-to-year in this country can be explained by the winter climate but fail to represent accurately the disease dynamics in Burkina Faso.
Conclusion:
This study is an exploratory attempt to predict meningitis incidence by using only climate information. Although it points out significant statistical results it also stresses the difficulty of relating climate to interannual variability in meningitis outbreaks.
-
Background:
In health and place research, definitions of areas, area characteristics, and health outcomes should ideally be coherent with one another. Yet current approaches for delimiting areas mostly rely on spatial units "of convenience" such as census tracts. These areas may be homogeneous along socioeconomic conditions but heterogeneous along other environmental characteristics. This heterogeneity can lead to biased measurement of environment characteristics and misestimation of area effects on health. The objective of this study was to assess the soundness of census tracts as units of analysis for measuring the active living potential of environments, hypothesised to be associated with walking.
Results:
Starting with data at the smallest census area level available, zones homogeneous along three indicators of active living potential, i.e. population density, land use mix, and accessibility to services, were designed. Delimitation of zones ensued from statistical clustering of the smallest areas into seven clusters or "types of environment". Mapping of clusters into a GIS led to the delineation of 898 zones characterised by one of seven types of environment, corresponding to different levels of active living potential. Homogeneity of census tracts along indicators of active living potential varied. A greater proportion (83%) of variation in accessibility to services was attributable to differences between census tracts suggesting within-tract homogeneity along this variable. However, census tracts were heterogeneous with respect to population density and land use mix where a greater proportion of the variation was attributable to within-tract differences. About 55% of tracts were characterised by a combination of three or more "types of environment" suggesting substantial within-tract heterogeneity in the active living potential of environments.
Conclusion:
Soundness of census tracts for measuring active living potential may be limited. Measuring active living potential with error may lead to misestimation of associations with walking, therefore limiting the correctness of inference about area effects on walking. Future studies should aim to determine homogeneity of spatial units "of convenience" along environment characteristics of interest prior to examining their association with health. Further evidence is needed to assess the extent of this methodological issue with other indicators of environment context relevant to other health indicators.
-
'Mashup' was originally used to describe the mixing together of musical tracks to create a new piece of music. The term now refers to Web sites or services that weave data from different sources into a new data source or service. Using a musical metaphor that builds on the origin of the word 'mashup', this paper presents a demonstration "playlist" of four geo-mashup vignettes that make use of a range of Web 2.0, Semantic Web, and 3-D Internet methods, with outputs/end-user interfaces spanning the flat Web (two-dimensional – 2-D maps), a three-dimensional – 3-D mirror world (Google Earth) and a 3-D virtual world (Second Life ®). The four geo-mashup "songs" in this "playlist" are: 'Web 2.0 and GIS (Geographic Information Systems) for infectious disease surveillance', 'Web 2.0 and GIS for molecular epidemiology', 'Semantic Web for GIS mashup', and 'From Yahoo! Pipes to 3-D, avatar-inhabited geo-mashups'. It is hoped that this showcase of examples and ideas, and the pointers we are providing to the many online tools that are freely available today for creating, sharing and reusing geo-mashups with minimal or no coding, will ultimately spark the imagination of many public health practitioners and stimulate them to start exploring the use of these methods and tools in their day-to-day practice. The paper also discusses how today's Web is rapidly evolving into a much more intensely immersive, mixed-reality and ubiquitous socio-experiential Metaverse that is heavily interconnected through various kinds of user-created mashups.
-
Background:
Seasonal patterns in cardiac disease in the northern hemisphere are well described in the literature. More recently age and gender differences in cardiac mortality and to a lesser extent morbidity have been presented. To date spatial differences between the seasonal patterns of cardiac disease has not been presented. Literature relating to seasonal patterns in cardiac disease in the southern hemisphere and in Australia in particular is scarce. The aim of this paper is to describe the seasonal, age, gender, and spatial patterns of cardiac disease in Melbourne Australia by using acute myocardial infarction admissions to hospital as a marker of cardiac disease.
Results:
There were 33,165 Acute Myocardial Infarction (AMI) admissions over 2186 consecutive days. There is a seasonal pattern in AMI admissions with increased rates during the colder months. The peak month is July. The admissions rate is greater for males than for females, although this difference decreases with advancing age. The maximal AMI season for males extends from April to November. The difference between months of peak and minimum admissions was 33.7%. Increased female AMI admissions occur from May to November, with a variation between peak and minimum of 23.1%. Maps of seasonal AMI admissions demonstrate spatial differences. Analysis using Global and Local Moran's I showed increased spatial clustering during the warmer months. The Bivariate Moran's I statistic indicated a weaker relationship between AMI and age during the warmer months.
Conclusions:
There are two distinct seasons with increased admissions during the colder part of the year. Males present a stronger seasonal pattern than females. There are spatial differences in AMI admissions throughout the year that cannot be explained by the age structure of the population. The seasonal difference in AMI admissions warrants further investigation. This includes detailing the prevalence of cardiac disease in the community and examining issues of social and environmental justice.
-
Background:
Health professionals, policy-makers and researchers need to be able to explore potential associations between prevalence rates and quality of care with a range of possible determinants including socio-economic deprivation and morbidity levels to determine the impact of commissioning and service delivery. In the UK, data in England are only available nationally at practice postcode level. In Scotland, such data are available based on an aggregate of the practices population's postcodes. The use of data assigned to the practice postcode may underestimate the association between ill health and income deprivation. Here, we report on the impact of using data assigned to the practice population by comparing analyses using English and Scottish data.
Results:
Income deprivation based on data assigned to the practice postcode under-estimated deprivation compared to using income deprivation data assigned to the practice population for the five least deprived deciles, and over-estimated deprivation for the five most deprived deciles. The biggest differences were found for the most deprived decile. A similar trend was found for limiting long-term illness (LLTI). Differences between the QOF prevalence rates of the least and most deprived deciles using practice postcode data were similar (0.2% points or less) in England and Scotland for 8 out of 10 clinical domains. Using practice population assigned deprivation, differences in the prevalence rate between the least and most deprived deciles increase for all clinical domains. A similar trend was again found for LLTI. Using practice population assigned deprivation, differences for population achievement increase for all CHD quality indicators with the exception of beta-blockers (CHD10). With practice postcode assigned deprivation, significant differences between the least and most deprived deciles were found for 2 out 8 indicators, compared to 5 using practice population assigned deprivation. For LLTI differences between the lowest and most deprived deciles increased for all indicators when ill health assigned to the practice population was used.
Conclusion:
We have found, through comparing deprivation and ill health data assigned to either the practice postcode or the practice population postcode in Scotland, that analyses based on practice postcode assigned data under-estimated the relationship between deprivation and ill health for both prevalence and quality care. Given the importance of understanding the effect of deprivation and ill health on a range of determinants related to health care, policy makers should ensure that practice population data are available and used at national level in England and elsewhere where possible.
-
ObjectivesThe aims of this study were to model jointly the incidence rates of six smoking related cancers in the Yorkshire region of England, to explore the patterns of spatial correlation amongst them, and to estimate the relative weight of smoking and other shared risk factors for the relevant disease sites, both before and after adjustment for socioeconomic background (SEB).
Methods:
Data on the incidence of oesophagus, stomach, pancreas, lung, kidney, and bladder cancers between 1983 and 2003 were extracted from the Northern & Yorkshire Cancer Registry database for the 532 electoral wards in the Yorkshire region. Using postcode of residence, each case was assigned an area-based measure of SEB using the Townsend index. Standardised incidence ratios (SIRs) were calculated for each cancer site and their correlations investigated. The joint analysis of the spatial variation in incidence used a Bayesian shared-component model. Three components were included to represent differences in smoking (for all six sites), bodyweight/obesity (for oesophagus, pancreas and kidney cancers) and diet/alcohol consumption (for oesophagus and stomach cancers).
Results:
The incidence of cancers of the oesophagus, pancreas, kidney, and bladder was relatively evenly distributed across the region. The incidence of stomach and lung cancers was more clustered around the urban areas in the south of the region, and these two cancers were significantly associated with higher levels of area deprivation. The incidence of lung cancer was most impacted by adjustment for SEB, with the rural/urban split becoming less apparent. The component representing smoking had a larger effect on cancer incidence in the eastern part of the region. The effects of the other two components were small and disappeared after adjustment for SEB.
Conclusion:
This study demonstrates the feasibility of joint disease modelling using data from six cancer sites. Incidence estimates are more precise than those obtained without smoothing. This methodology may be an important tool to help authorities evaluate healthcare system performance and the impact of policies.
-
Background:
In leprosy endemic areas, patients are usually spatially clustered and not randomly distributed. Classical statistical techniques fail to address the problem of spatial clustering in the regression model. Bayesian method is one which allows itself to incorporate spatial dependence in the model. However little is explored in the field of leprosy. The Bayesian approach may improve our understanding about the variation of the disease prevalence of leprosy over space and time.
Methods:
Data from an endemic area of leprosy, covering 148 panchayats from two taluks in South India for four time points between January 1991 and March 2003 was used. Four Bayesian models, namely, space-cohort and space-period models with and without interactions were compared using the Deviance Information Criterion. Cohort effect, period effect over four time points and spatial effect (smoothed) were obtained using WinBUGS. The spatial or panchayat effect thus estimated was compared with the raw standardized morbidity (leprosy prevalence) rate (SMR) using a choropleth map. The possible factors that might have influenced the variations of prevalence of leprosy were explored.
Results:
Bayesian models with the interaction term were found to be the best fitted model. Leprosy prevalence was higher than average in the older cohorts. The last two cohorts 1987–1996 and 1992–2001 showed a notable decline in leprosy prevalence. Period effect over 4 time points varied from a high of 3.2% to a low of 1.8%. Spatial effect varied between 0.59 and 2. Twenty-six panchayats showed significantly higher prevalence of leprosy than the average when Bayesian method was used and it was 40 panchayats with the raw SMR.
Conclusion:
Reduction of prevalence of leprosy was 92% for persons born after 1996, which could be attributed to various intervention and treatment programmes like vaccine trial and MDT. The estimated period effects showed a gradual decline in the risk of leprosy which could be due to better nutrition, hygiene and increased awareness about the disease. Comparison of the maps of the relative risk using the Bayesian smoothing and the raw SMR showed the variation of the geographical distribution of the leprosy prevalence in the study area. Panchayat or spatial effects using Bayesian showed clustersing of leprosy cases towards the northeastern end of the study area which was overcrowded and population belonging to poor economic status.
-
Background:
Chronic exposure to traffic-related air pollution is associated with a variety of health impacts in adults and recent studies show that exposure varies spatially, with some residents in a community more exposed than others. A spatial exposure simulation model (SESM) which incorporates six microenvironments (home indoor, work indoor, other indoor, outdoor, in-vehicle to work and in-vehicle other) is described and used to explore spatial variability in estimates of exposure to traffic-related nitrogen dioxide (not including indoor sources) for working people. The study models spatial variability in estimated exposure aggregated at the census tracts level for 382 census tracts in the Greater Vancouver Regional District of British Columbia, Canada. Summary statistics relating to the distributions of the estimated exposures are compared visually through mapping. Observed variations are explored through analyses of model inputs.
Results:
Two sources of spatial variability in exposure to traffic-related nitrogen dioxide were identified. Median estimates of total exposure ranged from 8 g/m3 to 35 g/m3 of annual average hourly NO2 for workers in different census tracts in the study area. Exposure estimates are highest where ambient pollution levels are highest. This reflects the regional gradient of pollution in the study area and the relatively high percentage of time spent at home locations. However, for workers within the same census tract, variations were observed in the partial exposure estimates associated with time spent outside the residential census tract. Simulation modeling shows that some workers may have exposures 1.3 times higher than other workers residing in the same census tract because of time spent away from the residential census tract, and that time spent in work census tracts contributes most to the differences in exposure. Exposure estimates associated with the activity of commuting by vehicle to work were negligible, based on the relatively short amount of time spent in this microenvironment compared to other locations. We recognize that this may not be the case for pollutants other than NO2. These results represent the first time spatially disaggregated variations in exposure to traffic-related air pollution within a community have been estimated and reported.
Conclusions:
The results suggest that while time spent in the home indoor microenvironment contributes most to between-census tract variation in estimates of annual average exposures to traffic-related NO2, time spent in the work indoor microenvironment contributes most to within-census tract variation, and time spent in transit by vehicle makes a negligible contribution. The SESM has potential as a policy evaluation tool, given input data that reflect changes in pollution levels or work flow patterns due to traffic demand management and land use development policy.
-
'Mashup' was originally used to describe the mixing together of musical tracks to create a new piece of music. The term now refers to Web sites or services that weave data from different sources into a new data source or service. Using a musical metaphor that builds on the origin of the word 'mashup', this paper presents a demonstration "playlist" of four geo-mashup vignettes that make use of a range of Web 2.0, Semantic Web, and 3-D Internet methods, with outputs/end-user interfaces spanning the flat Web (two-dimensional -- 2-D maps), a three-dimensional -- 3-D mirror world (Google Earth) and a 3-D virtual world (Second Life (R)). The four geo-mashup "songs" in this "playlist" are: 'Web 2.0 and GIS (Geographic Information Systems) for infectious disease surveillance', 'Web 2.0 and GIS for molecular epidemiology', 'Semantic Web for GIS mashup', and 'From Yahoo! Pipes to 3-D, avatar-inhabited geo-mashups'. It is hoped that this showcase of examples and ideas, and the pointers we are providing to the many online tools that are freely available today for creating, sharing and reusing geo-mashups with minimal or no coding, will ultimately spark the imagination of many public health practitioners and stimulate them to start exploring the use of these methods and tools in their day-to-day practice. The paper also discusses how today's Web is rapidly evolving into a much more intensely immersive, mixed-reality and ubiquitous socio-experiential Metaverse that is heavily interconnected through various kinds of user-created mashups.
-
Background:
Health professionals, policy-makers and researchers need to be able to explore potential associations between prevalence rates and quality of care with a range of possible determinants including socio-economic deprivation and morbidity levels to determine the impact of commissioning and service delivery. In the UK, data in England are only available nationally at practice postcode level. In Scotland, such data are available based on an aggregate of the practices population's postcodes. The use of data assigned to the practice postcode may underestimate the association between ill health and income deprivation. Here, we report on the impact of using data assigned to the practice population by comparing analyses using English and Scottish data.
Results:
Income deprivation based on data assigned to the practice postcode under-estimated deprivation compared to using income deprivation data assigned to the practice population for the five least deprived deciles, and over-estimated deprivation for the five most deprived deciles. The biggest differences were found for the most deprived decile. A similar trend was found for limiting long-term illness (LLTI). Differences between the QOF prevalence rates of the least and most deprived deciles using practice postcode data were similar (0.2% points or less) in England and Scotland for 8 out of 10 clinical domains. Using practice population assigned deprivation, differences in the prevalence rate between the least and most deprived deciles increase for all clinical domains. A similar trend was again found for LLTI. Using practice population assigned deprivation, differences for population achievement increase for all CHD quality indicators with the exception of beta-blockers (CHD10). With practice postcode assigned deprivation, significant differences between the least and most deprived deciles were found for 2 out 8 indicators, compared to 5 using practice population assigned deprivation. For LLTI differences between the lowest and most deprived deciles increased for all indicators when ill health assigned to the practice population was used.
Conclusion:
We have found, through comparing deprivation and ill health data assigned to either the practice postcode or the practice population postcode in Scotland, that analyses based on practice postcode assigned data under-estimated the relationship between deprivation and ill health for both prevalence and quality care. Given the importance of understanding the effect of deprivation and ill health on a range of determinants related to health care, policy makers should ensure that practice population data are available and used at national level in England and elsewhere where possible.
-
Background:
The Pennsylvania Cancer Atlas (PA-CA) is an interactive online atlas to help policy-makers, program managers, and epidemiologists with tasks related to cancer prevention and control. The PA-CA includes maps, graphs, tables, that are dynamically linked to support data exploration and decision-making with spatio-temporal cancer data. Our Atlas development process follows a user-centered design approach. To assess the usability of the initial versions of the PA-CA, we developed and applied a novel strategy for soliciting user feedback through multiple distributed focus groups and surveys. Our process of acquiring user feedback leverages an online web application (e-Delphi). In this paper we describe the PA-CA, detail how we have adapted e-Delphi web application to support usability and utility evaluation of the PA-CA, and present the results of our evaluation.
Results:
We report results from four sets of users. Each group provided structured individual and group assessments of the PA-CA as well as input on the kinds of users and applications for which it is best suited. Overall reactions to the PA-CA are quite positive. Participants did, however, provide a range of useful suggestions. Key suggestions focused on improving interaction functions, enhancing methods of temporal analysis, addressing data issues, and providing additional data displays and help functions. These suggestions were incorporated in each design and implementation iteration for the PA-CA and used to inform a set of web-atlas design principles.
Conclusion:
For the Atlas, we find that a design that utilizes linked map, graph, and table views is understandable to and perceived to be useful by the target audience of cancer prevention and control professionals. However, it is clear that considerable variation in experience using maps and graphics exists and for those with less experience, integrated tutorials and help features are needed. In relation to our usability assessment strategy, we find that our distributed, web-based method for soliciting user input is generally effective. Advantages include the ability to gather information from users distributed in time and space and the relative anonymity of the participants while disadvantages include less control over when and how often participants provide input and challenges for obtaining rich input.
-
Background:
Since 1999, the expansion of the West Nile virus (WNV) epizooty has led public health authorities to build and operate surveillance systems in North America. These systems are very useful to collect data, but cannot be used to forecast the probable spread of the virus in coming years. Such forecasts, if proven reliable, would permit preventive measures to be put into place at the appropriate level of expected risk and at the appropriate time. It is within this context that the Multi-Agent GeoSimulation approach has been selected to develop a system that simulates the interactions of populations of mosquitoes and birds over space and time in relation to the spread and transmission of WNV. This simulation takes place in a virtual mapping environment representing a large administrative territory (e.g. province, state) and carried out under various climate scenarios in order to simulate the effects of vector control measures such as larviciding at scales of 1/20 000 or smaller.
Results:
After setting some hypotheses, a conceptual mo