Progress 11/01/08 to 09/30/11
Outputs OUTPUTS: This integrated research/extension project aims to develop a cost-efficient and sustainable infrastructure to produce sub-county level population estimates by utilizing state-of-art applied demography, information technology, and geographic information system (GIS) technologies. In the first year, we compared the basic indicators containing individual, family, and household information and other social and economic characteristics by using summary file 1 and 3 for both 1990 and 2000 census data that provided us the base population information to be compared with the our population. Due to different definitions and categories between two census questions, however, we had to select only comparable variables from the summary files and compiled the series of documentation that is important for future reference; e.g., the document that explains the procedure of variable selection, the SAS programming files, the codebook and the data dictionary for the selected variables, and the original and the selected variable files. In the second year, our main task was to compile the large datasets from various sources to conduct small area population estimation by using GIS techniques in the third year. The datasets are compiled from various agencies such as from the U.S. Census Bureau, the U.S. Department of Labor, the National Center for Health Statistics, the Hawaii State Department of Education, the Hawaii State Business, Economic Development and Tourism, Department of Health, and the Hawaii State Department of Health. The information included in the dataset is population size by age and sex, migration status, and military population size, births and deaths, life tables, and the enrollment size for private and public schools. To conduct GIS techniques, we finished geocoding and census data compilation. Due to many unstandardized geo addresses in Hawaii, the geocoding was the most important and difficult task in the project and often required manual work to recode unstandardized geo addresses. In the third year, by incoorporating the three main popular methods, called school enrollement methods, housing unit methods, cohort-survival methods, and GIS techniques, we estimated population estiamtes 2000 and 2010 by 43 schoool complex areas. The GIS procedure "intersect" was performed on the map of 43 school districts with the map of census blocks. This kind of overlay of one map on top of another created crisscross of areas. The share of these areas of housing units and population were determined and then aggregated for school districts. One of the main tasks of this project is to create a strong partnership with stakeholders on the estimation method, products, and further development. We have continued to contact and share the information about this project with the state departments such as DOE and DOH to share the ideas and information on this project. In addition, we have met with the state geographers to share this idea and reach an agreement of sharing and exchanging the most update GIS information with COF. PARTICIPANTS: Individuals (me from the Center on the Family served as principal participant): Hosik Min, Shijen He, Shriram Bhutada, Sarah Yuan, Carol Lin, Saiful Momen Institutions (from the State Department) Alvin Onaka, PhD, State Registrar and Chief of the Office of Health Status Monitoring, Department of Health Keith Yamamoto, Chief, Alcohol and Drug Abuse Division, Hawaii Department of Health Joan M. Delos Santos, Data Processing Systems Analyst, State of Hawaii Office of Planning Craig T. Tasaka, Planning Program Manager, State of Hawaii Office of Planning Royce Jones, Hawaii-Pacific Manager, Environmental Systems Research Institute Inc. (ESRI) TARGET AUDIENCES: Administrators of state agencies and local nonprofit community agencies relating to health, human services, education, and economic development Academic researchers PROJECT MODIFICATIONS: Nothing significant to report during this reporting period.
Impacts This project achieved four main outcomes and impacts. (1) We obtained the base population information by comparing the census data between 1990 and 2000 from the SF1 and SF3 data. We did the rigorous crosswalk between them to match the variables those have different definitions and values and used the distribution methods to align the SF3 with the SF1 because only the SF1 contains the 100% sample, while the SF3 contains the partial. (2) One of the main tasks was to obtain population estimates for 43 school complexes by incorporating various data sources. Using GIS techniques as well as other three major popular methods, school enrollement methods, housing unit methods, cohort survival methods, we estimated population estiamtes 2000 and 2010 by 43 schoool complex areas. The outputs can be provided in both the spreadsheet table format and the maps. Each tables and maps contained the basic population information such as population size by age and sex. However, those estimates for few areas do have some errors because of incomplete information. (3) Another task is to build a sustainable data incorporation for further population estimation. To create the GIS usable datasets was the one of the most challenging parts in this project. To enhance the data accuracy, the team used the private sector information such as zipcode and digital maps. Also, the property tax data including tax map, dwelling records, commercial tax records, permit records, and timeshare construction helped the project enormously. The team set up the procedure to replicate the population estimation for the additional year(s). However, each additional year requires all the necessary sets of data to conduct the estimation. (4) Because this project requires various kinds of administrative data, the center has a successful relationship with DOE and DOH in sharing the data and the idea of this project, and continued to update the current status of the project and exchange the data. The team had the official meeting with the state GIS experts and continued to contact them to solve the geocoding issue.
Publications
- No publications reported this period
|
Progress 10/01/09 to 09/30/10
Outputs OUTPUTS: The key outputs in the second year were a compilation of the two large datasets from various sources to conduct small area population estimation in the next year. One dataset is for the demographic techniques, called the housing method, and the other one is for the Geographic Information System (GIS) techniques. The dataset for the demographic method is compiled from various agencies such as from the U.S. Census Bureau, the U.S. Department of Labor, the National Center for Health Statistics, the Hawaii State Department of Education, the Hawaii State Business, Economic Development and Tourism, Department of Health, and the Hawaii State Department of Health. The information included in the dataset is population size by age and sex, migration status, and military population size, births and deaths, life tables, and the enrollment size for private and public schools. All these information is provided at the county level and since 2000 to present. This level of information meets the minimum requirement for the housing method. Another dataset for the GIS technique comprises of two parts; geocoding and census data compilation. The geocoding is the most important and difficult task in this project because Hawaii has a lot of unstandardized geo addresses. As these addresses are not easy to be coded to analyze, the manual work has to be done. These difficulties are found at the neighbor islands, Big Island in particular, and now is wait for the test run. As a strong partnership with stakeholders is critical, we have continued to contact and share the information about this project with the state departments such as DOE and DOH and the state geographers to share the ideas and information on this project. The final results will be reported to the stakeholders in the workshop or conference after we have accomplished this project, and will be published group's aimed refereed journal. PARTICIPANTS: Individuals (me from the Center on the Family served as principal participant): Hosik Min, Shi-Jen He, Sarah Yuan, Carol Lin, Shriram Bhutada Institutions (from the State Department) Alvin Onaka, PhD, State Registrar and Chief of the Office of Health Status Monitoring, Department of Health Keith Yamamoto, Chief, Alcohol and Drug Abuse Division, Hawaii Department of Health Joan M. Delos Santos, Data Processing Systems Analyst, State of Hawaii Office of Planning Craig T. Tasaka, Planning Program Manager, State of Hawaii Office of Planning Royce Jones, Hawaii-Pacific Manager, Environmental Systems Research Institute Inc. (ESRI) TARGET AUDIENCES: Administrators of state agencies and local nonprofit community agencies relating to health, human services, education, and economic development Hawaii State legislators Service providers and advocates who work with at-risk populations Academic researchers PROJECT MODIFICATIONS: Nothing significant to report during this reporting period.
Impacts As this project is still in the process, real outcomes and impacts are yet to be reported. However, the followings are the progress report from the current work. The main outcomes of the second year are to prepare the datasets for the demographic and GIS analysis. The required datasets for the population estimation methods are from various Federal and State agencies and the data from those agencies are 2000 U.S. Census, 2001-2009 American Community Survey, 2001-2009 Current Population Survey, 2000-2008 Vital Statistics (2008 is the most recent one), 2000-2006 Life Table Report (2006 is the most recent one), 2000-2009 Military Population Report, and 2000-2009 School Enrollments Report from both public and private schools. From those data, we obtain the following demographic information that is essential to conduct the small area estimation: total population size by age and sex, migration status, military population size, vital statistics such as births and deaths, life expectancies from life tables, and the enrollment size for both private and public schools by each grade. All the information we obtained from these data could provide at the county level and are collected from 2000 to present. This level of information meets the minimum requirement for the housing method. We have compiled a series of documentation for future reference: the complete list for the original data sources, the STATA programming document to obtain the selected variable data from the original data, the data dictionary and the codebook for the selected variables, and the elucidation of the procedure we have taken. Another major dataset is for the GIS analysis. Although the GIS techniques require the accurate GIS information, specifically geocoding, the geocoding task involves a rigorous amount of time and effort because of the difficult technical issue such as unstandardized addresses. A graduate assistant who is majoring in GIS and has deeper knowledge and understanding of the Hawaii GIS system has continued to work on these issues and now almost completed. More specifically, we only need to do the test run to confirm the geocoding accuracy. After the test run, we will be able to analyze the small area estimation, which is the main procedure of this project. The data this GIS technique requires are also the same as the demographic methods: various kinds of administrative data such as school enrollment, vital statics data, and population data. These data can be shared by applying GIS information.
Publications
- No publications reported this period
|
Progress 10/01/08 to 09/30/09
Outputs OUTPUTS: This integrated research/extension project aims to develop a cost-efficient and sustainable infrastructure to produce sub-county level population estimates. The present action research project will integrate science, technology, and extension to develop a blueprint and lay the foundation for the data system infrastructure. By utilizing state-of-art applied demography, information technology, and geographic information system (GIS) technologies, the project will contain the capacity to produce timely estimates as needed by key stakeholders; a strong and active partnership with stakeholders on the estimation method, products, and further development; the techniques that can incorporate the innovative information with GIS technologies to streamline estimation, eliminate human errors, and reduce operational cost; the capability to enhance estimation precision and details iteratively; and multiple channels to disseminate products and promote stakeholders' buy-ins. Various internal and external resources will be used to ensure the project's success and outcome quality. As for the first year, we conducted a research that compared the basic indicators containing individual, family, and household information and other social and economic characteristics by using summary file 1 and 3 for both 1990 and 2000 census data. This comparison task is very important for the project, because the census data, the most reliable source for the population information, can provide the base population to be compared with the population estimates that we will create by using various estimation methods. Due to different definitions and categories between two census questions, however, we have to select only comparable variables from the summary files. During the process, we have complied the series of documentation that is important for future reference; e.g., the document that explains the procedure of variable selection, the SAS programming files, the codebook and the data dictionary for the selected variables, and the original and the selected variable files. We have created the county level comparison dataset, and are working on obtaining the sub-county level population estimates. Another major task of the first year was to build a strong partnership with stakeholders. As the project requires various sources of data in nature, building a strong partnership with stakeholders is critical. We have continued to contact and share the information about this project with the state departments such as DOE and DOH to share the ideas and information on this project. In addition, we have met with the state geographers to share this idea and reach an agreement of sharing and exchanging the most update GIS information with COF. Because incorporating GIS techniques with current demographic ones is one of the main objectives, this relationship would be a great help on succeeding the project. Following the goals of the project, the results will be reported to the stakeholders in the workshop or conference and published in group's aimed collaborative book(s) or other refereed journal. PARTICIPANTS: Individuals (me from the Center on the Family served as principal participant): Hosik Min, Shi-Jen He, Sarah Yuan, Carol Lin Institutions (from the State Department): Alvin Onaka, PhD, State Registrar and Chief of the Office of Health Status Monitoring, Department of Health; Keith Yamamoto, Chief, Alcohol and Drug Abuse Division, Hawaii Department of Health; Joan M. Delos Santos, Data Processing Systems Analyst, State of Hawaii Office of Planning; Craig T. Tasaka, Planning Program Manager, State of Hawaii Office of Planning; Royce Jones, Hawaii-Pacific Manager, Environmental Systems Research Institute Inc. (ESRI) TARGET AUDIENCES: Administrators of state agencies and local nonprofit community agencies relating to health, human services, education, and economic development Hawaii State legislators Service providers and advocates who work with at-risk populations Academic researchers PROJECT MODIFICATIONS: Nothing significant to report during this reporting period.
Impacts It is important to note that because all current projects are in the process, there are no real outcomes and impacts yet to be reported. However, the followings are the progress report from the current work. (1) One of the main tasks in the first year is to compare the census data between 1990 and 2000, which is the most reliable data on population. The results of the comparison would serve as the base population to be compared with the estimation outcomes generated by various demographic and GIS methods we would employ. To link the data with geographic location, we used the Census Bureau's summary files (SF). We did the rigorous crosswalk between them to match the variables those have different definitions and values. In this process, we have acquired the series of documentations for the future reference: the complete list for the original and selected variables; the SAS programming document to obtain the selected variable data from the original SF; the data dictionary and the codebook for the selected variables; and the elucidation of the procedure we have taken. The use the SF, however, has a limitation: Only the SF1 contains the 100% sample, while the SF3 contains the partial. Therefore, we are in the process of applying the distribution methods, considering the land size, the population size, and the population composition, to align the SF3 with the SF1. (2) Another major task is to build a strong partnership with stakeholders. Because this project requires various kinds of administrative data such as school enrollment, vital statics data, and GIS information, without the help from those government agencies it would be extremely difficult to achieve the project goal. In addition, considering that the project outputs would be used by those agencies for the decision-making, the use and contents of the final product would be decided based on their needs, and the product would be updated in continual basis, forming a strong and close partnership is crucial. The center has a successful relationship with DOE and DOH in sharing the data and the idea of this project, and continued to update the current status of the project and exchange the data. GIS is one of the most challenging parts in this project. In order to succeed this project, the center has to incorporate GIS techniques into small area population estimation, and the GIS techniques require the accurate GIS information, specifically geocoding. The geocoding task, however, involves a rigorous amount of time and effort because of the difficult technical issue such as unstandardized addresses. To solve this issue, the team had the official meeting with the state GIS experts. And the meeting led us to exchange the mutual interests on this project and agree to share the information through the project. In addition, to increase our capacity of GIS data handling, the center hired a graduate assistant who is majoring in GIS and has deeper knowledge and understanding of the Hawaii GIS system. This collaboration would bring us not only to save tremendous time and effort for our GIS experts, but also to increase the accuracy of the population estimates that we create.
Publications
- No publications reported this period
|
|