Main content
TPDD : Taiwan Indigenous Peoples Population Dynamics open Data 台灣原住民族人口動態開放資料
Date created: | Last Updated:
: DOI | ARK
Creating DOI. Please wait...
Category: Project
Description: Part research outcomes from TIPD at https://osf.io/e4rvz/ TPDD : Taiwan Indigenous Peoples Population Dynamics open Data 台灣原住民族人口動態開放資料 1_CopyLeft(L) 2013-2024, Council of Indigenous Peoples (https://www.cip.gov.tw/), Taiwan. 2_Acknowledgement: The reserach outcomes are part of research outcomes from the research program "Research on Contemporary Taiwan Indigenous Peoples" (https://osf.io/e4rvz/), which is mainly supported by Council of Indigenous Peoples in 2013-2022. The research is also granted by National Science and Technology Council, grant numbers: MOST 106-2420-H-001-008-MY2; MOST 106-2420-H-156-001-MY2; MOST 109-2420-H-001-003; MOST 109-2420-H-156-001; NSTC 112-2410-H-001-075-MY2. We appreciate some research grants from Academia Sinica. 3_誌謝:本計畫是"台灣原住民基礎開放研究資料庫" (TIPD, https://osf.io/e4rvz/) 部分成果,主要由原住民族委員會支援研究經費;國科會亦支持本計畫之相關計畫,計畫編號:MOST 106-2420-H-001-008-MY2; MOST 106-2420-H-156-001-MY2; MOST 109-2420-H-001-003; MOST 109-2420-H-156-001; NSTC 112-2410-H-001-075-MY2;我們亦感謝中研院支持部分研究經費。 4_連絡Contact E-mail: TIPDTwn@gmail.com 為何是CopyLEFT/Why CopyLEFT: "CopyLEFT (L)" is an effective way of building mutual trust that leads to mutual understanding, collaboration, and innovation. "CopyLeft(L)"是促進互信的有效方式,其公開特質將進一步促進相互理解、協同合作、及創新。 語言/Languages: TIPD is bilingual in English and traditional Chinese TIPD以雙語資訊呈現,包括英文及繁體中文 資料來源引用 Citation of data source: If TIPD helps overcoming data & information constraints in your research & work, please kindly acknowledge TIPD's contribution. We will be highly encouraged! 如果TIPD對您研究工作有助益,請不吝引用或說明為研究資料來源,這對我們是另種形式鼓勵! 連絡方式及通報錯誤/Contact and report errata: E-mail: TIPDTwn@gmail.com 電子郵件: TIPDTwn@gmail.com 《當代台灣原住民族:堅毅不屈,燦爛花開》 1940年至2000年間,台灣原住民族資料變的相當稀缺,這不僅使得台灣原住民各類發展被邊緣化,亦使原住民在台灣逐漸”被消失”。2000年戶口及住宅普查再度恢復原住民普查是為重大里程碑,但2010年後台灣實質人口普查已不復存在,陳年問題依舊無解。為解決原住民基礎資料缺乏問題,原住民族委員會自2013 ~ 2022計十年,不預設立場及充份尊重學術專業獨立及自主性,給與研究計畫支持。感謝十年研究期間三位主任委員支持:孫大川(計畫推動者)、林江義、及夷將.拔路兒。計畫實際執行期間,行政支援及協調最大貢獻者,為原民會綜合規劃處王瑞盈前處長,若無他專業支援及豐富行政經驗,本計畫實難以順利進行,研究團隊在這裡公開對王前處長表示衷心感謝。 計畫主持人亦感謝共同主持人李明政教授,兩人十年合作的默契及情誼。這十年期間,原民會在研究經費、資料、計算機硬體設備、及行政給予大力協助,國科會亦給予多年期研究經費支持,使研究團隊終能放手一博,不再受限資料及計算設備匱乏限制。主持人在2013年已深知這是非常難得機會,錯過將不復返,因此這十年期間全心投入,以科學計算技術和資料科學學理,運用高效能及超級運算設備,全力投入原住民基礎研究,建立各類大型開放資料庫,且不設任何限制開放供國內外自由使用(參考TIPD: 台灣原住民基礎開放研究資料庫 https://osf.io/e4rvz/)。 這研究採用的方法及技術非常先進,成果早已受到國際重要學研單位注意。國際著名"開放科學研究中心" (COS, Center for Open Science,參考 https://www.cos.io/ 及簡介 https://www.cos.io/about ),主動連繫及邀請主持人進行視訊專訪,訪談內容是有關當代台灣原住民及原住民基礎開放研究資料庫之研究緣起、過程、方法、影響、未來發展,及過去15年,及研究如何用開放資料、開放科學、及科學計算,使"瀕臨消失"被邊緣化的台灣原住民,如何重新浮上抬面及成為顯性人口。COS將訪談內容整理以文字稿公布,閱讀者為全球專家,因此影響深遠,訪問稿原文連結網址如下:https://www.cos.io/blog/taiwan-indigenous-peoples?hs_preview=QUOrJLoo-176808682520 COS設計及完成之"開放科學架構"(OSF, Open Science Framework, 參考 https://osf.io/ ),目前是全球研究計畫最大註冊中心,超過60萬全球研究人員在此登記註冊。放在OSF的TIPD就是COS最重要架構,目前已是全球最大開放科學及研究計晝註冊單位,影響力相當大。2013計晝剛開始時,COS也才剛成立,OSF架構亦在草創階段,2015年初我決定將都原計晝重要產出TIPD放在OSF時,當時全球也才只有五百多個研究計晝放在OSF公開;主持人在OSF草創階段,以TIPD放在OSF之經驗,至少有四年時間主持人亦同時幫OSF提出架構如何改善及建議,和OSF合作密切,所以他們也深知原民會支持之研究計晝和TIPD研究通程,TIPD和OSF可謂是同步成長。 十年尚難磨一劍,完成基礎研究及克服核心問題後,真正重頭戲才要開始。很榮幸及欣慰能再次將台灣原住民,全面推向國際舞台! 當代台灣原住民相關資料 Related Projects TIPs on OSF: 1. Lin, Ji-Ping, Ming-Cheng Lee, Hiu Ha Chong, Li-Chuan Liu, Kui Kasirisir, and HSIN-CHUNG WANG. 2024. “TIPD: Taiwan Indigenous Peoples Open Research Data 台灣原住民基礎開放研究資料庫.” OSF https://osf.io/e4rvz/. May 8. doi:10.17605/OSF.IO/E4RVZ. 2. Lin, Ji-Ping. 2024. “TPDD: Taiwan Indigenous Peoples Population Dynamics Open Data 台灣原住民族人口動態開放資料.” OSF https://osf.io/ukjgs/. May 7. doi:10.17605/OSF.IO/UKJGS. 3. Lin, Ji-Ping. 2024. “TMDT: Taiwan Indigenous Peoples’ Multidimensional Table Data 台灣原住民族多維表資料庫.” OSF https://osf.io/a23ec/ . May 7. doi:10.17605/OSF.IO/A23EC. 4. Lin, Ji-Ping. 2024. “TICT: Taiwan Indigenous Peoples’ Contingency Tables 台灣原住民族基礎列聯表.” OSF https://osf.io/9nrwd/. May 7. doi:10.17605/OSF.IO/9NRWD. 5. Lin, Ji-Ping. 2024. “TIHD: Taiwan Indigenous Peoples Household Structure Open Data 台灣原住民族家戶結構基礎開放資料庫.” OSF https://osf.io/zbwek. May 7. doi:10.17605/OSF.IO/ZBWEK. 6. Lin, Ji-Ping. 2024. “TICD: Taiwan Indigenous Communities Open Data 台灣原住民族部落開放資料庫.” OSF https://osf.io/esw67/. May 7. doi:10.17605/OSF.IO/ESW67. 7. Lin, Ji-Ping. 2024. “iqTICD: Integrated Query System of Taiwan Indigenous Community Open Data 台灣原住民族部落開放資料庫綜合查詢系統.” OSF https://osf.io/rfe6p/. May 7. doi:10.17605/OSF.IO/RFE6P. 8. Lin, Ji-Ping. 2024. “TIHV: Taiwan Indigenous People’s High-Resolution Visualization of Population Distribution, Migration Dynamics, Traditional Communities by Ethnic Groups 當代台灣原住民高解析度視覺化圖形資料庫:人口分布、遷徙、傳統部落.” OSF https://osf.io/v8zk3/. May 8. doi:10.17605/OSF.IO/V8ZK3. 9. Lin, Ji-Ping. 2024. “TIMD: Taiwan Indigenous Peoples Migration Dynamics 台灣原住民各族群遷徙動態.” OSF https://osf.io/6rpz9/. May 7. doi:10.17605/OSF.IO/6RPZ9. 10. Lin, Ji-Ping. 2024. “iqTICD on Google: Taiwan Indigenous Communities Open Data Query System on Google 台灣原住民族部落開放資料庫Google綜合查詢系統.” Google https://www.google.com/maps/d/u/0/edit?mid=1M6FE6vKd212udjTvvWmFsaKavB_3TpF7&ll=23.678779780141927%2C120.89717315782033&z=6 11. Migration dynamics of hard-to-reach population in the context of regional inequality: Longitudinal research in Taiwan https://osf.io/bs39t/ Introduction 當代台灣原住民相關資料: (1) Joint page on TIPD & TICD open data: https://www.rchss.sinica.edu.tw/capas/pages/1075?cat=80 (2) 台灣原住民基礎開放研究資料庫 TIPD: https://www.rchss.sinica.edu.tw/capas/posts/11206 (3) 台灣原住民族部落開放資料庫 TICD: https://www.rchss.sinica.edu.tw/capas/posts/11205 (4) 台灣原住民族部落開放資料庫綜合查詢系統 Integrated Query System of TICD: https://www.rchss.sinica.edu.tw/capas/posts/11420 (5) 台灣原住民各族群遷徙動態 Migration Dynamics of TIPs: https://www.rchss.sinica.edu.tw/capas/posts/11329 (6) 當代台灣原住民高解析度視覺化圖形資料庫:人口分布、遷徙、傳統部落 High-resolution visualization of TIPs's population distribution, migration dynamics, traditional communities by ethnic groups: https://www.rchss.sinica.edu.tw/capas/posts/11393 TICD 相關網址(TICD URLs): (1) 資料下載(download URLs) (a) TICD on OSF(Open Science Foundation)網址: https://osf.io/esw67/ (b) TICD via AS: https://TICD.RCHSS.sinica.edu.tw (2) TICD查詢網址(query URLs) (a) TICD on Google: https://TICDonGoogle.RCHSS.sinica.edu.tw (b) original URL: https://www.google.com/maps/d/u/0/edit?mid=1M6FE6vKd212udjTvvWmFsaKavB_3TpF7&ll=23.792343818784946%2C120.83930934 INTRODUCTION: The data on population dynamics of Taiwan indigenous peoples (TIPs) is part of research outcomes from TIPD (https://osf.io/e4rvz/). The data on population dynamics of Taiwan indigenous peoples (TIPs) is a comprehensive population dynamics data based on 2013-2012 monthly household register data. Its underlying fundamentals, methodology, and techniques lie in data science, Big Data, record linkage, geocoding, and high-performance scientific computing. Building this population dynamics data involves both deterministic and probabilistic record linkage methods. It consists of two broad categories: one is population dynamics data (open data) and the other genealogy data (non-open data). It is worth stressing that in-memory computing serves as a very crucial role in data construction. As noted, the computing tasks involve complex record linkage, including both deterministic and probabilistic record matching. This is particularly important, for example, while constructing micro genealogy data by record linkage between a master databank and reference databank through the links of parents’ names and spouse’s name. Computational process of constructing population dynamics archives to reduce unnecessary searches in the reference databank, the reference databank is sorted by the order of gender, family name, and given name. Furthermore, the sorted reference databank is indexed by a file that records information on the first row of record for each sequence of gender, family name, and given name in the sorted reference databank. Taking the construction of individual genealogy for example, the research matches each individual in the master databank with the reference databank, with respect to the individual father’s name, mother’s name, and spouse name. The record linkage between the master databank and the sorted reference databank is implemented by the following procedures: first, for any given individual record in the master databank, use information stored in 5 the index file to acquire the first row of information for the name to be matched in the sorted reference databank; second, use information retrieved from the index file to locate the first record in the sorted reference databank; third, search the name to be matched in the sorted reference databank; if the name to be matched is not unique in the sorted reference databank, we choose the record with the maximum likelihood as the person to be matched using information on age and ethnicity; fourth, pick up the matched individual record in the sorted reference databank and insert it at the end of the individual for matching in the master databank. To accelerate the construction of a micro databank of human relationships and kinship, the research takes advantage of in-memory high performance computing (HPC) techniques. The data enables us to conduct researches on birth dynamics, death dynamics, migration dynamics, life table, survival analysis, individual-level social network dynamics, etc. 簡介: 「台灣原住民人口動態資料庫」是2013至2022年間,進行原民會合作研究計畫最重要研究成果之一(參考https://osf.io/e4rvz/)。本研究資料庫是運用科學計算技術及資料科學學理,運用高效能及高精確巨量資料自動化資料錄連結及空間定位方法和技術,在保護隱私前提下,本人口動態(population dynamics)資料是目前全球唯一最完整且合乎學理要求所完成的人口動態貫時性資料(longitudinal data)。此人口動態資料允許我們能真正解構出生、死亡、遷徙三種互相影響複雜的人口動態過程,進行精準的”點對點”人口分布、出生動態、死亡動態、遷徙動態(包括初級、回流、及連續遷徒)、及相關社會網絡研究,允許對研究所謂”難以接觸人口“(例如台灣原住民),進行高度精準時空動態分析。利用本硏究人口動態資料,原住民詳細生命表資訊首次編製出來,對醫療、衞生、照護、保險、年金、健康不平等政策硏究,提供以前沒有之資訊及對決策產生重大影 響。