[go: up one dir, main page]

IDEAS home Printed from https://ideas.repec.org/p/sza/wpaper/wpapers299.html
   My bibliography  Save this paper

Record linkage in the Cape of Good Hope Panel

Author

Listed:
  • Auke Rijpma

    (Department of History, Utrecht University)

  • Jeanne Cilliers

    (Department of Economic History, Lund University)

  • Johan Fourie

    (Department of Economics, Stellenbosch University)

Abstract
In this paper we describe the record linkage procedure to create a panel from Cape Colony census returns, or opgaafrolle, for 1787--1828, a dataset of 42,354 household-level observations. Based on a subset of manually linked records, we first evaluate statistical models and deterministic algorithms to best identify and match households over time. By using household-level characteristics in the linking process and near-annual data, we are able to create high-quality links for 84 percent of the dataset. We compare basic analyses on the linked panel dataset to the original cross-sectional data, evaluate the feasibility of the strategy when linking to supplementary sources, and discuss the scalability of our approach to the full Cape panel.

Suggested Citation

  • Auke Rijpma & Jeanne Cilliers & Johan Fourie, 2018. "Record linkage in the Cape of Good Hope Panel," Working Papers 06/2018, Stellenbosch University, Department of Economics.
  • Handle: RePEc:sza:wpaper:wpapers299
    as

    Download full text from publisher

    File URL: https://www.ekon.sun.ac.za/wpapers/2018/wp062018/wp062018.pdf
    File Function: First version, 2018
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Catherine G. Massey, 2017. "Playing with matches: An assessment of accuracy in linked historical data," Historical Methods: A Journal of Quantitative and Interdisciplinary History, Taylor & Francis Journals, vol. 50(3), pages 129-143, July.
    2. Gary Solon & Steven J. Haider & Jeffrey M. Wooldridge, 2015. "What Are We Weighting For?," Journal of Human Resources, University of Wisconsin Press, vol. 50(2), pages 301-316.
    3. Dieter von Fintel & Sophia Du Plessis & Ada Jansen, 2013. "The Wealth Of Cape Colony Widows: Inheritance Laws And Investment Responses Following Male Death In The 17th And 18th Centuries," Economic History of Developing Regions, Taylor & Francis Journals, vol. 28(1), pages 87-108, June.
    4. Maia Güell & José V. Rodríguez Mora & Christopher I. Telmer, 2015. "The Informational Content of Surnames, the Evolution of Intergenerational Mobility, and Assortative Mating," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 82(2), pages 693-735.
    5. Johan Fourie & Dieter von Fintel, 2010. "The dynamics of inequality in a newly settled, pre-industrial society: the case of the Cape Colony," Cliometrica, Journal of Historical Economics and Econometric History, Association Française de Cliométrie (AFC), vol. 4(3), pages 229-267, October.
    6. Heckman, James, 2013. "Sample selection bias as a specification error," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 31(3), pages 129-137.
    7. Maia Güell & José V. Rodríguez Mora & Christopher I. Telmer, 2014. "Intergenerational Mobility and the Informational Content of Surnames," Working Papers 2014-01, FEDEA.
    8. Clark, Gregory & Cummins, Neil & Hao, Yu & Vidal, Dan Diaz, 2015. "Surnames: A new source for the history of social mobility," Explorations in Economic History, Elsevier, vol. 55(C), pages 3-24.
    9. Steven Ruggles, 2014. "Big Microdata for Population Research," Demography, Springer;Population Association of America (PAA), vol. 51(1), pages 287-297, February.
    10. Johan Fourie, 2013. "The remarkable wealth of the Dutch Cape Colony: measurements from eighteenth-century probate inventories," Economic History Review, Economic History Society, vol. 66(2), pages 419-448, May.
    11. Hao Dong & Cameron Campbell & Satomi Kurosu & Wenshan Yang & James Lee, 2015. "New Sources for Comparative Social Science: Historical Population Panel Data From East Asia," Demography, Springer;Population Association of America (PAA), vol. 52(3), pages 1061-1088, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Vittoria La Serra & Emiliano Svezia, 2023. "A robust record linkage approach for anomaly detection in granular insurance asset reporting," Questioni di Economia e Finanza (Occasional Papers) 821, Bank of Italy, Economic Research and International Relations Area.
    2. Vittoria La Serra & Emiliano Svezia, 2024. "A supervised record linkage approach for anomaly detection in insurance assets granular data," Quality & Quantity: International Journal of Methodology, Springer, vol. 58(5), pages 4181-4205, October.
    3. Johan Fourie & Frank Garmon, 2023. "The settlers’ fortunes: Comparing tax censuses in the Cape Colony and early American republic," Economic History Review, Economic History Society, vol. 76(2), pages 525-550, May.
    4. Martins, Igor & Cilliers, Jeanne & Fourie, Johan, 2019. "Legacies of Loss: The intergenerational outcomes of slaveholder compensation in the British Cape Colony," Lund Papers in Economic History 197, Lund University, Department of Economic History.
    5. Martins, Igor, 2019. "An Act for the Abolition of the Slave Trade: The Effects of an Import Ban on Cape Colony Slaveholders," African Economic History Working Paper 43/2019, African Economic History Network.
    6. Jeanne Cilliers & Erik Green & Robert Ross, 2023. "Did it pay to be a pioneer? Wealth accumulation in a newly settled frontier society," Economic History Review, Economic History Society, vol. 76(1), pages 257-282, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Maia Güell & Michele Pellizzari & Giovanni Pica & José V. Rodríguez Mora, 2018. "Correlating Social Mobility and Economic Outcomes," Economic Journal, Royal Economic Society, vol. 128(612), pages 353-403, July.
    2. James Bishop & Iris Chan, 2019. "Is Declining Union Membership Contributing to Low Wages Growth?," RBA Annual Conference Papers acp2019-06, Reserve Bank of Australia.
    3. Guido Neidhöfer, 2019. "Intergenerational mobility and the rise and fall of inequality: Lessons from Latin America," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 17(4), pages 499-520, December.
    4. Hirschauer, Norbert & Grüner, Sven & Mußhoff, Oliver & Becker, Claudia & Jantsch, Antje, 2020. "Can p-values be meaningfully interpreted without random sampling?," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 14, pages 71-91.
    5. Johan Fourie & Jan Luiten Zanden, 2013. "GDP in the Dutch Cape Colony: The National Accounts of a Slave-Based Society," South African Journal of Economics, Economic Society of South Africa, vol. 81(4), pages 467-490, December.
    6. Felici, Marco & Fuerst, Franz, 2023. "The heterogeneous relationship of owner-occupied and investment property with household portfolio choice," Journal of Housing Economics, Elsevier, vol. 62(C).
    7. Carol H. SHIUE, 2016. "A Culture of Kinship: Chinese Genealogies as a Souce for Research in Demographic Economics," JODE - Journal of Demographic Economics, Cambridge University Press, vol. 82(4), pages 459-482, December.
    8. Rupert, Peter & Zanella, Giulio, 2018. "Grandchildren and their grandparents' labor supply," Journal of Public Economics, Elsevier, vol. 159(C), pages 89-103.
    9. Jonathan Davis & Bhashkar Mazumder, 2017. "The Decline in Intergenerational Mobility After 1980," Working Paper Series WP-2017-5, Federal Reserve Bank of Chicago, revised 14 Jan 2022.
    10. Fitzenberger, Bernd & Furdas, Marina & Sajons, Christoph, 2016. "End-of-year spending and the long-run employment effects of training programs for the unemployed," Freiburg Discussion Papers on Constitutional Economics 16/08, Walter Eucken Institut e.V..
    11. Bettendorf, Leon J.H. & Jongen, Egbert L.W. & Muller, Paul, 2015. "Childcare subsidies and labour supply — Evidence from a large Dutch reform," Labour Economics, Elsevier, vol. 36(C), pages 112-123.
    12. Hugo Castro-Silva & Francisco Lima, 2023. "The struggle of small firms to retain high-skill workers: job duration and the importance of knowledge intensity," Small Business Economics, Springer, vol. 60(2), pages 537-572, February.
    13. Colagrossi, Marco & d’Hombres, Béatrice & Schnepf, Sylke V, 2020. "Like (grand)parent, like child? Multigenerational mobility across the EU," European Economic Review, Elsevier, vol. 130(C).
    14. Johan Fourie, 2020. "The settlers of South Africa and the expanding frontier," Working Papers 14/2020, Stellenbosch University, Department of Economics.
    15. Johan Fourie & Dieter Fintel, 2014. "Settler skills and colonial development: the Huguenot wine-makers in eighteenth-century Dutch South Africa," Economic History Review, Economic History Society, vol. 67(4), pages 932-963, November.
    16. Ekama, Kate & Fourie, Johan & Heese, Hans & Martin, Lisa-Cheree, 2021. "When Cape slavery ended: Introducing a new slave emancipation dataset," Explorations in Economic History, Elsevier, vol. 81(C).
    17. Martins, Igor & Cilliers, Jeanne & Fourie, Johan, 2023. "Legacies of loss: The health outcomes of slaveholder compensation in the British Cape Colony," Explorations in Economic History, Elsevier, vol. 89(C).
    18. Warnke, Arne Jonas, 2017. "An investigation of record linkage refusal and its implications for empirical research," ZEW Discussion Papers 17-031, ZEW - Leibniz Centre for European Economic Research.
    19. Michael D. Smith & Dennis Wesselbaum, 2023. "Financial inclusion and international migration in low- and middle-income countries," Empirical Economics, Springer, vol. 65(1), pages 341-370, July.
    20. Piraino, Patrizio & Muller, Sean & Cilliers, Jeanne & Fourie, Johan, 2013. "The transmission of longevity across generations: The case of the settler Cape Colony," SALDRU Working Papers 113, Southern Africa Labour and Development Research Unit, University of Cape Town.

    More about this item

    Keywords

    census; machine learning; micro-data; record linkage; panel data; South Africa;
    All these keywords.

    JEL classification:

    • N01 - Economic History - - General - - - Development of the Discipline: Historiographical; Sources and Methods
    • C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sza:wpaper:wpapers299. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Melt van Schoor (email available below). General contact details of provider: https://edirc.repec.org/data/desunza.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.