Breaking the HISCO Barrier: Automatic Occupational Standardization with OccCANINE
Paper
•
2402.13604
•
Published
OccCANINE is a version of CANINE which has been finetuned to automatically convert occupational descriptions into standardized HISCO codes using a CANINE model. This tool facilitates historical occupational data analysis with over 90% accuracy across 13 languages.
See more on: GitHub.com/christianvedels/OccCANINE
Read the paper on arXiv: https://arxiv.org/abs/2402.13604
Developed at the University of Southern Denmark by Christian Møller Dahl, Torben Johansen and Christian Vedel with contributions from various sources.