Estimating Korean Residence Registration Numbers from Public Information on SNS

Daeseon CHOI  Younho LEE  Yongsu PARK  Seokhyun KIM  

IEICE TRANSACTIONS on Communications   Vol.E98-B   No.4   pp.565-574
Publication Date: 2015/04/01
Online ISSN: 1745-1345
DOI: 10.1587/transcom.E98.B.565
Type of Manuscript: PAPER
Category: Fundamental Theories for Communications
personal data security,  SNS security,  Korean residence registration number,  data estimation,  security,  

Full Text: PDF(2.1MB)>>
Buy this Article

People expose their personal information on social network services (SNSs). This paper warns of the dangers of this practice by way of an example. We show that the residence registration numbers (RRNs) of many Koreans, which are very important and confidential personal information analogous to social security numbers in the United States, can be estimated solely from the information that they have made open to the public. In our study, we utilized machine learning algorithms to infer information that was then used to extract a part of the RRNs. Consequently, we were able to extract 45.5% of SNS users' RRNs using a machine learning algorithm and brute-force search that did not consume exorbitant amounts of resources.