한국어 일반 개체명과 개인정보 특화 개체명 비교 연구
Comparative Study of General Named Entities and Privacy-Specific Named Entities in Korean
  • 최혜지
  • 강채안
  • 김민선
  • 안수빈
  • 비립
  • 외 2명
Citations

WEB OF SCIENCE

0
Citations

SCOPUS

0

초록

As information technology and artificial intelligence (AI) methods develop, apprehensions over the leaking of personal information during extensive data processing are increasing. To differentiate Personally Identifiable Information (PII) throughout extensive datasets, it is essential to precisely identify and categorize PII, distinct from General Named Entities (GNE). This study examines the prevalence of both categories in authentic interactions by creating a conversation-based dataset annotated with GNE and PII. The analysis indicates that contextual information is crucial for recognizing things associated with 'place,' 'organization,' and 'academic field.' Furthermore, constraints in categorizing things such as 'date' and 'culture' underline the necessity for continual enhancements of currently functioning systems. It also emphasizes the importance for proactive innovations in personal information detection technologies.

키워드

개체명 인식개인정보 탐지개인정보 개체명일반 개체명개인정보 보호Named Entity RecognitionPersonal Information DetectionPersonally Identifiable Information EntitiesGeneral Named EntitiesPrivacy Protection
제목
한국어 일반 개체명과 개인정보 특화 개체명 비교 연구
제목 (타언어)
Comparative Study of General Named Entities and Privacy-Specific Named Entities in Korean
저자
최혜지강채안김민선안수빈비립이종규김한샘
DOI
10.5392/JKCA.2024.24.11.174
발행일
2024-11
저널명
한국콘텐츠학회 논문지
24
11
페이지
174 ~ 192