Identification and Anonymization of Named Entities in Unstructured Information Sources for Use in Social Engineering Detection

arXiv cs.AI

arXiv:2604.09016v1

Abstract: This study addresses the challenge of creating datasets for cybercrime analysis while complying with the requirements of regulations such as the General Data Protection Regulation (GDPR) and Organic Law 10/1995 of the Penal Code. To this end, a system is proposed for collecting information from the Telegram platform, including text, audio, and images; the implementation of speech-to-text transcription models incorporating signal enhancement techn

Published 13 Apr 2026