MinUCS
Mining User-Generated Content for Security
9 December 2009 - Venice, Italy
Accepted Papers
- Automating Financial Surveillance.
- Milosavljevic M, Delort JY, Hachey B, Arunasalam B, Radford W, Curran J. (Submission 14.)
- Monitoring Social Attitudes using Rectitude Gains.
- Wawer A. (Submission 12.)
- A proposal for a multilingual epidemic surveillance system.
- Lejeune G, Hatmi M, Doucet A, Huttunen S, Lucas N. (Submission 3.)
- Cross-Lingual Analysis of Concerns and Reports on Crimes in Blogs.
- Nakasaki H, Abe Y, Utsuro T, Kawada Y, Fukuhara T, Kando N, Yoshioka M, Nakagawa H, Kiyota Y. (Submission 9.)
- Automated Event Extraction in the Domain of Border Security.
- Atkinson M, Piskorski N, Tanev H, van der Goot E, Yangarber R, Zavarella V. (Submission 15.)
- Security Level Classification of Confidential Documents written in Turkish.
- Alparslan E., Bahsi H. (Submission 4.)
- Signaling Events in Text Streams.
- Schuhmacher J., Koster C. (Submission 2.)
The Workshop will be a full-day event, on December 9th.
(Program to be announced soon, approximately 09:00-17:00)
Workshop Theme
The vast and growing amount of user-generated textual content, including online news streams, blogs, electronic encyclopedias (e.g., the Wikipedia), and other openly accessible and dynamically changing data readily available on the Web has led to the emergence of new approaches to extracting valuable, structured, and previously unknown information from such data. The aim of this workshop is to bring together researchers from academia and industry who develop technologies for mining open-source user-generated textual data on the Web, as well as end-users interested in exploiting such technologies for knowledge discovery. The emphasis is placed on large-scale text mining systems and application-oriented approaches to processing on-line textual content in the context of security-related applications. Examples of such applications include:- global medical and epidemic surveillance,
- conflict early warning,
- early detection of man-made or environmental hazards,
- risk assessment,
- border surveillance,
- cross-border crime detection,
- terrorism,
- other applications relevant for public health, security, and law enforcement institutions
Topics of interest
- Mining from news streams, blogs, document repositories, and other openly accessible and dynamically changing data, including Web 2.0 content, for the purpose of identifying threats to security or public health,
- Emphasis on multilingual approaches, and work on languages other than English,
- Applications, such as information extraction, classification, summarization, sentiment detection, event detection, event forecasting, trend detection, information fusion, and more,
- Contributions in the form of applications (working systems and prototypes) as well as theoretical results are welcome,
- Application domains include crisis-related event reporting, political and environmental analysis, and medical intelligence, under the general umbrella of the security intelligence domain,
- Methods including machine learning, rule-based, and hybrid approaches.
Workshop Format
We invite papers addressing primarily the language technology, natural language processing, data mining and information retrieval communities, as well as the relevant end-user groups. Submissions are invited in two categories:- (a) regular: 6 pages long, research papers presenting novel approaches and solutions, and
- (b) short (posters): 4 pages long, system demonstrations, descriptions, and work in progress.
