Detecting drinking-related contents on social media by classifying heterogeneous data types

Omar ElTayeby, Todd Eaglin, Malak Abdullah, David Burlinson, Wenwen Dou, Lixia Yao

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Scopus citations

Abstract

One common health problem in the US faced by colleges and universities is binge drinking. College students often post drinking related texts and images on social media as a socially desirable identity. Some public health and clinical research scholars have surveyed different social media sites manually to understand their behavior patterns. In this paper, we investigate the feasibility of mining the heterogeneous data scattered on social media to identify drinking-related contents, which is the first step towards unleashing the potential of social media in automatic detection of binge drinking users. We use the state-of-the-art algorithms such as Support Vector Machine and neural networks to classify drinking from non-drinking posts, which contain not only text, but also images and videos. Our results show that combining heterogeneous data types, we are able to identify drinking related posts with an overall accuracy of 82%. Prediction models based on text data is more reliable compared to the other two models built on image and video data for predicting drinking related contents.

Original languageEnglish (US)
Title of host publicationAdvances in Artificial Intelligence
Subtitle of host publicationFrom Theory to Practice - 30th International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2017, Proceedings
EditorsMoonis Ali, Salem Benferhat, Karim Tabia
PublisherSpringer Verlag
Pages364-373
Number of pages10
ISBN (Print)9783319600444
DOIs
StatePublished - Jan 1 2017
Event30th International Conference on Industrial, Engineering, and Other Applications of Applied Intelligent Systems, IEA/AIE 2017 - Arras, France
Duration: Jun 27 2017Jun 30 2017

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10351 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other30th International Conference on Industrial, Engineering, and Other Applications of Applied Intelligent Systems, IEA/AIE 2017
CountryFrance
CityArras
Period6/27/176/30/17

Keywords

  • Binge drinking
  • Image classification
  • Machine learning
  • Social media
  • Text classification
  • Video classification

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint Dive into the research topics of 'Detecting drinking-related contents on social media by classifying heterogeneous data types'. Together they form a unique fingerprint.

  • Cite this

    ElTayeby, O., Eaglin, T., Abdullah, M., Burlinson, D., Dou, W., & Yao, L. (2017). Detecting drinking-related contents on social media by classifying heterogeneous data types. In M. Ali, S. Benferhat, & K. Tabia (Eds.), Advances in Artificial Intelligence: From Theory to Practice - 30th International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2017, Proceedings (pp. 364-373). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 10351 LNCS). Springer Verlag. https://doi.org/10.1007/978-3-319-60045-1_38