Rough set based intelligent approach for identification of H1N1 suspect using social media

Authors

  • Vinay Kumar Jain Jaypee University of Engineering & Technology, Guna (MP) India
  • Shishir Kumar Jaypee University of Engineering & Technology, Guna (MP) India

Keywords:

Twitter, Swine flu, Influenza, H1N1, Rough Sets, Text Classification

Abstract

Social media data offer unique challenges and opportunities for monitoring and surveillance of public health. The identification of epidemic suspect depends on doctor’s experience, symptoms and laboratory tests. Delay in identifying the beginning of infectious epidemic results in a big damage to a society. To handle the cases of epidemic effectively, a low-cost, accurate and timely diagnosis system is needed. An intelligent technique based on Rough set theory for identifying suspect of H1N1 using social media, has been presented in this paper.

Classification of symptoms from the dataset has been performed using machine learning techniques. From the large number of symptom attributes mined from the dataset, H1N1 related symptom attributes, have been extracted. These extracted attributes contribute maximum to the decision-making process. Rough set theory has been used to evaluate significant attributes (symptoms) from symptom attribute set by generating reducts using indiscernibility relation. Identification of suspects is performed using significant conditional attributes and dependency rules generated from reducts. The utilization of presented social media based medical decision support system turn out to be an effective approach to assist government and health agencies in decision-making.

Author Biographies

Vinay Kumar Jain, Jaypee University of Engineering & Technology, Guna (MP) India

Ph.D. Scholar, Department of Computer Science & Engineering

Shishir Kumar, Jaypee University of Engineering & Technology, Guna (MP) India

Professor & Head (CSE), Jaypee University of Engineering & Technology, Guna (MP) IndiaJaypee University of Engineering & Technology, Guna (MP) India

References

Apollo Health Services (2015). Apollo Health Services, India

(http://www.apollohealthcity.com /swine-flu/) [Accessed: 25-Sep-

.

Aramaki E., Maskawa S. & Morita M. (2011). Twitter catches the

Flu: detecting influenza epidemics using Twitter. Proceedings of the

Conference on Empirical Methods in Natural Language Processing,

Association for Computational Linguistics, Stroudsburg, PA, USA.

Bodnar, T. & Salathé, M. (2013).Validating models for disease

detection using Twitter. International World Wide Web Conference

Companion. Rio de Janeiro, Brazil.

CDCP (2015). Centers for Disease Control and Prevention (http://

www.cdc.gov /h1n1flu/qa.htm)[Accessed: 25-Sep-2015]

Ceron, A. C. (2013). Every tweet counts? How sentiment analysis

of social media can improve our knowledge of citizens’ political

preferences with an application to Italy and France. New Media &

Society, 16(2):340-358.

Chew, C. M. (2010). Pandemics in the age of twitter: A content

analysis of the 2009 h1n1 outbreak. Master’s thesis, University of

Toronto.

Emarketer (2015). Emarketer (http://www.emarketer.com)

[Accessed : 25-Sep-2015].

Glik, D. (2007). Risk communication for public health emergencies.

Annual Reviews of Public Health, 28:33-54.

Hassanien, A. E. & Ali, J.M.H. (2004). Rough set approach

for generation of classification rules of breast cancer data.

INFORMATICA, 15(1):23–38.

Hu, X., Lei, T. & Liu, H. (2011). Enhancing accessibility

of microblogging messages using semantic knowledge.20th

ACM International Conference on Information and Knowledge

Management, New York, NY, USA.

Hvidsten, T. R. (2013). A tutorial-based guide to the ROSETTA

system: A Rough Set Toolkit for Analysis of Data.

Jain,V. K. & Kumar, S. (2015). An effective approach to track

levels of influenza-A (H1N1) Pandemic in India Using Twitter.

Procedia Computer Science, 70(1):801–807.

Jia,X., Shang, L., Zhou, B. & Yao, Y. (2016). Generalized

attribute reduction in Rough set theory. Knowledge-Based Systems,

(1):204–218.

Jiye, Li. (2007). Rough set based rule evaluations and their

applications, Ph.D Thesis, University of Waterloo.

Khan, K., Ullah, A. & Baharudin, B. (2016). Pattern and semantic

analysis to improve unsupervised techniques for opinion target

identification. Kuwait Journal of Science, 43(1):129-149.

Kumara, S. U. & Inbaranib, H. H. (2015). A novel neighborhood

rough set based classification approach for medical diagnosis.

Procedia Computer Science, 47(1):351–359.

Lampos, V. & Cristianini, N. (2010). Tracking the flu pandemic

by monitoring the social web.In 2nd IAPR Workshop on Cognitive

Information Processing, IEEE Press.

Miao, D., Duan, Q., Zhang, H. & Jiao, N. (2009). Rough set

based hybrid algorithm for text classification. Expert Systems with

Applications, 36(5):9168–9174.

Moses, D. & Deisy, C. (2015). A survey of data mining algorithms

used in cardiovascular disease diagnosis from multi-lead ECG data.

Kuwait Journal of Science, 42(2):206-235.

Nallamuth, R. & Palanichamy, J. (2015). Optimized construction

of various classification models for the diagnosis of thyroid

problems in human beings. Kuwait Journal of Science, 42(2):189-

NHP(2015). National Health Portal of India,(http://www.nhp.gov.

in/diseaseaz/s/swineflu) [Accessed : 25-Sep-2015]

Pandey, P., Kumar, S. & Srivastava, S. (2013). Forecasting using

Fuzzy time series for diffusion of innovation: Case of Tata Nano car

in India. National Academy Science Letters, 36(3):299-309.

Pang, B. & Lee, L. (2008). Opinion mining and sentiment analysis.

Foundations and Trends in Information Retrieval, 2(1):1–135.

Parker, P., Wei, Y., Yates, A., Frieder , O. & Goharian, N. (2013).

A framework for detecting public health trends with Twitter.IEEE/

ACM International Conference on Advances in Social Networks

Analysis and Mining, Niagara Falls, ON, Canada.

Pawlak, Z. (1982). Rough sets. International Journal of Information

and Computer Science. 11(5):341-356.

Pawlak, Z. (1991). Rough sets: theoretical aspects of reasoning

about data. Theory and decision library.Kluwer Academic

Publishers. Norwell, MA, USA.

Peng, Y., Kou, G., Shi, Y. & Chen, Z.X. (2008). A descriptive

framework for the of data mining and knowledge discovery.

International Journal of Information Technology & Decision

Making, 7(4):639-682.

PIB(2015). Preventive measures for Swine flu(http://pib.nic.in/

newsite/PrintRelease.aspx? relid=115710) [Accessed: 15-Aug-

.

Qian, Y. H., Liang, J.Y., Li, D.Y. et al. (2008). Measures for

evaluating the decision performance of a decision table in rough set

theory. Information Sciences, 178(1):181-202.

Santos, J.C. & Matos, S. (2014). Analysing Twitter and web

queries for flu trend prediction. Theoretical Biology and Medical

Modelling, 11(1):1-11.

Stewart, A. & Diaz, E. (2012). Epidemic intelligence: for the

crowd, by the crowd. In Proceedings of the 12th international

conference on Web Engineering, Berlin.

Tay, F.E.H. & Shen, L. (2002). Economic and financial prediction

using Rough sets model.European Journal of Operational Research,

(3):641–659.

Tripathy, B. K., Acharjya, D. P. & Cynthya, V. (2011). A

framework for intelligent medical diagnosis using rough set

with formal concept analysis. International Journal of Artificial

Intelligence & Applications, 2(2):1-14.

Twitter (2015). Twitter Developer Page (https://dev.twitter.com/

docs/api/1/get/search) [Accessed: 25-June-2015].

Downloads

Published

02-05-2018