その他

基本情報

氏名 橋本 隆子
氏名(カナ) ハシモト タカコ
氏名(英語) HASHIMOTO, Takako
所属 商経学部
職名 教授
researchmap研究者コード
researchmap機関

翻訳書、学会発表、講演、作品等の名称

Topic Extraction from Millions of Tweets using Singular Value Decomposition and Feature Selection

単・共の別

共著

発行又は発表の年月

2015/12

発表学会等の名称

Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) 2015

概要

To overcome the scalability problem for big data analysis, in this paper, high performance Singular Vector Decomposition (SVD) library redsvd has been used to identify topics over time from the huge data set of over two hundred million tweets sent in the 21 days following the Great East Japan Earthquake. While we begin with word count vectors of authors and words for each time slot (in our case, every hour), authors’ clusters from each slot are extracted by SVD and k-means. And then, the original fast feature selection algorithm named CWC has been used to extract discriminative words from each cluster. As aresult, authors’ clusters recognized as topics as well as issues of conventional social media analysis method for big data can be visualized overcoming the scalability problem.

担当授業科目1

担当授業科目2

担当授業科目3