Group_5 Mis Assignment 3.docx

  • Uploaded by: yogendra choukiker
  • 0
  • 0
  • November 2019
  • PDF

This document was uploaded by user and they confirmed that they have the permission to share it. If you are author or own the copyright of this book, please report to us by using this DMCA report form. Report DMCA


Overview

Download & View Group_5 Mis Assignment 3.docx as PDF for free.

More details

  • Words: 912
  • Pages: 7
MANAGEMENT INFORMATION SYSTEMS

PRM 39- TERM III ASSIGNMENT-3

Twitter as a Structured Information System Submitted to: Prof H.K Misra

Submitted by: Group 5 Manoj (P39142) Suraj (P39171) Yogendra (P39176)

1

E-R modelling ER Model is used to model the logical view of the system from data perspective which consists of these components:

ER diagrams are related to data structure diagrams, which focus on the relationships of elements within entities instead of relationships between entities themselves. ER diagrams also are often used in conjunction with data flow diagrams, which map out the flow of information for processes or system.

Flow chart

2

A framework for Intelligent twitter data analysis with nonnegative matrix factorization: The purpose of the framework is to allow users to explore a collection of tweets by showing topics with relevance. In this way, it is easy to detect groups of tweets related to new technologies, events and other topics that are automatically discovered.

Methodology: The framework is based on a three-stage process. The first stage is dataset creation by transforming a collection of tweets in a dataset according to the Vector Space Model. The second stage is the core of the framework, is cantered on the use of Nonnegative Matrix Factorizations (NMF) for extracting human-interpretable topics from tweets that are eventually clustered. The number of topics can be user-defined by applying Subtractive Clustering as a preliminary step before factorization. Cluster analysis and word-cloud visualization are used in the last stage to enable intelligent data analysis. Findings: We applied the framework to a case study of three collections of tweets both with manual and automatic selection of the number of topics. Given the high sparsity of Twitter data, we also investigated the influence of different initializations mechanisms for NMF on the factorization results. Numerical comparisons could be used for clustering as it is comparable to kmeans cluster. Visual inspection of the word-clouds allowed a qualitative assessment of the results that confirmed the expected outcomes. Originality The proposed framework enables a collaborative approach between users and computers for an intelligent analysis of Twitter data. Users are faced with interpretable descriptions of tweet clusters, which can be interactively refined with few adjustable parameters. The resulting clusters can be used for intelligent selection of tweets, as well as for further analytics concerning the impact of products, events, etc.

3

in the social network.

4

.

5

they find interesting. Such lists that are created for personal convenience do not gain the attention of people. This implies that most of them do not have any subscribers. Furthermore, list names are not unique; there can be thousands of lists with similar (or even same) names (Kim, Jo, and Moon 2010). This further exacerbates the problem of finding genuine, authoritative and topically relevant set of lists. In this paper, we propose two recommendation models that recommend lists for Twitter users based on their personalized interest. Our first model, called the ListRec, captures and models the users’ interest based on a combination of content, network and trendiness based measures. For users with rich tweet history, we measure their interests using the topics derived from their tweets. Unlike the existing studies, we view the twitterer’s interest as a temporally varying feature and exploit this variation using an exhaustive set of streaming tweets to dynamically model the users’ interest. For users with sparse tweet history, we project the user space into a followee space and utilize the followee’s list subscriptions to indirectly measure the interest of the users. We also add a new trend based score that measures the popularity of lists in the Twitter domain. The final score is then modeled as a linear combination of these three individual scores (based on content, network, and popularity) to effectively measure the interests of the users and personalize list recommendation. The coefficients in this linear combination are estimated using a cyclic ridge regression estimation approach. Our experimental results show that the ListRec outperforms other competing state of the art methods. Our second model is the LIST-PAGERANK which will recommend lists that are popular and are more (topically) authoritative than the lists that are currently subscribed by the users. To the best of our knowledge, there are no studies that use Twitter lists for personalized recommendation. We summarize the major contributions of this paper as follows: a. We propose a recommendation framework called ListRec that recommends Twitter lists based on the personalized interest of twitterers. Unlike the existing studies that recommend external information like news articles and blogs, our work is purely domain-specific. b. The interests of users are modeled using a combination of weighting schemes: (a) a content based scheme that models the users’ interest based on temporally varying topics; (b) a network based scheme that uses the followeenetwork of the users to overcome the tweet sparsity; and (c) a trendiness based scheme that is based on the popularity of the lists. c. We propose a LIST-PAGERANK based algorithm that leverages the network structure of Twitter lists to recom6

mend authoritative lists that match the topical interest of the users. The rest of this paper is organized as follows. We begin by describing the modeling of ListRec in Section 2. Section 3 describes the creation of the list network and formulation of the LIST-PAGERANK. Section 4 will show the results of our experiments and explain the data collection methodology. Section 5 discusses the related work on this topic. Finally, the conclusions obtained through this study are presented in Section 6

7

Related Documents

Mis Assignment
July 2020 16
Sap- Mis Assignment
July 2020 11
Assignment - 1 Mis
June 2020 10
Assignment Mis 2
June 2020 15
Mis 2nd Assignment
June 2020 11

More Documents from ""