Sreekanth Madisetty, PhD

Greater Hyderabad Area Contact Info
768 followers 500+ connections

Join to view profile

About

• Research experience in Machine Learning, Deep Learning, Natural
Language Processing,…

Articles by Sreekanth

Activity

Join now to see all activity

Experience & Education

  • Jio Platforms Limited (JPL)

View Sreekanth’s full experience

See their title, tenure and more.

or

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

Volunteer Experience

  • Reviewer

    IEEE Transactions on Computational Social Systems

  • Student Volunteer

    Enabling Large Scale Data Analytics: From Theoretical Foundations to Practice at IIT Hyderabad

Publications

  • A reranking-based tweet retrieval approach for planned events

    Twitter provides access to latest information. Whenever a major event happens, people try to search for event related information in social media platforms like Twitter. So, it is essential to develop methods to get good quality of event related tweets. People share different opinions, feelings, feedback, etc. about events happening around the world in Twitter in the form of tweets. These tweets are often short and contain noise. So, it is very difficult to get the most relevant data for a…

    Twitter provides access to latest information. Whenever a major event happens, people try to search for event related information in social media platforms like Twitter. So, it is essential to develop methods to get good quality of event related tweets. People share different opinions, feelings, feedback, etc. about events happening around the world in Twitter in the form of tweets. These tweets are often short and contain noise. So, it is very difficult to get the most relevant data for a given event from Twitter. We propose a two-phase approach to retrieve the tweets related to planned events. In the first phase, initial retrieval is done by using BM25 algorithm. In the second phase, reranking is done by combining three scoring mechanisms namely BM25 score, top hashtags score related to an event, and top TF-IDF terms score related to an event. A learning to rank algorithm SVM_Rank is applied to give weights to these three methods and combine them to get the final score of the tweet. We performed experiments on two benchmark datasets CLEF and TREC. Experimental results show that our method outperforms baseline and literature methods for both the datasets according to multiple evaluation metrics.

    See publication
  • Social Media Popularity Prediction of Planned Events Using Deep Learning

    Early prediction of popularity is crucial for recommendation of planned events such as concerts, conferences, sports events, performing arts, etc. Estimation of the volume of social media discussions related to the event can be useful for this purpose. Most of the existing methods for social media popularity prediction focus on estimating tweet popularity i.e. predicting the number of retweets for a given tweet. There is less focus on predicting event popularity using social media. We focus on…

    Early prediction of popularity is crucial for recommendation of planned events such as concerts, conferences, sports events, performing arts, etc. Estimation of the volume of social media discussions related to the event can be useful for this purpose. Most of the existing methods for social media popularity prediction focus on estimating tweet popularity i.e. predicting the number of retweets for a given tweet. There is less focus on predicting event popularity using social media. We focus on predicting the popularity of an event much before its start date. This type of early prediction can be helpful in event recommendation systems, assisting event organizers for better planning, dynamic ticket pricing, etc. We propose a deep learning based model to predict the social media popularity of an event. We also incorporate an extra feature indicating how many days left to the event start date to improve the performance. Experimental results show that our proposed deep learning based approach outperforms the baseline methods.

    See publication
  • A Neural Approach for Detecting Inline Mathematical Expressions from Scientific Documents

    Wiley Expert Systems (Accepted)

    Scientific documents generally contain multiple mathematical expressions in them. Detect-
    ing inline mathematical expressions is one of the most important and challenging tasks in
    scientific text mining. Recent works that detect inline mathematical expressions in scientific documents have looked at the problem from an image processing perspective. There is
    little work that has targeted the problem from NLP perspective. Towards this, we define
    a few features and applied Conditional…

    Scientific documents generally contain multiple mathematical expressions in them. Detect-
    ing inline mathematical expressions is one of the most important and challenging tasks in
    scientific text mining. Recent works that detect inline mathematical expressions in scientific documents have looked at the problem from an image processing perspective. There is
    little work that has targeted the problem from NLP perspective. Towards this, we define
    a few features and applied Conditional Random Fields (CRF) to detect inline mathemat-
    ical expressions in scientific documents. Apart from this feature based approach, we also
    propose a hybrid algorithm that combines Bidirectional Long Short Term Memory net-
    works (Bi-LSTM) and feature-based approach for this task. Experimental results suggest
    that this proposed hybrid method outperforms several baselines in the literature and also
    individual methods in the hybrid approach.

  • Event Recommendation using Social Media

    2019 IEEE 35th International Conference on Data Engineering (ICDE)

    Every day, lots of events keep happening around the globe. Examples of events are festivals, concerts, shows, conferences, sports events, movie launches, etc. There are several event recommendation systems that suggest different events to the users. The popularity of an event plays an important role in event recommendation. If the system knows which event is popular in a region, then that can be recommended to more people in that region. However, since events take place at predefined location…

    Every day, lots of events keep happening around the globe. Examples of events are festivals, concerts, shows, conferences, sports events, movie launches, etc. There are several event recommendation systems that suggest different events to the users. The popularity of an event plays an important role in event recommendation. If the system knows which event is popular in a region, then that can be recommended to more people in that region. However, since events take place at predefined location and time it is important to identify the popularity much before the occurrence of the event. Early prediction of future popularity is very crucial for the success of any event recommendation system. We plan to use event related discussions in social media as a signal for estimating the popularity of the events. Towards this task, given meta information (time, date, venue, title etc.) about an event, we first try to identify social media posts related to the event. These posts are then passed through several preprocessing stages in a pipeline to detect spam, identify the presence of emotions and their intensities, etc. The preprocessed posts can then be used along with several other contextual parameters to predict the future popularity of the events. Next, events can be recommended to the users by using a recommendation algorithm.

    See publication
  • Identification of Relevant Hashtags for Planned Events Using Learning to Rank

    Knowledge Discovery, Knowledge Engineering and Knowledge Management. Communications in Computer and Information Science, vol 976. Springer, Cham

    Lots of planned events (e.g. concerts, sports matches, festivals, etc.) keep happening across the world every day. In various applications like event recommendation, event reporting, etc. it might be useful to find user discussions related to such events from social media. Identification of event related hashtags can be useful for this purpose. In this paper, we focus on identifying the top hashtags related to a given event. We define a set of features for (event, hashtag) pairs, and discuss…

    Lots of planned events (e.g. concerts, sports matches, festivals, etc.) keep happening across the world every day. In various applications like event recommendation, event reporting, etc. it might be useful to find user discussions related to such events from social media. Identification of event related hashtags can be useful for this purpose. In this paper, we focus on identifying the top hashtags related to a given event. We define a set of features for (event, hashtag) pairs, and discuss ways to obtain these feature scores. A linear aggregation of these scores is used to finally output a ranked list of top hashtags for the event. The aggregation weights of the features are obtained using a learning to rank algorithm. We establish the superiority of our method by performing detailed experiments on a large dataset containing multiple categories of events and related tweets.

    See publication
  • A Neural Network-Based Ensemble Approach for Spam Detection in Twitter

    IEEE Transactions on Computational Social Systems

    As the social networking sites get more popular, spammers target these sites to spread spam posts. Twitter is one of the most popular online social networking sites where users communicate and interact on various topics. Most of the current spam filtering methods in Twitter focus on detecting the spammers and blocking them. However, spammers can create a new account and start posting new spam tweets again. So there is a need for robust spam detection techniques to detect the spam at tweet…

    As the social networking sites get more popular, spammers target these sites to spread spam posts. Twitter is one of the most popular online social networking sites where users communicate and interact on various topics. Most of the current spam filtering methods in Twitter focus on detecting the spammers and blocking them. However, spammers can create a new account and start posting new spam tweets again. So there is a need for robust spam detection techniques to detect the spam at tweet level. These types of techniques can prevent the spam in real time. To detect the spam at tweet level, often features are defined, and appropriate machine learning algorithms are applied in the literature. Recently, deep learning methods are showing fruitful results on several natural language processing tasks. We want to use the potential benefits of these two types of methods for our problem. Toward this, we propose an ensemble approach for spam detection at tweet level. We develop various deep learning models based on convolutional neural networks (CNNs). Five CNNs and one feature-based model are used in the ensemble. Each CNN uses different word embeddings (Glove, Word2vec) to train the model. The feature-based model uses content-based, user-based, and n-gram features. Our approach combines both deep learning and traditional feature-based models using a multilayer neural network which acts as a meta-classifier. We evaluate our method on two data sets, one data set is balanced, and another one is imbalanced. The experimental results show that our proposed method outperforms the existing methods.

    See publication
  • Aggression Detection in Social Media Using Deep Neural Networks

    Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-2018) at COLING 2018

    With the rise of user-generated content in social media coupled with almost non-existent moderation in many such systems, aggressive contents have been observed to rise in such forums. In this paper, we work on the problem of aggression detection in social media. Aggression can sometimes be expressed directly or overtly or it can be hidden or covert in the text. On the other hand, most of the content in social media is non-aggressive in nature. We propose an ensemble based system to classify an…

    With the rise of user-generated content in social media coupled with almost non-existent moderation in many such systems, aggressive contents have been observed to rise in such forums. In this paper, we work on the problem of aggression detection in social media. Aggression can sometimes be expressed directly or overtly or it can be hidden or covert in the text. On the other hand, most of the content in social media is non-aggressive in nature. We propose an ensemble based system to classify an input post to into one of three classes, namely, Overtly Aggressive, Covertly Aggressive, and Non-aggressive. Our approach uses three deep learning methods, namely, Convolutional Neural Networks (CNN) with five layers (input, convolution, pooling, hidden, and output), Long Short Term Memory networks (LSTM), and Bi-directional Long Short Term Memory networks (Bi-LSTM). A majority voting based ensemble method is used to combine these classifiers (CNN, LSTM, and Bi-LSTM). We trained our method on Facebook comments dataset and tested on Facebook comments (in-domain) and other social media posts (cross-domain). Our system achieves the F1-score (weighted) of 0.604 for Facebook posts and 0.508 for social media posts.

  • A Framework for Real-time Spam Detection in Twitter

    2018 10th International Conference on Communication Systems & Networks (COMSNETS)

    With the increased popularity of online social networks, spammers find these platforms easily accessible to trap users in malicious activities by posting spam messages. In this work, we have taken Twitter platform and performed spam tweets detection. To stop spammers, Google SafeBrowsing and Twitter's BotMaker tools detect and block spam tweets. These tools can block malicious links, however they cannot protect the user in real-time as early as possible. Thus, industries and researchers have…

    With the increased popularity of online social networks, spammers find these platforms easily accessible to trap users in malicious activities by posting spam messages. In this work, we have taken Twitter platform and performed spam tweets detection. To stop spammers, Google SafeBrowsing and Twitter's BotMaker tools detect and block spam tweets. These tools can block malicious links, however they cannot protect the user in real-time as early as possible. Thus, industries and researchers have applied different approaches to make spam free social network platform. Some of them are only based on user-based features while others are based on tweet based features only. However, there is no comprehensive solution that can consolidate tweet's text information along with the user based features. To solve this issue, we propose a framework which takes the user and tweet based features along with the tweet text feature to classify the tweets. The benefit of using tweet text feature is that we can identify the spam tweets even if the spammer creates a new account which was not possible only with the user and tweet based features. We have evaluated our solution with four different machine learning algorithms namely - Support Vector Machine, Neural Network, Random Forest and Gradient Boosting. With Neural Network, we are able to achieve an accuracy of 91.65% and surpassed the existing solution by approximately 18%.

  • An Ensemble Based Method for Predicting Emotion Intensity of Tweets

    International Conference on Mining Intelligence and Knowledge Exploration

    Recently, user generated contents have increased tremendously in social media. Twitter is a popular micro-blogging platform in which users share their feelings, opinions, feedback, etc. It has been observed that microblogs are often associated with emotions. Several studies have focused on assigning a given tweet to one of the available emotion categories (e.g., anger, fear, joy, sadness). It is often useful in applications to find the intensity of emotion in the tweets. The focus on…

    Recently, user generated contents have increased tremendously in social media. Twitter is a popular micro-blogging platform in which users share their feelings, opinions, feedback, etc. It has been observed that microblogs are often associated with emotions. Several studies have focused on assigning a given tweet to one of the available emotion categories (e.g., anger, fear, joy, sadness). It is often useful in applications to find the intensity of emotion in the tweets. The focus on identifying emotion intensity is less in the literature. In this paper, we focus on determining the level of emotion intensity in the tweets. We use an ensemble of three methods: Convolution Neural Networks (CNN) with word embedding features, XGBoost with word n-gram and char n-gram features, and Support Vector Regression (SVR) with lexicon and word embedding features. The final prediction of the given tweet is obtained by the average of predictions of individual methods in the ensemble. The performance of ensemble is better than the methods in the ensemble due to diverse features. Our experimental results outperform baseline methods.

  • Exploiting Meta Attributes for Identifying Event Related Hashtags

    International Conference on Knowledge Discovery and Information Retrieval

    Users in social media often participate in discussions regarding different events happening in the physical world (e.g., concerts, conferences, festivals) by posting messages, replying to or forwarding messages related to such events. In various applications like event recommendation, event reporting, etc. it might be useful to find user discussions related to such events from social media. Finding event related hashtags can be useful for this purpose. In this paper, we focus on the problem of…

    Users in social media often participate in discussions regarding different events happening in the physical world (e.g., concerts, conferences, festivals) by posting messages, replying to or forwarding messages related to such events. In various applications like event recommendation, event reporting, etc. it might be useful to find user discussions related to such events from social media. Finding event related hashtags can be useful for this purpose. In this paper, we focus on the problem of finding relevant hashtags for a given event. Features are defined to identify the event related hashtags. We specifically look for features that use similarities of the hashtags with the event metadata attributes. A learning to rank algorithm is applied to learn the importance weights of the features towards the task of predicting the relevance of a hashtag to the given event. We experimented on events from four different categories (namely, Award ceremonies, E-commerce events, Festivals, and Product launches). Experimental results show that our method significantly outperforms the baseline methods.

  • IITH at CLEF 2017: Finding Relevant Tweets for Cultural Events

    CLEF

    Retrieving relevant tweets corresponding to cultural events
    can be used in various applications like event reporting, event recommendation,
    etc. This type of retrieval is challenging due to short length of the
    tweet, noise, out of vocabulary words, abbreviations in the tweet. In this
    paper, we focus on the problem of retrieving relevant tweets related to
    given cultural event of a festival. We consider several factors like BM25,
    DFR, presence of artist name, relevant hashtag…

    Retrieving relevant tweets corresponding to cultural events
    can be used in various applications like event reporting, event recommendation,
    etc. This type of retrieval is challenging due to short length of the
    tweet, noise, out of vocabulary words, abbreviations in the tweet. In this
    paper, we focus on the problem of retrieving relevant tweets related to
    given cultural event of a festival. We consider several factors like BM25,
    DFR, presence of artist name, relevant hashtag, festival name for finding
    the relevance of tweets to the event. We apply BM25 + DFR model to
    retrieve candidate set of tweets related to each event of a festival. We
    find the top hashtags for each event by exploring meta-attributes of an
    event. We re-rank the initial rank list from BM25 + DFR based on two
    strategies, namely, presence of the event meta-attributes (artist name,
    festival name, title, etc.) and the identified top hashtags in the tweet,
    and based on the timestamp of the event. We experimented on a subset
    of CLEF 2017 cultural microblog contextualization dataset. The experimental
    results show that the proposed method is able to put relevant
    tweets at the top of the retrieval list.

    See publication
  • NSEmo at EmoInt-2017: An Ensemble to Predict Emotion Intensity in Tweets

    WASSA @ EMNLP

    In this paper, we describe a method to predict emotion intensity in tweets. Our approach is an ensemble of three regression methods. The first method uses contentbased features (hashtags, emoticons, elongated words, etc.). The second method considers word n-grams and character ngrams
    for training. The final method uses lexicons, word embeddings, word ngrams, character n-grams for training the model. An ensemble of these three methods gives better performance than individual
    methods. We…

    In this paper, we describe a method to predict emotion intensity in tweets. Our approach is an ensemble of three regression methods. The first method uses contentbased features (hashtags, emoticons, elongated words, etc.). The second method considers word n-grams and character ngrams
    for training. The final method uses lexicons, word embeddings, word ngrams, character n-grams for training the model. An ensemble of these three methods gives better performance than individual
    methods. We applied our method on WASSA emotion dataset. Achieved results are as follows: average Pearson correlation is 0.706, average Spearman correlation is 0.696, average Pearson correlation for gold scores in range 0.5 to 1 is 0.539, and average Spearman correlation for gold scores in range 0.5 to 1 is 0.514.

    See publication

Courses

  • Advanced Data Structures and Algorithms

    -

  • Advanced Machine Learning

    -

  • Linear Optimization

    -

  • Predictive Analytics and Knowledge Discovery

    -

  • Soft Computing

    -

  • Topics in Database Management Systems

    -

Projects

  • Detecting Inline Mathematical Expressions in Scientific Documents

    -

    Scientific documents generally contain multiple mathematical expressions in them. Detecting inline mathematical expressions is one of the most important and challenging tasks in scientific text mining. Recent works that detect inline mathematical expressions in scientific documents have looked at the problem from an image processing perspective. There is little work that has targeted the problem from NLP perspective. Towards this, we define a few features and applied Conditional Random Fields…

    Scientific documents generally contain multiple mathematical expressions in them. Detecting inline mathematical expressions is one of the most important and challenging tasks in scientific text mining. Recent works that detect inline mathematical expressions in scientific documents have looked at the problem from an image processing perspective. There is little work that has targeted the problem from NLP perspective. Towards this, we define a few features and applied Conditional Random Fields (CRF) to detect inline mathematical expressions in scientific documents. Apart from this feature based approach, we also propose a hybrid algorithm that combines Bidirectional Long Short Term Memory networks (Bi-LSTM) and feature-based approach for this task. Experimental results suggest
    that this proposed hybrid method outperforms several baselines in the literature and also individual methods in the hybrid approach.

  • Predicting Local News Potential to go Global

    -

    Nowadays, digital information is increasing rapidly in our society. Journalists find it
    difficult to manage and asses the newsworthiness of these information. Assessing
    newsworthiness of an event is an important and challenging task. Events can be
    broadly classified into local events and global events. Predicting local news potential
    to go global is also a challenging task. Most of the methods in literature apply to
    disaster events and use hand crafted features only to predict…

    Nowadays, digital information is increasing rapidly in our society. Journalists find it
    difficult to manage and asses the newsworthiness of these information. Assessing
    newsworthiness of an event is an important and challenging task. Events can be
    broadly classified into local events and global events. Predicting local news potential
    to go global is also a challenging task. Most of the methods in literature apply to
    disaster events and use hand crafted features only to predict the local news potential
    to go global. In this work, we propose an algorithm to collect the dataset in such
    a way that local news articles posted before the global ones from GDELT. The
    events are filtered by using CAMEO codes which fall under crisis semantics e.g.,
    “Provide aid (07)”, “Threaten (13)”, “Protest (14)”, etc. We develop deep learning
    methods to predict the number of global news outlets that report the local news.
    Our method can also be applied to any event type.

  • Ranking Product Reviews using Feature Preference

    -

    With the growing popularity of online shopping many e-commerce sites have a support for both
    writing and reading reviews. With so many reviews talking about varied features the users may
    often find it difficult to get to the review of their interest it is synonymous to finding a needle in a
    haystack. On many e-commerce sites, product reviews are ranked and then displayed based on
    stars and usefulness of review given by the reviewer to that product, but page does not have…

    With the growing popularity of online shopping many e-commerce sites have a support for both
    writing and reading reviews. With so many reviews talking about varied features the users may
    often find it difficult to get to the review of their interest it is synonymous to finding a needle in a
    haystack. On many e-commerce sites, product reviews are ranked and then displayed based on
    stars and usefulness of review given by the reviewer to that product, but page does not have any
    facility to search for review which is talking about a particular feature of product.

    So we propose a method to rank the reviews based on given features given by user. We are
    considering measures such as reviewer experience, reviewer helpfulness and review
    information content obtained for a particular review to rank them. These factors help the reviews
    relevant to the users to bubble up. Not all users have a clear idea of what feature should they search
    for while looking for a product hence the way they browse through the reviews will be different.
    The way an experienced user browses through the reviews would be different as they have an idea
    of what features are of utmost importance to them. The users may want to look for one particular
    feature or a set of them based on their need. Hence in our work we have zeroed down to three cases
    that will capture most type of user tendencies in searching the reviews. These cases being global
    case where no feature is taken, single feature based search where only a single feature is used
    for searching, and finally a multi-feature based search where the user can choose multiple
    features. In our proposed work we have described how these cases differ from one another and
    how different factors have varied effects on the ranking for these cases.

Honors & Awards

  • Awarded ACM Travel Grant to attend IRIS 2018

    ACM India

  • Awarded Microsoft Research Travel Grant to attend COLING 2018 conference

    Microsoft Research

  • Awarded Research Excellence Award from IIT Hyderabad

    IIT Hyderabad

  • Awarded Second Runner Up Prize in 2018 Honeywell Aerospace Automation Challenge Hackathon

    Honewell India

  • Awarded ACM India Travel Grant to attend Cods CoMAD 2017 conference

    ACM India

  • Awarded Travel Grant to attend Indian Workshop on Machine Learning (iWML),2016 at IIT Kanpur

    IIT Kanpur

  • Prof.M.N. Seetharamanath Endowment Prize

    Andhra University

    Awarded Endownment prize in Master of Technology (M.Tech) for my academic excellence from Andhra University, Visakhapatnam

  • State Bank of Hyderabad Staff Association-cum-Prabhatkar Gold Medal

    Andhra University

    Awarded Gold Medal in Master of Technology (M.Tech) for my academic excellence from Andhra University, Visakhapatnam

  • Certified from IBM in Tivoli Directory Server workshop held at Sree Vidyanikethan Engineering College,Tirupati

    IBM India

  • Certified from IBM in DB2 9 Database and Application Fundamentals workshop held at Sree Vidyanikethan Engineering College, Tirupati

    IBM India

  • Got a Second prize in National level quiz competition TECHQUIZ

    Priyadarshi College of Engineering

Recommendations received

More activity by Sreekanth

View Sreekanth’s full profile

  • See who you know in common
  • Get introduced
  • Contact Sreekanth directly
Join to view full profile

Other similar profiles

Explore collaborative articles

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Explore More

Add new skills with these courses