Tuesday, November 19, 2019

Who Are the Top HR Analytics Influencers on Twitter

Everyday people use social media such as Twitter to share thoughts and ideas. People with similar interests come together and interact on the online platform by re-sharing or replying posts they like. By studying how people interact on social networks, it will help us understand how information is distributed and identify who are the most prominent figures.
In our last post, we did a topic modeling study using Twitter feeds #HRTechConf and trained a model to learn the topics of all the tweets. In this article, we will analyze Twitter user interactions and visualize it in an interactive graph. Here is the link to the interactive graph web page.
Social Network is a network of social interactions and personal relationships.
Oxford Dictionary
I use Python 3.6 and the following packages:
  • Tweepy: a Python library for accessing Twitter API .
  • NetworkX: a Python package for studying structure of complex networks .
  • PyVis: a Python library for creating and visualizing interactive network graphs.
If you are interested in Organizational Network Analysis, check this article we wrote.

Data Gathering

We consider there is a Twitter user interaction when a tweet is retweeted or replied, or a Twitter user gets mentioned in another user’s tweet. The idea is that someone has more influence on the topic of HRanalytics if their tweets or name appear more frequently in the network.
In the following retweet example, we consider there is an interaction between Twitter user Sonianasher and HRCurator.
RT @HRCurator: How Bosch Uses Gamification to Build #HRAnalytics Skills (Case Study) https://t.co/8IjyO1HdUe @DigitalHRTech @AnalyticsinHR…
Sonianasher

Graph Modeling

A social network e.g. Twitter graph consists of nodes and edges. Nodes are Twitter account, and edges are interactions between Twitter users.
Also, it is very likely that a Twitter user A has some influence on user B if B retweets, replies or mentions A’s tweet. Therefore, Twitter network is a directed graph. In the example above, user Sonianasher is not only connected with HRCurator but also is “influenced” by HRCurator ‘s tweet.
import networkx as nx
graph = nx.DiGraph()
To construct our Twitter network, we use NetworkX, a Python package for studying structure of complex networks.
degrees = [(node, val) for (node, val) in graph.degree()]
degrees_df = pd.DataFrame(degrees, columns=['node', 'degree']).sort_values(
        by='degree',ascending=False)
degrees_df.head(10)
All interactions in the 333 retrieved tweets are added to the directed graph. Let’s see how well our Twitter network is connected.
degrees_df.describe()
Twitter account HRCurator has most interactions (40) and martinhoyes has 15.
nx.number_connected_components(graph.to_undirected())22
Wow, we have 22 disconnected sub-graphs and that is not good. We are mostly interested in a large fully connected graph.
nodes = max(nx.connected_component_subgraphs(graph.to_undirected()), key=len)
largest_subgraph = graph.subgraph(nodes)
print(nx.info(largest_subgraph))
Name:
Type: DiGraph
Number of nodes: 84
Number of edges: 100
Average in degree: 1.1905
Average out degree: 1.1905
The largest sub-graph has 84 nodes and 100 edges. Let’s make sure all nodes are connected in this graph.
nx.number_connected_components(largest_subgraph.to_undirected())1
Great! It has only one connected graph.

Network Visualization

node_degree = largest_subgraph.degree()
pos = nx.spring_layout(largest_subgraph, k=0.4)
plt.figure(figsize=(18,16))
nx.draw(largest_subgraph, pos=pos, 
        linewidths=0.2, node_color=range(len(node_degree)), 
        node_size=60, alpha=0.6, with_labels=True)
nx.draw_networkx_nodes(largest_subgraph, pos=pos, node_size=10, alpha=0.3)
plt.show()
Now, we can plot our Twitter interaction network.
Twitter Network
Same network in a circular layout.
Twitter Network — circular layout
It appears that many arrows are pointing to node HRCurator and martinhoyes, which confirms what is discovered earlier in our analysis above that user HRCurator and martinhoyes have the most influence in this network.
from pyvis.network import Network
net = Network(height="850px", width="100%", bgcolor="#222222", 
              font_color="white", directed=True)
net.from_nx(largest_subgraph)
net.show("interactive_graph.html")
It is cool but hard to visualize connections of each node. No worries. We use PyVis library to build an interactive web graph that allows user dragging, hovering, and selecting nodes and edges. Here is the link to the interactive graph web page.
Interactive Twitter Network

Closing Notes

If you have any questions or feedback, feel free to leave a comment.
Happy Machine Learning!

Originally published at https://ai-journey.com on November 17, 2019.

Towards Data Science

Sharing concepts, ideas, and codes.

You're following Towards Data Science.

You’ll see more from Towards Data Science across Medium and in your inbox.

No comments:

Must Watch YouTube Videos for Databricks Platform Administrators

  While written word is clearly the medium of choice for this platform, sometimes a picture or a video can be worth 1,000 words. Below are  ...