Apache Spark Streaming Tutorial: Identifying Trending Twitter Hashtags
Social networks are among the biggest sources of data today, which makes them an extremely valuable asset for marketers, big data specialists, and individual users such as journalists and other professionals. Harnessing real-time Twitter data is also useful in many time-sensitive business processes. In this article, Toptal Freelance Software Engineer Hanee' Medhat explains how you can build a simple Python application that leverages the power of Apache Spark, and then use it to read and process tweets to identify trending hashtags.

Author: Hanee' Medhat Shousha. A certified Spark developer with a CEng degree and a business intelligence diploma, Hanee' has built enterprise apps with millions of daily users.

Nowadays, data is growing and accumulating faster than ever before. Around 90% of all the data in our world was generated in the last two years alone. ...