Mining Social Media

Finding Stories in Internet Data


Mining Social Media
Mining Social Media

Book Details

Author Lam Thuy Vo
Publisher No Starch Press
Published 2019
Edition 1
Paperback 208 pages
Language English
ISBN-13 9781593279165
ISBN-10 1593279167
License Open Access

Book Description

Did fake Twitter accounts help sway a presidential election? What can Facebook and Reddit archives tell us about human behavior? In Mining Social Media, senior BuzzFeed reporter Lam Thuy Vo shows you how to use Python and key data analysis tools to find the stories buried in social media.

Whether you're a professional journalist, an academic researcher, or a citizen investigator, you'll learn how to use technical tools to collect and analyze data from social media sources to build compelling, data-driven stories.

Learn how to:
- Write Python scripts and use APIs to gather data from the social web;
- Download data archives and dig through them for insights;
- Inspect HTML downloaded from websites for useful content;
- Format, aggregate, sort, and filter your collected data using Google Sheets;
- Create data visualizations to illustrate your discoveries;
- Perform advanced data analysis using Python, Jupyter Notebooks, and the pandas library;
- Apply what you've learned to research topics on your own;

Social media is filled with thousands of hidden stories just waiting to be told. Learn to use the data-sleuthing tools that professionals use to write your own data-driven stories.


This book is published as open-access, which means it is freely available to read, download, and share without restrictions.

If you enjoyed the book and would like to support the author, you can purchase a printed copy (hardcover or paperback) from official retailers.

Download and Read Links

PDF

Share This Book

[localhost]# find . -name "*Similar_Books*"


Data Science at the Command Line, 2nd Edition

Unix

This thoroughly revised guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You'll learn how to combine small yet powerful command-line tools to quickly obtain, scrub, explore, and model your data. To get you started, author Jeroen Janssens provides a Docker image packed wit

R for Data Science

R Analysis

Learn how to use R to turn raw data into insight, knowledge, and understanding. This book introduces you to R, RStudio, and the tidyverse, a collection of R packages designed to work together to make data science fast, fluent, and fun. Suitable for readers with no previous programming experience, R for Data Science is designed to get you doing data

Managing Cloud Native Data on Kubernetes

Kubernetes Cloud

Is Kubernetes ready for stateful workloads? This open source system has become the primary platform for deploying and managing cloud native applications. But because it was originally designed for stateless workloads, working with data on Kubernetes has been challenging. If you want to avoid the inefficiencies and duplicative costs of having separa

Python Data Science Handbook

Python Pandas

For many researchers, Python is a first-class tool mainly because of its libraries for storing, manipulating, and gaining insight from data. Several resources exist for individual pieces of this data science stack, but only with the Python Data Science Handbook do you get them all - IPython, NumPy, Pandas, Matplotlib, Scikit-Learn, and other relate

Critical Data Literacy

Analysis

A short course for students to increase their proficiency in analyzing and interpreting data visualizations. By completing this short course students will be able to explain the importance of data literacy, identify data visualization issues in order to improve their own skills in data story-telling. The intended outcome of this course is to help s

Text Mining with R

R NLP Twitter

Much of the data available today is unstructured and text-heavy, making it challenging for analysts to apply their usual data wrangling and visualization tools. With this practical book, you'll explore text-mining techniques with tidytext, a package that authors Julia Silge and David Robinson developed using the tidy principles behind R packages li