Text Mining with R: A Tidy Approach

"O'Reilly Media, Inc."
Free sample

Much of the data available today is unstructured and text-heavy, making it challenging for analysts to apply their usual data wrangling and visualization tools. With this practical book, you’ll explore text-mining techniques with tidytext, a package that authors Julia Silge and David Robinson developed using the tidy principles behind R packages like ggraph and dplyr. You’ll learn how tidytext and other tidy tools in R can make text analysis easier and more effective.

The authors demonstrate how treating text as data frames enables you to manipulate, summarize, and visualize characteristics of text. You’ll also learn how to integrate natural language processing (NLP) into effective workflows. Practical code examples and data explorations will help you generate real insights from literature, news, and social media.

  • Learn how to apply the tidy text format to NLP
  • Use sentiment analysis to mine the emotional content of text
  • Identify a document’s most important terms with frequency measurements
  • Explore relationships and connections between words with the ggraph and widyr packages
  • Convert back and forth between R’s tidy and non-tidy text formats
  • Use topic modeling to classify document collections into natural groups
  • Examine case studies that compare Twitter archives, dig into NASA metadata, and analyze thousands of Usenet messages
Read more

About the author

Julia Silge is a data scientist at Stack Overflow; her work involves analyzing complex datasets and communicating about technical topics with diverse audiences. She has a PhD in astrophysics and loves Jane Austen and making beautiful charts. Julia worked in academia and ed tech before moving into data science and discovering the statistical programming language R.

David Robinson is a data scientist at Stack Overflow with a PhD in Quantitative and Computational Biology from Princeton University. He enjoys developing open source R packages, including broom, gganimate, fuzzyjoin and widyr, as well as blogging about statistics, R, and text mining on his blog, Variance Explained.

Read more
Loading...

Additional Information

Publisher
"O'Reilly Media, Inc."
Read more
Published on
Jun 12, 2017
Read more
Pages
194
Read more
ISBN
9781491981603
Read more
Language
English
Read more
Genres
Computers / Data Visualization
Computers / Databases / Data Mining
Computers / Databases / General
Computers / Intelligence (AI) & Semantics
Computers / Natural Language Processing
Read more
Content Protection
This content is DRM free.
Read more
Read Aloud
Available on Android devices
Read more
Eligible for Family Library

Reading information

Smartphones and Tablets

Install the Google Play Books app for Android and iPad/iPhone. It syncs automatically with your account and allows you to read online or offline wherever you are.

Laptops and Computers

You can read books purchased on Google Play using your computer's web browser.

eReaders and other devices

To read on e-ink devices like the Sony eReader or Barnes & Noble Nook, you'll need to download a file and transfer it to your device. Please follow the detailed Help center instructions to transfer the files to supported eReaders.
Sources of the African Past combines a case-study approach with an emphasis on primary and orally transmitted sources to accomplish three objectives; to tell a story in some depth, to portray major themes and to raise basic questions of analysis and interpretation. The case studies are set in the nineteenth century and deal with critical periods in the fortunes of five societies in different parts of the continent (South, East, and West Africa). The authors wish students to work with the "raw" materials of history and to that end have provided a workbook for a "laboratory" experience.

Sources of the African Past is designed for use in a wide variety of courses and in conjuction with other texts. The authors have kept their own interpretations to a minimum and invited scrutiny of their decision of selection and arrangement. They chose the cases on the basis of several criteria: geographical coverage, abundance and diversity of primary sources, importance in the secondary literature, and relevance to important historical problems. All the studies emphasize political change. All witness some growth in European intervention.

In selecting the documents, the authors sought a balance of perspective without sacrificing accuracy and relevance. This means a conscious effort to present a variety of views: African and European, internal and external, partipant and observer, those of the victims as well as those of the victors, those of the "people" as well as those of the elite. Within the limitations of space, they have made the excerpts sufficiently long to allow the reader to examine the author's style, purpose and other characteristics. Keeping in mind the limitations of libraries, they have attemted to make each chapter self-contained.

Sources of the African Past combines a case-study approach with an emphasis on primary and orally transmitted sources to accomplish three objectives; to tell a story in some depth, to portray major themes and to raise basic questions of analysis and interpretation. The case studies are set in the nineteenth century and deal with critical periods in the fortunes of five societies in different parts of the continent (South, East, and West Africa). The authors wish students to work with the "raw" materials of history and to that end have provided a workbook for a "laboratory" experience.

Sources of the African Past is designed for use in a wide variety of courses and in conjuction with other texts. The authors have kept their own interpretations to a minimum and invited scrutiny of their decision of selection and arrangement. They chose the cases on the basis of several criteria: geographical coverage, abundance and diversity of primary sources, importance in the secondary literature, and relevance to important historical problems. All the studies emphasize political change. All witness some growth in European intervention.

In selecting the documents, the authors sought a balance of perspective without sacrificing accuracy and relevance. This means a conscious effort to present a variety of views: African and European, internal and external, partipant and observer, those of the victims as well as those of the victors, those of the "people" as well as those of the elite. Within the limitations of space, they have made the excerpts sufficiently long to allow the reader to examine the author's style, purpose and other characteristics. Keeping in mind the limitations of libraries, they have attemted to make each chapter self-contained.

©2018 GoogleSite Terms of ServicePrivacyDevelopersArtistsAbout Google|Location: United StatesLanguage: English (United States)
By purchasing this item, you are transacting with Google Payments and agreeing to the Google Payments Terms of Service and Privacy Notice.