Skip to contents

This function takes a dataframe containing text data and returns a dataframe with the `top_n` n-grams and their corresponding exemplars

Usage

ngram_exemplars(
  df,
  top_n = 50,
  n = 2,
  text_var = message,
  url_var = permalink,
  sentiment_var = sentiment,
  date_var = date,
  author_var = sender_screen_name,
  platform_var = social_network
)

Arguments

df

Data frame or tibble object

top_n

Top n n-grams to output exemplars for

n

value of n in n-gram e.g. 2 = bigram

text_var

Name of your text variable

url_var

Name of your url variable

sentiment_var

Name of your sentiment variable

date_var

Name of your date variable

author_var

Name of the author variable e.g. 'sender_screen_name'

platform_var

Name of the platform variable e.g. 'social_network'

Value

A data frame with exemplar posts

Examples

# Example data
test_data <- tibble::tibble(
  message = c("This is a test message", "Another test message", "Yet another message"),
  permalink = c("http://test1.com", "http://test2.com", "http://test3.com"),
  sentiment = c("positive", "negative", "neutral"),
  date = as.Date(c("2022-01-01", "2022-01-02", "2022-01-03"))
)
# Using ngram_exemplars function
result <- ngram_exemplars(test_data)