text_summary module

Created on Mon Aug 5 19:56:19 2019

@author: Daniel

class text_summary.TextSummary(data=None)

Bases: object

A representation of the basic statistics of a set of texts.

data

Input dataframe of texts.

count

Counts of each item type. Has keys texts, words, char, space, letter, digit, emotes, punct.

prop

Overall statistics expressed as fractions (laziness, % of emoji, words per text, verbosity).

occurrence_dicts

Dictionaries of the form {token: count}, where the tokens are words or emotes.

per_text_lists

Per-text statistics: sentiment (polarity and subjectivity), words per text, characters per text, and emotes per text.
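As a rough illustration of the kind of totals the count attribute holds, the sketch below computes most of the listed keys from a plain list of strings. The function name basic_counts and the list-of-strings input are assumptions made for this example, not part of the module, and the emotes key is left out because the module's emote detection is not documented here.

```python
import string


def basic_counts(texts):
    """Return simple totals over a list of strings (hypothetical helper)."""
    joined = "".join(texts)
    return {
        "texts": len(texts),
        # Words are approximated by whitespace-separated chunks.
        "words": sum(len(t.split()) for t in texts),
        "char": len(joined),
        "space": sum(ch.isspace() for ch in joined),
        "letter": sum(ch.isalpha() for ch in joined),
        "digit": sum(ch.isdigit() for ch in joined),
        # Only ASCII punctuation is counted in this sketch.
        "punct": sum(ch in string.punctuation for ch in joined),
    }


counts = basic_counts(["Hello there!", "ok 123"])
# A fraction-style statistic such as words per text follows from the totals.
words_per_text = counts["words"] / counts["texts"]
```

A ratio like words_per_text is the sort of value the prop attribute is described as holding, although the exact definitions used by the module (for example, of laziness or verbosity) are not given in this page.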

compare_freq(other, token)

Find differences in word or emoji use frequency.

Parameters

Returns

Dictionary mapping each word to a tuple (total, expected ratio).

Return type

diff_dict (dict)
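The shape of the returned dictionary can be sketched as follows. This is only one possible reading: the page does not define "expected ratio", so the example assumes it is the token's relative frequency in the first collection divided by its relative frequency in the second, and that "total" is the combined count. The function name compare_freq_sketch and its two {token: count} arguments are hypothetical.

```python
from collections import Counter


def compare_freq_sketch(counts_a, counts_b):
    """Compare token frequencies of two {token: count} mappings.

    Assumption: the ratio is relative frequency in A over relative
    frequency in B; the module may define it differently.
    """
    total_a = sum(counts_a.values())
    total_b = sum(counts_b.values())
    diff_dict = {}
    for token in set(counts_a) | set(counts_b):
        a = counts_a.get(token, 0)
        b = counts_b.get(token, 0)
        freq_a = a / total_a if total_a else 0.0
        freq_b = b / total_b if total_b else 0.0
        # A token absent from B has no finite ratio in this sketch.
        ratio = freq_a / freq_b if freq_b else float("inf")
        diff_dict[token] = (a + b, ratio)
    return diff_dict


mine = Counter({"hi": 3, "ok": 1})
theirs = Counter({"hi": 1, "ok": 3})
result = compare_freq_sketch(mine, theirs)
```

Under these assumptions, a ratio above 1 means the token is relatively more frequent in the first collection, and a ratio below 1 means it is more frequent in the second.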

get_conversations(names)

Get a list of conversations.

get_counts(word)

Find number of occurrences of word in each text.
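A minimal sketch of per-text occurrence counting is shown below. It assumes whole-word, case-insensitive matching over a list of strings; the real method may match differently, and the name counts_per_text is made up for this example.

```python
import re


def counts_per_text(texts, word):
    """Count whole-word occurrences of `word` in each text (hypothetical helper)."""
    # re.escape keeps any special characters in `word` literal;
    # \b restricts matches to whole words.
    pattern = re.compile(r"\b" + re.escape(word) + r"\b", re.IGNORECASE)
    return [len(pattern.findall(text)) for text in texts]


per_text = counts_per_text(["the cat and the hat", "There it is", "THE end"], "the")
```

The result has one entry per input text, so it can sit alongside the other per-text statistics.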

set_counts(raw_text, emote_free_text)

Set the count statistics.

Parameters

set_occurrence_dicts(raw_text, emote_free_text)

Fill a dictionary with the occurrences of each word and each emote.

Parameters
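The {token: count} structure described for occurrence_dicts can be sketched with collections.Counter. The example assumes that raw_text and emote_free_text are lists of strings, that words are taken from the emote-free text and emotes from the raw text, and that a rough Unicode range is enough to detect emotes. None of these details is confirmed by this page, and the names occurrence_dicts_sketch and EMOTE_RE are hypothetical.

```python
import re
from collections import Counter

# Rough emoji ranges, chosen for illustration only.
EMOTE_RE = re.compile("[\U0001F300-\U0001FAFF\u2600-\u27BF]")


def occurrence_dicts_sketch(raw_texts, emote_free_texts):
    """Build word and emote occurrence dictionaries (hypothetical helper)."""
    words = Counter()
    for text in emote_free_texts:
        # Lowercase so that "Good" and "good" count as the same word.
        words.update(re.findall(r"[a-z0-9']+", text.lower()))
    emotes = Counter()
    for text in raw_texts:
        emotes.update(EMOTE_RE.findall(text))
    return {"words": dict(words), "emotes": dict(emotes)}


dicts = occurrence_dicts_sketch(
    ["good \U0001F600 good", "bad \U0001F600"],
    ["good  good", "bad "],
)
```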

set_per_text_lists()

Set the per-text list dictionary with words per text, characters per text, and emotes per text.
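The per-text lists can be pictured as parallel lists with one entry per text. The sketch below assumes a list of strings as input and uses made-up key names (words, chars, emotes) and a rough emoji range; the module's actual keys and emote detection may differ.

```python
import re

# Rough emoji ranges, chosen for illustration only.
EMOTE_RE = re.compile("[\U0001F300-\U0001FAFF\u2600-\u27BF]")


def per_text_lists_sketch(texts):
    """Return per-text word, character, and emote counts (hypothetical helper)."""
    return {
        # Whitespace splitting treats a standalone emote as a word.
        "words": [len(t.split()) for t in texts],
        "chars": [len(t) for t in texts],
        "emotes": [len(EMOTE_RE.findall(t)) for t in texts],
    }


lists = per_text_lists_sketch(["hi \U0001F600", "one two three"])
```

Each list is indexed by text position, so entry i of every list describes the same text.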