Skip to content

Releases: Watts-Lab/team_comm_tools

All Caps Fix and Timestamp Robustness - 0.1.7

29 May 22:50

Choose a tag to compare

Release Notes

BUG FIX: Errors in num_all_caps

The function for counting all caps previous had an error in which the logic counted single-letter words. These have now been removed, and the resulting output should more accurately reflect words that are deliberately placed in "ALL CAPS," rather than portions of emojis (e.g., :D) or single-letter capitals (e.g., I, A).

Updated handling to timestamp formats

We were previously fairly lenient in how we handled time stamps; we allowed the time stamp column to contain null or unparseable values, which sometimes led to the TCT throwing uncaught errors. We now validate the formatting of timestamps upfront upon instantiation of the FeatureBuilder, and we do not allow users to proceed without correctly-formatted timestamps. The change in logic also cleans up some code redundancy (in which we were previously validating timestamps separately whenever we were using the timestamp column, rather than once at the beginning).

The following is the revised expected behavior in this version:

  1. If the user does not pass in a timestamp column, timestamp-related features ['Time Difference', 'Team Burstiness'] are removed from the output but execution proceeds normally.
  2. If the timestamp column is provided, we verify that there are no null values and that all values are parsable by pd.to_datetime(). Otherwise, halt execution and raise an error for the user to correct issues with the time column.

v0.1.6 - Hot Fix

11 Mar 04:49
245e0c8

Choose a tag to compare

v0.1.6 - 2025-03-10

What's Changed

Full Changelog: v0.1.5...v0.1.6

v0.1.5 - Patch Release

04 Mar 19:01
fcdb6fc

Choose a tag to compare

v0.1.5 - 2025-03-04

What's Changed

Full Changelog: v0.1.4.post2...v0.1.5

Patch Release - v0.1.4.post2

08 Oct 05:27
cd3ad2f

Choose a tag to compare

Defaults in FeatureBuilder were incorrectly specified, leading to undesirable behavior when relying on the default values of the vector_directory. The format is now correct.

Patch Release - v0.1.4.post1

08 Oct 04:17
627955a

Choose a tag to compare

Post-Release: Update README with new function call example for output_file_base.

Patch Release - v0.1.4

08 Oct 04:02
9b9ce16

Choose a tag to compare

  1. Progress Bars: Loading bar during feature generation helps users better understand the status of the features/estimated completion time.
  2. Vector Updates: Vectors are now batched for faster generation.
  3. Vectors are now generated by default WITH punctuation, allowing for more accurate gauging of sentiment.
  4. Denormalizing LIWC: LIWC is no longer normalized as a rate (i.e., per 100 words), and is instead returned as raw count per utterance. This ensures that the aggregated values of LIWC are more sensible/interpretable. (#306)
  5. Labeling Feature Columns with the Source: Politeness and Receptiveness features are labeled with the source (e.g., “politeness_convokit” and “receptiveness_yeomans”) (#300).
  6. Easy Access to List of Generated Features: There is now an easier way to access the names of features and columns generated (by using my_feature_builder.feature_names) as well as to access the names of the columns generated (my_feature_builder.chat_features, my_feature_builder.conv_features_base, my_feature_builder.conv_features_all) (#304)
  7. More Defaults for Input Params: Input parameters have defaults, so it’s easier than ever to use the FeatureBuilder (all you need is the input dataframe). (#286)
  8. One File Path to Rule Them All: You can generate files at all three levels automatically using the “output_file_base” parameter, rather than separately specifying the output paths for all three files. This also creates a more streamlined workaround than the current way of specifying paths, which is a bit counterintuitive. (However, we maintain full backwards-compatibility; this patch release does not change the behavior in which outputs are saved in the output/chat/... path. (#299).
  9. Website Auto-Updating: The project website will now auto-update alongside changes in dev, and documentation has been updated alongside these changes.
  10. Other Bug Fixes: Bug fixes for NLTK punkt: #302

v0.1.3 - Patch Release - Dependency and Documentation Updates

16 Sep 22:31

Choose a tag to compare

v0.1.3 - 2024-09-16

Added

  • Documentation: Updated documentation and our requirement files

Fixed

  • Dependency: Our current required version of torch (2.4.0) has a known issue that causes an error for Windows users; we have updated the requirement to 2.4.1 to resolve this issue.

What's Changed

Full Changelog: v0.1.2...v0.1.3

Patch Release - v0.1.2

17 Aug 02:18

Choose a tag to compare

Security patch ensuring privacy of LIWC lexicons.

v0.1.1 - Patch Release: Dependency and Documentation Updates

09 Aug 23:27

Choose a tag to compare

v0.1.1 - 2024-08-09

Added

  • Documentation: Updated the user guide to properly install the package from pip and dependencies.
  • Performance: Updated a CLI command download_resources that downloads spacy's en_core_web_sm model and NLTK data

Fixed

  • Dependency: Resolved the issue that en-core-web-sm and NLTK resources can't be downloaded upon installation. Now they are automatically added when the user runs Feature Builder for the first time if they're missing in the environment.

What's Changed

Full Changelog: v0.1.0...v0.1.1

v0.1.0 - Initial release

08 Aug 00:16

Choose a tag to compare

Pre-release

v0.1.0 - 2024-08-07

Added

  • Initial release