alexdrk14/RussoUkrainianWar_Dataset

Russo Ukrainian War Collection of Tweet IDs

This repository contains a collection of Tweet IDs associated with the current war between Russia and Ukraine. We began collecting on February 24, 2022. We also used Twitter's search API to retrieve historical tweets, so the dataset contains tweets dating back to February 22, 2022. We use Twitter’s streaming API to collect tweets that match a selected set of popular hashtags related to the topic. The list of selected hashtags is provided in the hashtags.txt file. To comply with Twitter’s Terms of Service, we publicly release only the Tweet IDs of the collected tweets. The data is released for non-commercial research use.

The associated paper has been accepted to the 16th International Conference on Advances in Social Networks Analysis and Mining (ASONAM-2024) under the title: Exploring Crisis-Driven Social Media Patterns: A Twitter Dataset of Usage During the Russo-Ukrainian War

Data Organization

The Tweet-IDs are organized as follows:

  • Tweet-ID files are stored in folders that indicate the year and month of the collection (YEAR-MONTH).
  • Each Tweet-ID file contains a collection of Tweet IDs. All file names follow the same structure: the prefix “tweet_ids_day_” followed by the YEAR_MONTH_DATE.
  • Note that Twitter returns Tweets in UTC, so all Tweet-ID folder and file names are in UTC as well.
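The naming scheme above can be sketched in Python as follows. This is only an illustration under stated assumptions, not code from the repository: the function names (`utc_moment`, `tweet_id_path`, `parse_tweet_ids`) are hypothetical, and zero-padded month and day values, the file suffix, and a one-ID-per-line file layout are all assumptions that should be checked against the actual files.

```python
from datetime import datetime, timedelta, timezone
from pathlib import Path


def utc_moment(moment: datetime) -> datetime:
    """Convert a timezone-aware datetime to UTC, since folders and files are named in UTC."""
    if moment.tzinfo is None:
        raise ValueError("expected a timezone-aware datetime")
    return moment.astimezone(timezone.utc)


def tweet_id_path(root: Path, moment: datetime, suffix: str = "") -> Path:
    """Build the expected location of the Tweet-ID file for the UTC day of `moment`.

    Assumptions (not confirmed by the README): month and day are zero-padded,
    and `suffix` (for example a file extension) is whatever the repository uses.
    """
    day = utc_moment(moment)
    folder = f"{day.year}-{day.month:02d}"  # YEAR-MONTH folder
    name = f"tweet_ids_day_{day.year}_{day.month:02d}_{day.day:02d}{suffix}"
    return root / folder / name


def parse_tweet_ids(text: str) -> list[int]:
    """Parse Tweet IDs from file contents, assuming one ID per line."""
    return [int(line) for line in text.splitlines() if line.strip()]


if __name__ == "__main__":
    # 01:30 on February 25 at UTC+2 is still February 24 in UTC.
    local = datetime(2022, 2, 25, 1, 30, tzinfo=timezone(timedelta(hours=2)))
    print(tweet_id_path(Path("."), local).as_posix())
```

Converting to UTC before building the path matters for tweets posted close to midnight local time, which otherwise would be looked up in the wrong daily file.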

Data Statistics and Analysis

We compute multiple statistical measurements over the described dataset on a daily basis.

All described analytics are published on the Parasecurity Group webpage.

Anonymized Text Data Sharing

Additionally, we have shared the collected text data sorted by creation date. User IDs, tweet IDs, and user mentions have all been anonymized for privacy. You can access the data via the following link: Zenodo repository.

Statistics Summary (v1.0)

Number of Tweets : 127,275,386

Daily volume of registered users' activity

plot

Daily volume of 10 most popular hashtags

plot

Daily positive and negative sentiments towards each country

plot

Daily positive and negative sentiments towards each president

plot

Data Usage Agreement / How to Cite

By using this dataset, you agree to abide by the stipulations in the license, remain in compliance with Twitter’s Terms of Service, and cite the following manuscript:

Authors and Paper title with arxiv_id BibTeX: TBD

Inquiries

Please first read through the README and the closed issues to see whether your question has already been addressed.

If you have any questions about this dataset or analysis, please contact:

  • Ioannis Lamprou at ilamprou1[at]tuc.gr
  • Alexander Shevtsov at asevtsov[at]tuc.gr
