AU-DATALAB/NORDIS-nordic-Twitter-data-specs

NORDIS - Nordic Twitter data specifics

Description of the specifics of the Nordic Twitter datasets collected for various projects in NORDIS. The data is gathered by DATALAB in collaboration with the HOPE project. The HOPE project has been collecting Danish, Norwegian, and Swedish tweets since early 2020. DATALAB is collecting Finnish tweets for the same time range, using a similar approach so that the datasets match as closely as possible.

NORDIS website

The scraping process

DATALAB scrapes Finnish Twitter with the code written by CHCAA, the same code that is used to scrape the Scandinavian tweets for HOPE. For more information, contact Peter Bjerregaard Vahlstrup.

We have academic research access to Twitter and use its API for the scraping. This requires an API access token and key. The tweets are saved in separate files, split by month.
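As an illustration of the monthly split, the following is a minimal sketch and not the scraping code used by DATALAB or CHCAA. It assumes each tweet is a dictionary with a `created_at` timestamp in the ISO 8601 UTC form `2020-03-15T12:34:56.000Z`; the function names `month_key` and `group_by_month` are hypothetical.

```python
from collections import defaultdict
from datetime import datetime


def month_key(created_at: str) -> str:
    """Return 'YYYY-MM' for a timestamp such as '2020-03-15T12:34:56.000Z'.

    The timestamp layout is an assumption, not taken from this repository.
    """
    parsed = datetime.strptime(created_at, "%Y-%m-%dT%H:%M:%S.%fZ")
    return parsed.strftime("%Y-%m")


def group_by_month(tweets):
    """Group tweet dictionaries by the month of their 'created_at' field."""
    grouped = defaultdict(list)
    for tweet in tweets:
        grouped[month_key(tweet["created_at"])].append(tweet)
    return dict(grouped)
```

Each key of the returned dictionary could then be written to its own file, so that one file holds one month of tweets.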

Structure of this repository

.
├── res                                 # Resources for the Finnish scrape
│   ├── finnish_query_tracking.md       # Overview of how the Finnish scrape is developing
│   ├── test_data.py                    # Quick script that shows how many tweets have been scraped per month
│   └── M8_FI.txt                       # List of generated Finnish keywords
├── HOPE-and-scrape-specs.md            # Specifications for the HOPE and Finnish scrapes
└── README.md                           # Main information for this repository
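
A per-month count like the one `test_data.py` is described as producing could be sketched as follows. This is not the contents of `test_data.py`. It assumes the monthly files are newline-delimited JSON with one tweet per line, and the `*.ndjson` pattern and the function name `count_tweets_per_file` are hypothetical.

```python
from pathlib import Path


def count_tweets_per_file(data_dir, pattern="*.ndjson"):
    """Count non-empty lines in each file matching `pattern` in `data_dir`.

    Assumes one tweet per line; the file layout is an assumption.
    """
    counts = {}
    for path in sorted(Path(data_dir).glob(pattern)):
        with path.open(encoding="utf-8") as handle:
            counts[path.name] = sum(1 for line in handle if line.strip())
    return counts
```

Calling `count_tweets_per_file("data")` would return a dictionary that maps each file name to its tweet count, where `data` is a placeholder directory name.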

This repository does not include the API key and token, nor the actual call made to receive the tweets.
