Skip to content

Commit 3c0f4ab

Browse files
authored
Merge pull request #172 from NYU-RTS/review_datasets
updated a url and removed Peel
2 parents ea73733 + 277111a commit 3c0f4ab

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

docs/hpc/04_datasets/01_intro.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -92,15 +92,15 @@ Please open the ImageNet site, find the terms of use ([http://image-net.org/down
9292

9393
NYU has a subscription to Twitter Decahose - 10% random sample of the realtime Twitter Firehose through a streaming connection
9494

95-
*Data are stored* in GCP cloud (BigQuery) and on HPC clusters Greene and Peel (Parquet format).
95+
*Datasets are stored* in GCP cloud (BigQuery) and on the HPC cluster Greene.
9696

9797
Please contact Megan Brown at [The Center for Social Media & Politics](https://csmapnyu.org/) to get access to data and learn the tools available to work with it.
9898

9999
*On cluster dataset is available under (given that you have permissions)*
100100
- `/scratch/work/twitter_decahose/`
101101

102102
### ProQuest Congressional Record
103-
About data set: [ProQuest Congressional Record](https://guides.nyu.edu/tdm/proquest-congressional-record-tdm-guide)
103+
About data set: [ProQuest Congressional Record](https://guides.nyu.edu/govdocs/congressional#s-lg-box-14137380)
104104

105105
The ProQuest Congressional Record text-as-data collection consists of machine-readable files capturing the full text and a small number of metadata fields for a full run of the Congressional Record between 1789 and 2005. Metadata fields include the date of publication, subjects (for issues for which such information exists in the ProQuest system), and URLs linking the full text to the canonical online record for that issue on the ProQuest Congressional platform. A total of 31,952 issues are available.
106106

0 commit comments

Comments
 (0)