1,721,080 research outputs found

    #MarchForScience tweets April 12-26, 2017

    No full text
    1,276,220 tweet ids for #MarchForScience collected with Documenting the Now's twarc from January 22-26, 2017. Tweets can be “rehydrated” with Documenting the Now’s twarc (https://github.com/DocNow/twarc). twarc.py hydrate MarchForScience_tweet-ids.txt > MarchForScience.json

    Wet'suwet'en tweet ids

    No full text
    425,227 tweet ids for Wet'suwet'en tweets, collected with Documenting the Now's twarc. Tweets can be “rehydrated” with Documenting the Now’s twarc, or Hydrator. twarc hydrate wetsuweten-20210115-ids.txt > wetsuweten.jsonl Tweets were collected via the Standard Search API on a cron job every five days beginning on February 18, 2020. Collection is ongoing. The account that was used to collect these Tweets failed to collect Tweets for the period from Sun Jul 26 02:00:21 +0000 2020 through Fri Aug 07 20:05:54 +0000 2020.</p

    Tweet ids for final Tragically Hip concert

    No full text
    228,086 tweet ids for "TheHip, hipinkingston" captured during the Tragically Hip's final concert in Kingston, Ontario in August 2016. Tweets can be "rehydrated" with Documenting the Now's twarc (https://github.com/DocNow/twarc). twarc.py --hydrate th_final_concert_kingston_tweet_ids.txt > th_final_concert_kingston.jso

    The fall of Aleppo tweets; Aleppo 2016-12-13 through 2016-12-29

    No full text
    8,595,589 tweet ids for aleppo tweets captured during the fall of Aleppo in December 2016. Tweets can be "rehydrated" with Documenting the Now's twarc (https://github.com/DocNow/twarc). twarc.py --hydrate aleppo_tweet_ids.txt > aleppo.jso

    #JeffSessions tweets

    No full text
    2,278,757 tweet ids for #JeffSessions collected with Documenting the Now's twarc. Tweets can be “rehydrated” with Documenting the Now’s twarc, or Hydrator. twarc hydrate to_realdonaldtrump_ids.txt > to_donaltrump.jsonl. </p

    #paradisepapers tweets

    No full text
    1,797,260 tweet ids for #paradisepapers collected with Documenting the Now's twarc from November 5-26, 2017. Tweets can be “rehydrated” with Documenting the Now’s twarc (https://github.com/DocNow/twarc). twarc.py hydrate paradisepapers_ids.txt > paradisepapers.json. Or with Documenting the Now's Hydrator: https://github.com/DocNow/hydrato

    #climatemarch tweets April 19-May 3, 2017

    No full text
    681,668 tweet ids for #climate collected with Documenting the Now's twarc from January 22-26, 2017. Tweets can be “rehydrated” with Documenting the Now’s twarc (https://github.com/DocNow/twarc). twarc.py hydrate climatemarch_tweet_ids.txt > climatemarch.json

    #healthcanada #NACI #fordnation #medicalfreedom #covid19 #covid19vaccines #protectourfamilies #protectyourchildren #holdtheline tweets

    No full text
    2,661,117 tweet ids for #healthcanada #NACI #fordnation #medicalfreedom #covid19 #covid19vaccines #protectourfamilies #protectyourchildren #holdtheline tweets, collected with Documenting the Now's twarc. Tweets can be “rehydrated” with Documenting the Now’s twarc, or Hydrator. twarc hydrate tweet-ids.txt > tweets.jsonl ID files are available for all hashtags or some individual hashtags: covid19-ids.txt covid19vaccines-ids.txt fordnation-ids.txt healthcanada-ids.txt healthcanada-NACI-fordnation-medicalfreedom-covid19-covid19vaccines-protectourfamilies-protectyourchildren-holdtheline-ids.txt holdtheline-ids.txt medicalfreedom-ids.txt NACI-ids.txt protectyourchildren-ids.txt Tweets were collected via the Standard Search API on: November 18, 2021 November 21, 2021 November 26, 2021 December 1, 2021 </p

    #elxn44 tweets (44th Canadian Federal Election)

    No full text
    2,075,645 tweet ids for #elxn44 tweets, collected with Documenting the Now's twarc. Tweets can be “rehydrated” with Documenting the Now’s twarc, or Hydrator. twarc hydrate elxn44-tweet-ids.txt > elxn44.jsonl. Tweets were collected via the Standard Search API on a cron job every five days from July 28, 2021 - November 05, 2021.</p

    Tweets to Donald Trump (@realDonaldTrump)

    No full text
    362,464,578 tweet ids for tweets directed at Donald Trump (@realDonaldTrump), collected with Documenting the Now's twarc. Tweets can be “rehydrated” with Documenting the Now’s twarc, or Hydrator. twarc hydrate to_realdonaldtrump_20210120_ids.txt > to_realdonaldtrump_20210120.jsonl. Collection notes: Tweets from May 7, 2017 - October 16, 2018 of the dataset used a combination of the Filter (Streaming) API and Search API. The Filter API failed on June 21, 2017. From June 23, 2017 forward only the Search API was used to collect. Collection was done every 5 days on a cron job, and periodically deduplicated. There is a data gap from Tue Jul 28 13:53:50 +0000 2020 through Thu Aug 06 09:36:23 +0000 2020 due to a collection error. This dataset also includes a number of derivative csv files from the original jsonl collected. This includes: A user csv file created with jq (see below). twut userInfo twut language twut times twut sources twut hashtags twut urls twut animatedGifUrls twut imageUrls twut mediaUrls twut videoUrls User csv: jq -r '[.id_str, .created_at, .user.screen_name, .retweeted_status != null] | @csv' to_realdonaldtrump_20190130.jsonl > to_realdonaldtrump_20190130_users.jsonl </p
    corecore