1,721,057 research outputs found

    Knowledge Transfer Seminar - EARLY-CAREER RESEARCHERS IN MEDICAL APPLICATIONS @ CERN – SHORT TALKS

    No full text
    Knowledge Transfer seminar entitled EARLY-CAREER RESEARCHERS IN MEDICAL APPLICATIONS @ CERN – SHORT TALKS held on 16th October in the Main Auditorium. The seminar featured Adithya Raghupathi Gowtham, Gianluca Stringhini, Marie Nowak, Maryam Mohamedali, Shaukatali Zahir Hussein & Wioletta Sandra Kozlowska

    Dataset for "On the Origins of Memes by Means of Fringe Web Communities"

    No full text
    This dataset was collected with research funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie Grant Agreement No 691025. The publication on which this dataset was used is: "On the Origins of Memes by Means of Fringe Web Communities". Savvas Zannettou, Tristan Caulfield, Jeremy Blackburn, Emiliano De Cristofaro, Michael Sirivianos, Gianluca Stringhini, and Guillermo Suarez-Tangil. ACM Internet Measurement Conference (IMC), 2018., DOI: https://doi.org/10.5281/zenodo.1323551 The code related to this research can be found here: https://github.com/memespaper/memes_pipeline, or here: 10.5281/zenodo.1463050 The restriction of accessing this dataset has been revoked. The dataset is available here: DOI: https://doi.org/10.5281/zenodo.1184171 Presentation available here: https://doi.org/10.5281/zenodo.147708

    Dataset for "Who Let The Trolls Out? Towards Understanding State-Sponsored Trolls"

    No full text
    This is the dataset used for the study "Who Let The Trolls Out? Towards Understanding State-Sponsored Trolls". Savvas Zannettou, Tristan Caulfield, William Setzer, Michael Sirivianos, Gianluca Stringhini, Jeremy Blackburn. Arxiv, 2019. DOI: 10.5281/zenodo.2558560 The dataset consists of the data released by Twitter on October 2018 for Russian and Iranian state-sponsored troll accounts, which is available at https://about.twitter.com/en_us/values/elections-integrity.html#data as well as intermediate data that we generated after processing the raw data. For instance, we include trained Word2Vec and LDA models, the output of our influence estimation experiments via Hawkes Processes, and a lot of other data necessary to reproduce the results in the paper. To use the provided data simply download the compressed file from and make sure that the uncompressed data folder is in the same directory as the IPython Notebook. The code used for this study can be found here: https://github.com/zsavvas/trolls_analysis Please cite our paper if any publication, of any form and kind results of you using this data: @article{zannettou2018let, title={Who let the trolls out? towards understanding state-sponsored trolls}, author={Zannettou, Savvas and Caulfield, Tristan and Setzer, William and Sirivianos, Michael and Stringhini, Gianluca and Blackburn, Jeremy}, journal={arXiv preprint arXiv:1811.03130}, year={2018} }</pre

    Dataset: Raiders of the Lost Kek: 3.5 Years of Augmented 4chan Posts from the Politically Incorrect Board

    Full text link
    This is the dataset released with the paper titled: "Raiders of the Lost Kek: 3.5 Years of Augmented 4chan Posts from the Politically Incorrect Board". The dataset is a single Newline delimited JSON file. Each line in the file consists of a JSON object which is a full 4chan /pol/ thread. The JSON objects contain all the key/values returned by the 4chan API, along with three additional keys (entities, perspectives, and extracted_poster_id). For each JSON object we complement the data with the list of the named entities we detect for each post, using the spaCy Python library. In addition, for each post we add scores returned by the Google’s Perspective API, and more specifically seven scores in the [0; 1] interval. For the detailed description of every key in the JSON structure, along with the type of the value, please read the readme.pdf file provided with this dataset. If you find our dataset useful, please cite our paper: @article{papasavva2020raiders, title={Raiders of the Lost Kek: 3.5 Years of Augmented 4chan Posts from the Politically Incorrect Board}, author={Antonis Papasavva, Savvas Zannettou, Emiliano De Cristofaro, Gianluca Stringhini, Jeremy Blackburn}, journal={arXiv preprint arXiv:2001.07487}, year={2020} }    </p

    Dataset for "What is Gab? A Bastion of Free Speech or an Alt-Right Echo Chamber?"

    No full text
    &lt;p&gt;This dataset was used for this project: &quot;What is Gab? A Bastion of Free Speech or an Alt-Right Echo Chamber?&quot;. Savvas Zannettou, Barry Bradlyn, Emiliano De Cristofaro, Michael Sirivianos, Gianluca Stringhini, Haewoon Kwak, Jeremy Blackburn. Workshop on Computational Methods in CyberSafety, Online Harassment and Misinformation, 2018.&nbsp;DOI:&nbsp;&lt;a href="https://arxiv.org/ct?url=http%3A%2F%2Fdx.doi.org%2F10%252E1145%2F3184558%252E3191531&amp;v=345a781d"&gt;10.1145/3184558.3191531&lt;/a&gt;.&lt;/p&gt; &lt;p&gt;In addition, this project has received funding from the European Union&rsquo;s Horizon 2020 Research and Innovation program under the Marie Skłodowska-Curie ENCASE project (Grant Agreement No. 691025). The work reflects only the authors&rsquo; views; the Agency and the Commission are not responsible for any use that may be made of the information it contains.&lt;/p&gt; &lt;p&gt;Using Gab&rsquo;s API, we crawl the social network using a snowball methodology. Specifically, we obtain data for the most popular users as returned by Gab&rsquo;s API and iteratively collect data from all their followers as well as their followings. Subsequently, for all users in our dataset we collect all of their the posts. Overall, we collect 22,112,812 posts from 336,752 users, between August 2016 and January 2018.&nbsp;This dataset is a .json file and each line has one .json&nbsp;object.&lt;/p&gt

    Going Beyond Counting First Authors in Author Co-citation Analysis

    Full text link
    The present study examines one of the fundamental aspects of author co-citation analysis (ACA) - the way co-citation counts are defined. Co-citation counting provides the data on which all subsequent statistical analyses and mappings are based, and we compare ACA results based on two different types of co-citation counting - the traditional type that only counts the first one among a cited work's authors on the one hand and a non-traditional type that takes into account the first 5 authors of a cited work on the other hand. Results indicate that the picture produced through this non-traditional author co-citation counting contains more coherent author groups and is therefore considerably clearer. However, this picture represents fewer specialties in the research field being studied than that produced through the traditional first-author co-citation counting when the same number of top-ranked authors is selected and analyzed. Reasons for these effects are discussed

    Dataset for "What is Gab? A Bastion of Free Speech or an Alt-Right Echo Chamber?"

    No full text
    This dataset was used for this project: "What is Gab? A Bastion of Free Speech or an Alt-Right Echo Chamber?". Savvas Zannettou, Barry Bradlyn, Emiliano De Cristofaro, Michael Sirivianos, Gianluca Stringhini, Haewoon Kwak, Jeremy Blackburn. Workshop on Computational Methods in CyberSafety, Online Harassment and Misinformation, 2018. DOI: 10.1145/3184558.3191531. In addition, this project has received funding from the European Union’s Horizon 2020 Research and Innovation program under the Marie Skłodowska-Curie ENCASE project (Grant Agreement No. 691025). The work reflects only the authors’ views; the Agency and the Commission are not responsible for any use that may be made of the information it contains. Using Gab’s API, we crawl the social network using a snowball methodology. Specifically, we obtain data for the most popular users as returned by Gab’s API and iteratively collect data from all their followers as well as their followings. Subsequently, for all users in our dataset we collect all of their the posts. Overall, we collect 22,112,812 posts from 336,752 users, between August 2016 and January 2018. This dataset is a .json file and each line has one .json object.</p

    Variations on the Author

    Full text link
    “Variations on the Author” discusses two of Eduardo Coutinho’s recent films (Um Dia na Vida, from 2010, and Últimas Conversas, posthumously released in 2015) and their contribution to the general question of documentary authorship. The director’s filmography is characterized by a consistent yet self-effacing form of authorial self-inscription: Coutinho often features as an interviewer that rather than express opinions propels discourses; an interviewer that is good at listening. This mode of self-inscription characterizes him as an author who is not expressive but who is nonetheless markedly present on the screen. In Um Dia na Vida, however, Coutinho is completely absent form the image, while Últimas Conversas, on the contrary, includes a confessional prologue that moves the director from the margins to the center of his films. This article examines the ways in which these works stand out in the filmography of a director who offers new insights into the notion of cinematic authorship
    corecore