15 research outputs found

    A Novel Text Analysis Method: Numerals Reveal the Author

    No full text
    Two approaches to the statistical analysis of texts are suggested, both based on the study of numerals occurring in literary texts. The first approach is related to the study of the frequency distribution of various leading digits of numerals occurring in the text. This approach is convenient for testing whether a group of texts has common authorship: the latter is dubious if the frequency distributions are sufficiently different. The second approach requires the study of the frequencies of numerals themselves. The approach yields information about the author, stylistic and genre peculiarities of the texts and is suited for advanced study of authorial texts. The hypothesis that I. Ilf and E. Petrov are fake authors of novels "The Twelve Chairs" and "The Little Golden Calf", and they were ghosted by M. Bulgakov, is checked. The frequency distribution of numerals, as well as its cluster analysis, do not confirm this hypothesis

    Numerals in authorial Turkish-language texts and the stylometric analysis

    Full text link
    Two approaches to the statistical analysis of texts are suggested, both based on the study of numerals occurrence in coherent texts. The first approach is related to the study of the frequency distribution of various leading digits of numerals occurring in the text. These frequencies are unequal: the digit 1 is strongly dominating; usually, the incidence of subsequent digits is monotonically decreasing. The frequencies of occurrence of the digit 1, as well as, to a lesser extent, the digits 2 and 3, are usually a characteristic author’s style feature, manifested in all (sufficiently long) texts of any author. This approach is convenient for testing whether a group of texts has common authorship: the latter is dubious if the frequency distributions are sufficiently different. The second approach is the extension of the first one and requires the study of the frequency distribution of numerals themselves (not their leading digits). The approach yields non-trivial information about the author, stylistic and genre peculiarities of the texts and is suited for the advanced discourse analysis. This paper deals with the application of the second approach to the literary texts in Turkish. We have analysed almost the whole corpus of works by are illustrated by examples of computer analysis of the literary texts by O. Pamuk and Y. Kemal – two of Turkey’s most prominent novelists. The hierarchical cluster analysis based on the occurrence of numerals in the texts by Pamuk and Kemal shows the author, genre, and chronology differences of numerals usage in the literary texts of these authors

    Data Analysis on the Basis of Numerals Statistics

    Full text link
    Two approaches to content analysis of text data are suggested, both based on the statistical study of numerals occurrence in texts. The first approach is related to counting the frequency distribution of various leading digits of numerals occurring in the text. These frequencies are unequal: the digit 1 is strongly dominating; usually, the incidence of subsequent digits is monotonically decreasing. The frequencies of occurrence of the digit 1, as well as, to a lesser extent, the digits 2 and 3, are usually a characteristic author's style feature, manifested in all (sufficiently long) literary texts of any author. This approach is convenient for testing whether a group of texts has common authorship: the latter is dubious if the frequency distributions are sufficiently different. The second approach is the extension of the first one and requires the study of the frequency distribution of numerals themselves (not their leading digits). The approach yields non-trivial information about the author, stylistic and genre peculiarities of the texts and is suited for the advanced stylometric analysis. The proposed approaches are illustrated by examples of computer analysis of the literary texts in Lithuanian – by S. Daukantas, A. Baranauskas, Maironis, and J. Tumas-Vaižgantas

    Stylometry and Numerals Usage: Benford’s Law and Beyond

    Full text link
    We suggest two approaches to the statistical analysis of texts, both based on the study of numerals occurrence in literary texts. The first approach is related to Benford’s Law and the analysis of the frequency distribution of various leading digits of numerals contained in the text. In coherent literary texts, the share of the leading digit 1 is even larger than prescribed by Benford’s Law and can reach 50 percent. The frequencies of occurrence of the digit 1, as well as, to a lesser extent, the digits 2 and 3, are usually a characteristic the author’s style feature, manifested in all (sufficiently long) literary texts of any author. This approach is convenient for testing whether a group of texts has common authorship: the latter is dubious if the frequency distributions are sufficiently different. The second approach is the extension of the first one and requires the study of the frequency distribution of numerals themselves (not their leading digits). The approach yields non-trivial information about the author, stylistic and genre peculiarities of the texts and is suited for the advanced stylometric analysis. The proposed approaches are illustrated by examples of computer analysis of the literary texts in English and Russian

    Neuropathology of epileptic encephalopathies and non-paroxysmal epileptic disorders and principles of their treatment

    Full text link
    Based on his many year's studies and the data available in the literature, the author considers the neuropathophysiological mechanisms responsible for the damaging action of epileptic discharges on basic brain functions and provides evidence for the views of prolonged non-paroxysmal psychoneurological epileptic disorders and epileptic encephalopathies. He shows the consistency of clinical manifestations of the site of cerebral epileptic discharges on the basis of an update on their functional localization and functional neuroimaging. Computerized three-dimensional electroencephalography (EEG) data mapping is shown to play a crucial role in the understanding of the mechanisms, diagnosis, and treatment of non-paroxysmal epileptic disorders. Based on the concepts that epileptiform activity plays a neuropathological role in the genesis of non-paroxysmal epileptic disorders, the author substantiates the use of agents that are effective in suppressing EEG epileptiform activity (valproate, levetiracetam, and lamotrigine) as a first-choice drug and caution when giving the antiepileptics that can aggravate epileptiform activity and clinical manifestations

    Education During a Pandemic: Prospects and Challenges of Digital Learning

    No full text
    The study focuses on the impact of the COVID-19 pandemic on education systems around the world. The article reveals similarities and differences in decisions made by the responsible bodies in order to ensure the work of educational institutions during the current crisis caused by the new coronavirus pandemic. The author analyses the tools and facilities selected for education process and argues that in most cases digital technologies help students to continue their education even in the face of serious social shocks. At the same time, the downside of digital learning is also discussed. In a significant number of cases, the inability to attend schools and universities has a painful effect on the overall physical (such as food support), psychosocial (stress) and economic (additional costs associated with the need to use equipment and communication facilities) condition of both students and their families. The author emphasizes the pro and contra of digitalization as a vector for the development of education: new technologies not only contribute to solving the “old” issues of the industry, but also provoke the emergence of new challenges in this area. It is noted that the current situation in education can radically change not only the set of familiar tools for transferring knowledge, but also its content. Decision makers, heads of educational institutions, students and their parents face the challenge of finding the optimal ratio of “new” (digital) and “old” (classical) in the emerging model of education of the XXI century. The research is based on data published by international organizations (UN, ITU, UNESCO et al.), educational institutions of various countries of the world, and the author's personal teaching experience during the pandemic

    Great Recluses Salinger And Pynchon: A Comparative Stylometric Analysis of Texts

    No full text
    Количественный метод изучения авторского стиля литературных текстов, основанный на анализе статистики встречающихся в них числительных, применен к англоязычным текстам. Показано, что манера использования числительных индивидуальна для каждого автора; совокупность числительных является авторским инвариантом, различающим тексты разного авторства. Выполнен стилометрический анализ литературных произведений Дж. Сэлинджера и Т. Пинчона – представителей литературного постмодернизма США. Обнаружены различия в использовании авторами числительных. Результаты анализа подвергнуты иерархической кластеризации, показывающей близость стилей двух авторов.The study pertains to the field of quantitative linguistics. The quantitative method of studying the author's style of literary texts, based on the analysis of statistics of numerals occurring in them, is applied to English language texts. It is shown that the numerals used by the author in the (fiction) text are individual for each author; their combination is a characteristic feature (author's invariant) distinguishing texts by different authors. a comparative stylometric analysis of literary texts by T. Pynchon and J. D. Salinger – representatives of American literary postmodernism – is performed. a noticeable difference in the way the authors use numbers is observed. The results of the analysis were subjected to hierarchical clustering, correctly distributing the texts according to the authorship. Thus, the new method of stylometry is able to successfully attribute literary texts.Исследование выполнено за счет средств гранта Российского научного фонда № 23-28-00750, https://rscf.ru/project/23-28-00750/, проект «Разработка нового метода стилометрии на основе статистики использования числительных в авторских текстах»

    A novel method of stylometry based on the statistic of numerals

    Full text link
    A new method of statistical analysis of texts is suggested. The frequency distribution of the first significant digits in numerals of English-language texts is considered. We have taken into account cardinal as well as ordinal numerals expressed both in figures, and verbally. To identify the author's use of numerals, we previously deleted from the text all idiomatic expressions and set phrases accidentally containing numerals, as well as itemizations and page numbers, etc. Benford's law is found to hold approximately for the frequencies of various first significant digits of compound literary texts by different authors; a marked predominance of the digit 1 is observed. In coherent authorial texts, characteristic deviations from Benford's law arise which are statistically stable significant author peculiarities that allow, under certain conditions, to consider the problem of authorship and distinguish between texts by different authors. The text should be large enough (at least about 200 kB). At the end of {1, 2, ⋯, 9} digits row, the frequency distribution is subject to strong fluctuations and thus unrepresentative for our purpose. The aim of the theoretical explanation of the observed empirical regularity is not intended, which, however, does not preclude the applicability of the proposed methodology for text attribution. The approach suggested and the conclusions are backed by the examples of the computer analysis of works by W.M. Thackeray, M. Twain, R. L. Stevenson, J. Joyce, sisters Bront.e, and J.Austen. On the basis of technique suggested, we examined the authorship of a text earlier ascribed to L. F. Baum (the result agrees with that obtained by different means). We have shown that the authorship of Harper Lee's "To Kill a Mockingbird" pertains to her, whereas the primary draft, "Go Set a Watchman", seems to have been written in collaboration with Truman Capote. All results are confirmed on the basis of parametric Pearson's chi-squared test as well as non-parametric Mann-Whitney U test and Kruskal-Wallis test. Copyright © 2018 Institute of Computer Science

    Evaluation of corporate social responsibility

    No full text
    In article, some questions connected with the influence of business social responsibility on the development of the external and internal enterprise environment are described. Moreover, the problems connected with the assessment of corporate social responsibility are presented, types of the social reporting and indexes on social investments are examined. Relevance of the paper is connected with studying an assessment problem of corporate social responsibility of business. The author tries to analyse an assessment techniques of corporate social responsibility of the Russian enterprises and to reveal the most important directions of an assessment. Descriptive, analytical and structural methods of research are used for analysing existing practices of social responsibility of Russian companies and for identifying the parameters of the evaluation model of social responsibility. In this study, the concept of corporate social policy is given, and assessment methods of small social practices allow us to select the most effective project. For large enterprises with a developed system of social responsibility complex methods based on social reporting and index indicators are used
    corecore