1,721,055 research outputs found

    Going Beyond Counting First Authors in Author Co-citation Analysis

    Full text link
    The present study examines one of the fundamental aspects of author co-citation analysis (ACA) - the way co-citation counts are defined. Co-citation counting provides the data on which all subsequent statistical analyses and mappings are based, and we compare ACA results based on two different types of co-citation counting - the traditional type that only counts the first one among a cited work's authors on the one hand and a non-traditional type that takes into account the first 5 authors of a cited work on the other hand. Results indicate that the picture produced through this non-traditional author co-citation counting contains more coherent author groups and is therefore considerably clearer. However, this picture represents fewer specialties in the research field being studied than that produced through the traditional first-author co-citation counting when the same number of top-ranked authors is selected and analyzed. Reasons for these effects are discussed

    Visualizing Longitudinal Data with Dropouts

    Full text link
    A triangle plot is proposed to display longitudinal data with dropouts. The triangle plot is a tool of data visualization that can also serve as a graphical check for informativeness of the dropout process. There are similarities between the lasagna plot and the triangle plot but the explicit use of dropout time as an axis is an advantage of the triangle plot over the more commonly used graphical strategies for longitudinal data. It is possible to interpret the triangle plot as a trellis plot 1 which gives rise to several extensions such as the triangle histogram and the triangle boxplot. R code is available to streamline the use of the triangle plot in practice

    Mixtures of Receiver Operating Characteristic Curves

    Full text link
    Rationale and Objectives: ROC curves are ubiquitous in the analysis of imaging metrics as markers of both diagnosis and prognosis. While empirical estimation of ROC curves remains the most popular method, there are several reasons to consider smooth estimates based on a parametric model. Materials and Methods: A mixture model is considered for modeling the distribution of the marker in the diseased population motivated by the biological observation that there is more heterogeneity in the diseased population than there is in the normal one. It is shown that this model results in an analytically tractable ROC curve which is itself a mixture of ROC curves. Results: The use of CK-BB isoenzyme in diagnosis of severe head trauma is used as an example. ROC curves are fit using the direct binormal method, ROCKIT and the Box-Cox transformation as well as the proposed mixture model. The mixture model generates an ROC curve that is much closer to the empirical one than the other methods considered. Conclusions: Mixtures of ROC curves can be helpful in fitting smooth ROC curves in datasets where the diseased population has higher variability than can be explained by a single distribution

    Computing the Total Sample Size When Group Sizes Are Not Fixed

    Full text link
    This article is concerned with computing the total sample size required for a two-sample comparison when the sizes of the two groups to be compared cannot be fixed in advance. This is frequently encountered when group membership depends on a variable which is observable only after the subject is enrolled to the study, such as a genetic or a biological marker. The most common way of circumventing this problem is assuming a fixed number for the prevalence of the condition that will determine the group membership and compute the required sample size conditionally. In this article this practice is formalized by placing a prior distribution on the prevalence which results in an analytically tractable formula for the unconditional sample size. In particular a sample size inflation factor, a number that can be multiplied with conditional sample size, is presented. An example is given from the planning of a clinical trial investigating the prognostic role of molecular markers in gastrointestinal stromal cancer

    Asymptotic distribution of the least squares estimator in the first-order autoregressive process

    No full text
    This study is about the asymptotic distribution of the least squares estimator in nonstationary first-order autoregressive processes. These processes are commonly used to model economic time series and the desired distribution is important in finding the size of the so-called unit root tests. Our approach is based on the asymptotic characterization of the distribution in terms of a functional of the standard Wiener process. We use the Karhunen- Loeve expansion for the Wiener process and obtain the solution using characteristic functions and the Fourier inversion theorem. As compared to the previous studies, our method provides a conceptually simple framework in which one can investigate more complicated models

    Bayes factors for variance components in the mixed linear model

    No full text
    The Bayes Factor is a widely-used summary measure that can be used to test hypotheses in a Bayesian setting. It also performs well in problems of model selection. In this study, Bayes Factors for variance components in the mixed linear model are derived. The formulation used avoids the assumption of a priori independence between the variance components by using a Dirichlet prior on the intraclass correlations. A reference prior, which results in a Bayes Factor that is flexible and easy to use, is suggested. Hypothesis tests using the Bayes Factor avoid difficulties of the classical tests, such as non-uniqueness and invalid asymptotics. The priors on the nuisance parameters are chosen to be non-informative and the corresponding integrals are carried out analytically. For the parameters of interest, however, numerical methods have to be used. For this purpose, Monte Carlo methods have been investigated. Simple random sampling and Latin hypercube sampling are employed for simulating the prior and a Gibbs sampling scheme has been implemented for simulating the posterior. The resulting estimators are compared on a small data set

    Variations on the Author

    Full text link
    “Variations on the Author” discusses two of Eduardo Coutinho’s recent films (Um Dia na Vida, from 2010, and Últimas Conversas, posthumously released in 2015) and their contribution to the general question of documentary authorship. The director’s filmography is characterized by a consistent yet self-effacing form of authorial self-inscription: Coutinho often features as an interviewer that rather than express opinions propels discourses; an interviewer that is good at listening. This mode of self-inscription characterizes him as an author who is not expressive but who is nonetheless markedly present on the screen. In Um Dia na Vida, however, Coutinho is completely absent form the image, while Últimas Conversas, on the contrary, includes a confessional prologue that moves the director from the margins to the center of his films. This article examines the ways in which these works stand out in the filmography of a director who offers new insights into the notion of cinematic authorship

    Appropriate Similarity Measures for Author Cocitation Analysis

    Full text link
    We provide a number of new insights into the methodological discussion about author cocitation analysis. We first argue that the use of the Pearson correlation for measuring the similarity between authors’ cocitation profiles is not very satisfactory. We then discuss what kind of similarity measures may be used as an alternative to the Pearson correlation. We consider three similarity measures in particular. One is the well-known cosine. The other two similarity measures have not been used before in the bibliometric literature. Finally, we show by means of an example that our findings have a high practical relevance.information science;Pearson correlation;cosine;similarity measure;author cocitation analysis

    Dispelling the Myths Behind First-author Citation Counts

    Full text link
    We conducted a full-scale evaluative citation analysis study of scholars in the XML research field to explore just how different from each other author rankings resulting from different citation counting methods actually are, and to demonstrate the capability of emerging data and tools on the Web in supporting more realistic citation counting methods. Our results contest some common arguments for the continued use of first-author citation counts in the evaluation of scholars, such as high correlations between author rankings by first-author citation counts and other citation counting methods, and high costs of using more realistic citation counting methods that are not well-supported by the ISI databases. It is argued that increasingly available digital full text research papers make it possible for citation analysis studies to go beyond what the ISI databases have directly supported and to employ more sophisticated methods
    corecore