1,721,029 research outputs found
Astronomical Data Mining with Neural Networks
We give a brief overview of artificial neural networks (ANNs), focusing on Kohonen networks (KNs). The two kinds of KNs will be described in detail: the unsupervised self-organizing map (SOM) and the supervised learning vector quantization (LVQ). We then apply these algorithms to two astronomical classification problems: the classification of broad absorption line quasars (BALQSOs) and of gamma-ray bursts (GRBs). In the context of BALQSOs, we find a BALQSO fraction of 10.4%, and compile a catalogue from the Sloan Digital Sky Survey (SDSS) using the supervised LVQ. This is currently the most complete BALQSO catalogue. We then apply the unsupervised SOM to GRB light curves obtained from the Burst and Transient Source Experiment (BATSE). Using only shape-dependent variables, we find that two classes are recovered: single-pulsed bursts (SPBs) and multi-pulsed bursts (MPBs). We show that these two network classes also have different observational properties that are independent of light curve shape (T90 and fluence), suggesting an intrinsic difference between the two. We conclude with some attempts to correlate our GRB result to previous studies and suggest improvements for future work
Machine learning from hard x-ray surveys: applications to magnetic cataclysmic variable studies
Within this thesis are discussed two main topics of contemporary astrophysics. The first is that of machine learning algorithms for astronomy whilst the second is that of magnetic cataclysmic variables (mCVs). To begin, an overview is given of ISINA: INTEGRAL Source Identification Network Algorithm. This machine learning algorithm, using random forests, is applied to the IBIS/ISGRI data set in order to ease the production of unbiased future soft gamma-ray source catalogues. The feature extraction process on an initial candidate list is described together with feature merging. Three trainng and testing sets are created in order to deal with the diverse time-scales encountered when dealing with the gamma-ray sky: one dealing with faint persistent source recognition, one dealing with strong persistent sources and a final one dealing with transients. For the latter, a new transient detection technique is introduced and described: the transient matrix. Finally the performance of the network is assessed and discussed using the testing set and some illustrative source examples. ISINA is also compared to the more conventional approach of visual inspection. Next mCVs are discussed, and in particular the properties arising from a hard X-ray selected sample which has proven remarkably efficient in detecting intermediate polars and asynchronous polars, two of the rarest type of cataclysmic variables (CVs). This thesis focuses particularly on the link between hard X-ray properties and spin/orbital periods. To this end, a new sample of these objects is constructed by cross-corelating candidate sources detected in INTEGRAL/IBIS observations against catalogues of known CVs. Also included in the analysis are hard X-ray Observations from Swift/BAT and SUZAKU/HXD in order to make the study more complete. It is found that most hard X-ray detected mCVs have Pspin/Porb<0.1 above the period gap. In this respect, attention is given to the very low number of detected systems in any ban between Pspin/Porb = 0.3 and Pspin/Porb = 1 and the apparent peak of the Pspin/Porb distribution at about 0.1. The observational features of the Pspin - Porb plane are discussed in the context of mCV evolution scenarios. Also presented is evidence for correlations between hard X-ray spectral hardness and Pspin, Porb and Pspin/Porb. An attempt to explain the observed correlations is made in th context of mCV evolution and accretion footpring geometrirs on the whit dwarf surface
ISINA : INTEGRAL source identification network algorithm
We give an overview of ISINA: INTEGRAL Source Identification Network Algorithm. This machine learning algorithm, using random forests, is applied to the IBIS/ISGRI data set in order to ease the production of unbiased future soft gamma-ray source catalogues. First, we introduce the data set and the problems encountered when dealing with images obtained using the coded mask technique. The initial step of source candidate searching is introduced and an initial candidate list is created. A description of the feature extraction on the initial candidate list is then performed together with feature merging for these candidates. Three training and testing sets are created in order to deal with the diverse time-scales encountered when dealing with the gamma-ray sky. Three independent random forests are built: one dealing with faint persistent source recognition, one dealing with strong persistent sources and a final one dealing with transients. For the latter, a new transient detection technique is introduced and described: the transient matrix. Finally the performance of the network is assessed and discussed using the testing set and some illustrative source examples
The intrinsic fraction of broad-absorption line quasars
We carefully reconsider the problem of classifying broad-absorption line quasars (BALQSOs) and derive a new, unbiased estimate of the intrinsic BALQSO fraction from the Sloan Digital Sky Survey (SDSS) DR3 quasi-stellar object (QSO) catalogue. We first show that the distribution of objects selected by the so-called 'absorption index' (AI) is clearly bimodal in log AI, with only one mode corresponding to definite BALQSOs. The surprisingly high BALQSO fractions that have recently been inferred from AI-based samples are therefore likely to be overestimated. We then present two new approaches to the classification problem that are designed to be more robust than the AI, but also more complete than the traditional 'balnicity index' (BI). Both approaches yield observed BALQSO fractions around 13.5 per cent, while a conservative third approach suggests an upper limit of 18.3 per cent. Finally, we discuss the selection biases that affect our observed BALQSO fraction. After correcting for these biases, we arrive at our final estimate of the intrinsic BALQSO fraction. This is fBALQSO= 0.17 ± 0.01 (stat) ± 0.03 (sys) with an upper limit of fBALQSO? 0.23 . We conclude by pointing out that the bimodality of the log AI distribution may be evidence that the BAL-forming region has clearly delineated physical boundaries
Going Beyond Counting First Authors in Author Co-citation Analysis
The present study examines one of the fundamental aspects of author co-citation analysis (ACA) - the way co-citation
counts are defined. Co-citation counting provides the data on which all subsequent statistical analyses and mappings
are based, and we compare ACA results based on two different types of co-citation counting - the traditional type that
only counts the first one among a cited work's authors on the one hand and a non-traditional type that takes into
account the first 5 authors of a cited work on the other hand. Results indicate that the picture produced through this non-traditional author co-citation counting contains more coherent author groups and is therefore considerably clearer. However, this picture represents fewer specialties in the research field being studied than that produced through the traditional first-author co-citation counting when the same number of top-ranked authors is selected and analyzed. Reasons for these effects are discussed
Variations on the Author
“Variations on the Author” discusses two of Eduardo Coutinho’s recent films (Um Dia na Vida, from 2010, and Últimas Conversas, posthumously released in 2015) and their contribution to the general question of documentary authorship. The director’s filmography is characterized by a consistent yet self-effacing form of authorial self-inscription: Coutinho often features as an interviewer that rather than express opinions propels discourses; an interviewer that is good at listening. This mode of self-inscription characterizes him as an author who is not expressive but who is nonetheless markedly present on the screen. In Um Dia na Vida, however, Coutinho is completely absent form the image, while Últimas Conversas, on the contrary, includes a confessional prologue that moves the director from the margins to the center of his films. This article examines the ways in which these works stand out in the filmography of a director who offers new insights into the notion of cinematic authorship
Appropriate Similarity Measures for Author Cocitation Analysis
We provide a number of new insights into the methodological discussion about author cocitation analysis. We first argue that the use of the Pearson correlation for measuring the similarity between authors’ cocitation profiles is not very satisfactory. We then discuss what kind of similarity measures may be used as an alternative to the Pearson correlation. We consider three similarity measures in particular. One is the well-known cosine. The other two similarity measures have not been used before in the bibliometric literature. Finally, we show by means of an example that our findings have a high practical relevance.information science;Pearson correlation;cosine;similarity measure;author cocitation analysis
Classifying optical (out)bursts in cataclysmic variables: the distinct observational characteristics of dwarf novae, micronovae, stellar flares and magnetic gating
Cataclysmic variables can experience short optical brightenings, which are commonly attributed to phenomena such as dwarf novae outbursts, micronovae, donor flares, or magnetic gating bursts. Since these events exhibit similar observational characteristics, their identification has often been ambiguous. In particular, magnetic gating bursts and micronovae have been suggested as alternative interpretations of the same phenomena. Here we show that the timescales and energies separate the optical brightenings into separate clusters consistent with their different classifications. This suggests that micronovae and magnetic gating bursts are in fact separate phenomena. Based on our findings, we develop diagnostic diagrams that can distinguish between these bursts/flares based on their properties. We demonstrate the effectiveness of this approach on observations of a newly identified intermediate polar, CTCV J0333-4451, which we classify as a magnetic gating system. CTCV J0333-4451 is the third highest spin-to-orbital period ratio intermediate polar with magnetic gating, suggesting that these bursts are common among these rare systems.</p
Dispelling the Myths Behind First-author Citation Counts
We conducted a full-scale evaluative citation analysis study of scholars in the XML research field to explore just how different from each other author rankings resulting from different citation counting methods actually are, and to demonstrate the capability of emerging data and tools on the Web in supporting more realistic citation counting methods. Our results contest some common arguments for the continued
use of first-author citation counts in the evaluation of scholars, such as high correlations between author rankings by first-author citation counts and other citation
counting methods, and high costs of using more realistic citation counting methods that are not well-supported by the ISI databases. It is argued that increasingly available digital full text research papers make it possible for citation analysis studies to go beyond what the ISI databases have directly supported and to employ more
sophisticated methods
- …
