1,720,976 research outputs found
Weaving enterprise knowledge graphs: The case of company ownership graphs∗
Motivated by our experience in building the Enterprise Knowledge Graph of Italian companies for the Central Bank of Italy, in this paper we present an in-depth case analysis of company ownership graphs, graphs having company ownership as a central concept. In particular, we study and introduce three industrially relevant problems related to such graphs: company control, asset eligibility and detection of personal links. We formally characterize the problems and present Vada-Link, a framework based on state-of-the-art approaches for knowledge representation and reasoning. With our methodology and system, we solve the problems at hand in a scalable, model-independent and generalizable way. We illustrate the favourable architectural properties of Vada-Link and give experimental evaluation of the approach
Eliminating Harmful Joins in Warded Datalog+/−
We provide a rewriting technique of Warded Datalog+/− settings to sustain decidability and data tractability of reasoning tasks in the presence of existential quantification and recursion. To achieve this behaviour in practice, reasoners implement specialized strategies which exploit the theoretical bases of the language to control the effects of recursion, ensuring reasoning termination with small memory footprint. However, as a necessary condition for such exploitation, the setting is required to be in a “normalized form”, essentially without joins on variables affected by existential quantification. We present the Harmful Join Elimination, a normalization algorithm of Warded Datalog+/− that removes such “harmful” joins, supporting the tractability of the reasoning task as well as the full expressive power of the language. The algorithm is integrated in the Vadalog system, a Warded Datalog+/− -based reasoner that performs ontological reasoning in complex scenarios
iWarded: A Versatile Generator to Benchmark Warded Datalog plus /- Reasoning
Warded Datalog+/- is a powerful member of the Datalog+/- family, which extends the logic language Datalog with existential quantification and provides full support for recursion. Such expressive power, paired with a promising trade-off with the offered data complexity, was the catalyst for the recent rise of the language as a relevant candidate for knowledge graph traversal and ontological reasoning applications. Despite the growing research and industrial interest towards Warded Datalog+/-, we observe a substantial lack of specific tools able to generate non-trivial settings and benchmark scenarios, essential to evaluate, analyze and compare reasoning systems over such tasks. In this paper, we aim at filling this gap by introducing iWarded, a versatile generator of Warded Datalog+/- benchmarks. Our system is able to efficiently create very large, complex, and realistic reasoning settings while providing extensive control over the theoretical underpinnings of the language. iWarded was developed and employed in the context of the Vadalog system, a state-of-the-art Warded Datalog+/-based reasoner
Neither in the Programs Nor in the Data: Mining the Hidden Financial Knowledge with Knowledge Graphs and Reasoning
Vadalog is a logic-based reasoning language for modern AI solutions, in particular for Knowledge Graph (KG) systems. It is showing very effective applicability in the financial realm, with success stories in a vast range of scenarios, including: creditworthiness evaluation, analysis of company ownership and control, prevention of potential takeovers of strategic companies, prediction of hidden links between economic entities, detection of family businesses, smart anonymization of financial data, fraud detection and anti-money laundering. In this work, we first focus on the language itself, giving a self-contained and accessible introduction to Warded Datalog+/-, the formalism at the core of Vadalog, as well as to the Vadalog system, a state-of-the-art KG system. We show the essentials of logic-based reasoning in KGs and touch on recent advances where logical inference works in conjunction with the inductive methods of machine learning and data mining. Leveraging our experience with KGs in Banca d’Italia, we then focus on some relevant financial applications and explain how KGs enable the development of novel solutions, able to combine the knowledge mined from the data with the domain awareness of the business experts
Going Beyond Counting First Authors in Author Co-citation Analysis
The present study examines one of the fundamental aspects of author co-citation analysis (ACA) - the way co-citation
counts are defined. Co-citation counting provides the data on which all subsequent statistical analyses and mappings
are based, and we compare ACA results based on two different types of co-citation counting - the traditional type that
only counts the first one among a cited work's authors on the one hand and a non-traditional type that takes into
account the first 5 authors of a cited work on the other hand. Results indicate that the picture produced through this non-traditional author co-citation counting contains more coherent author groups and is therefore considerably clearer. However, this picture represents fewer specialties in the research field being studied than that produced through the traditional first-author co-citation counting when the same number of top-ranked authors is selected and analyzed. Reasons for these effects are discussed
Reasoning on company takeovers during the COVID-19 crisis with knowledge graphs
When some country takes a disproportionate hit by a large-scale turmoil—just like Italy did during the COVID-19 pandemics—the share prices of its companies plunge. Suddenly, it becomes feasible to attempt foreign takeovers of national assets, including those of strategic interest. To avert this risk, the Government can veto transactions by summoning the so-called “Golden Powers”. Or, it can work to proactively identify structural weaknesses in the control or shareholding chains of key companies, in order to reinforce them without resorting to special powers. Sometimes, vulnerabilities and attacks hide in plain sight due to how complex and intertwined the network of mutual company shareholding is. In this work, we show how to leverage Knowledge Graphs (KGs) as a representation and reasoning framework to analyze both reactive and proactive measures against takeover attempts, however intricate the setting where they take place. We formally characterize a set of reasoning tasks that define when and if to employ Golden Powers, plus others that aim at pinpointing companies prone to attacks. These criteria are exercised on the real network of all Italian companies, built for the occasion. A rich set of experiments is provided, including on several large synthetic instances, to prove the robustness of our method
Variations on the Author
“Variations on the Author” discusses two of Eduardo Coutinho’s recent films (Um Dia na Vida, from 2010, and Últimas Conversas, posthumously released in 2015) and their contribution to the general question of documentary authorship. The director’s filmography is characterized by a consistent yet self-effacing form of authorial self-inscription: Coutinho often features as an interviewer that rather than express opinions propels discourses; an interviewer that is good at listening. This mode of self-inscription characterizes him as an author who is not expressive but who is nonetheless markedly present on the screen. In Um Dia na Vida, however, Coutinho is completely absent form the image, while Últimas Conversas, on the contrary, includes a confessional prologue that moves the director from the margins to the center of his films. This article examines the ways in which these works stand out in the filmography of a director who offers new insights into the notion of cinematic authorship
Appropriate Similarity Measures for Author Cocitation Analysis
We provide a number of new insights into the methodological discussion about author cocitation analysis. We first argue that the use of the Pearson correlation for measuring the similarity between authors’ cocitation profiles is not very satisfactory. We then discuss what kind of similarity measures may be used as an alternative to the Pearson correlation. We consider three similarity measures in particular. One is the well-known cosine. The other two similarity measures have not been used before in the bibliometric literature. Finally, we show by means of an example that our findings have a high practical relevance.information science;Pearson correlation;cosine;similarity measure;author cocitation analysis
Fine-Tuning Large Enterprise Language Models via Ontological Reasoning
Large Language Models (LLMs) exploit fine-tuning as a technique to adapt to diverse goals, thanks to task-specific training data. Task specificity should go hand in hand with domain orientation, that is, the specialization of an LLM to accurately address the tasks of a given realm of interest. However, models are usually fine-tuned over publicly available data or, at most, over ground data from databases, ignoring business-level definitions and domain experience. On the other hand, Enterprise Knowledge Graphs (EKGs) are able to capture and augment such domain knowledge via ontological reasoning. With the goal of combining LLM flexibility with the domain orientation of EKGs, we propose a novel neurosymbolic architecture that leverages the power of ontological reasoning to build task- and domain-specific corpora for LLM fine-tuning
- …
