Portail HAL ENC (École nationale des chartes-PSL)
Not a member yet
3792 research outputs found
Sort by
Le livre dont vous êtes le héros
International audienceCe livre du début du 17e siècle documente un type de lecture particulier. Il se lit en effet avec des dés et est le support des jeux mondains provinciaux de l'époque. On lance trois dés, ce qui désigne un poème : ce quatrain indique au lecteur quel sera son avenir. L'article analyse le livre, son manuscrit (BnF, MSS, fr. 1702) et son auteur
Religions of Ancient South Arabia From Polytheism to Monotheism in the Context of Political, Economic and Social Evolutions (Fourth–Sixth Centuries ce )
International audienceABSTRACT The religions of South Arabia underwent major changes between the fourth and early seventh centuries. At the beginning of the fourth century, all the territories of ancient South Arabia were unified: the Ḥimyarite kings succeeded in unifying three ancient kingdoms, Ḥimyar, Sabaʾ and Ḥaḍramawt. A common language, the Sabaic, and a common era were adopted. As far as religion was concerned, the unity was more difficult to achieve. The Ḥimyarite kingdom did not have a strong centralized cult; the main gods of the Ḥimyarites were rarely mentioned and the various tribes, who had previously belonged to other kingdoms, worshipped a variety of gods. This is probably why the Ḥimyarites did not try to impose their own religion on the annexed territories. Rather, they preferred to preserve the worship of the main Sabaean god, Almaqah, in Mārib. Subsequently, as religious developments progressed in South Arabia, Jewish and Christian communities settled there and some inhabitants converted to Judaism. In the second half of the fourth century, the Ḥimyarite kings converted to Judaism, probably rather superficially, the majority of the population followed and the ancestral polytheistic cults were abandoned. From the late fourth to the sixth century, the dominant religion of the Ḥimyarite kingdom was a kind of Judaism or Judaizing monotheism, with Jewish communities living among the rather superficially Judaized population. Christians were present as well in merchant cities, but their presence is not documented in the local sources before the sixth century. In the sixth century, violent conflicts opposed Ḥimyarites and Ethiopians and also Jews and Christians; finally, Ethiopians conquered the kingdom of Ḥimyar and imposed Christianity as the official religion. These religious transformations are studied in context of political, economic and social evolutions, with the help of various sources, mainly South Arabian Pre‐Islamic inscriptions, but also Arab Islamic tradition and Roman and Byzantine sources
Entre maçonneries "tordues" et photos non rectifiées : approche exploratoire de vision par ordinateur en archéologie du bâti
International audienceLes études du bâti en archéologie reposent souvent sur des documents photogrammétriques orthonormés et des nuages de points 3D, qui offrent une précision inédite pour l'analyse des structures. Toutefois, ces méthodes restent difficilement applicables dans de nombreux contextes de terrain, faute de temps, de moyens ou en raison de l'inaccessibilité des monuments. Ce poster propose ainsi d'examiner, de manière exploratoire, le potentiel archéologique de photographies non orthonormées, source importante lorsque la documentation disponible est lacunaire ou non reproductible.En nous appuyant sur un corpus limité de monuments byzantins et ottomans du Péloponnèse à maçonnerie irrégulière, nous analysons des photographies d'élévations comportant des déformations et aberrations optiques. L'objectif est d'extraire des informations dimensionnelles simples et de comparer de manière relative les blocs et leur agencement (variations de taille, de proportions, de répartition). En intégrant, sur quelques cas d'étude, des métadonnées complémentaires (paramètres de prise de vue, mesures ponctuelles sur site, géoréférencement lorsque cela est possible), nous discutons dans quelle mesure ces informations peuvent améliorer la lisibilité et la cohérence des résultats.Le workflow s'appuie sur des modèles récents pour la détection et la segmentation des blocs (famille SAM et DINO) afin de définir des unités d'analyse, puis sur des mesures 2D dans l'image, complétées ponctuellement par des estimations de profondeur monoculaire (UniDepthV2, Metric3D-v2) lorsque cela se révèle pertinent. Quelques comparaisons avec des images mieux contraintes géométriquement permettent enfin de mettre en évidence les principales limites de cette approche et de proposer des pistes de réflexion sur l'usage raisonnable de photographies non orthonormées en archéologie du bâti
L’Orient, par-delà le romantisme
International audienceIn 1856, Lorédan Larchey (1831–1902) published a short work on Constantinople. Breaking with the romantic tradition of travel writing, he documented a country undergoing profound change, far removed from Orientalist clichés, with precision and a touch of humour. Reflecting the intellectual networks of Parisian student life, the work was published the following year by Auguste Poulet-Malassis, publisher of Les Fleurs du Mal: it was the fruit of intellectual and literary friendships on the fringes of the École des ChartesAbout: Lorédan Larchey, Un mois à Constantinople (A Month in Constantinople), Janvier 1855, Alençon, Poulet-Malassis, 1856. Purchased in bookshops, Bellamys World / Book Heritage (Lebanon), 2025. Library of the École nationale des chartes - PSL, 8 R 321.Lorédan Larchey (1831-1902) publie en 1856 un opuscule sur Constantinople. Rompant avec la tradition romantique du récit de voyage, il documente un pays en pleine mutation loin des clichés orientalistes, avec précision et non sans humour. Reflet des réseaux intellectuels du monde étudiant parisien, l’ouvrage est publié par Auguste Poulet-Malassis, éditeur des Fleurs du Mal dès l’année suivante : il est le fruit d’amitiés intellectuelles et littéraires dans les marges de l’École des chartes.A propos de : Lorédan Larchey, Un mois à Constantinople, janvier 1855, Alençon, Poulet-Malassis, 1856. Achat en librairie, Bellamys World / Book Heritage (Liban), 2025. Bibliothèque de l’École nationale des chartes - PSL, 8 R 321
Never Care For What They Say ? Platform vs Genre Rules in Online Horror Narratives (2007-2024)
Research on online cultural production shows that platforms are acting as mediators that can heavily shape textual form. Yet, empirical work is often platform-bounded, making it difficult to assess whether stylistic regularities that we observe are indeed genre signals or if some of them are platform artefacts. We address this question through a cross-platform design focused on creepypasta, a digital-born horror genre circulating across heterogeneous infrastructures. Using a corpus of ∼23,000 Englishlanguage stories published from 2007 to 2024 on Reddit's /r/nosleep and the Creepypasta Fandom wiki, we compare stylistic profiles across platforms and relate them to differences in rule regimes and moderation practices, established through qualitative extraction and close reading of platform guidelines. Across readability indices, lexical diversity measures, syntactic proxies, and a cross-fit feature-based model, we find that platform membership leaves only a narrow stylistic imprint, largely reducible to a single architectural rule: r/NoSleep's mandatory first-person narration. Beyond this constraint, differences are modest and fail to form coherent platform-specific stylistic signatures. This helps us define what is stylistically common in creepypastas, and understand what the genre is to its writers beyond the topics it deals with or the platform it is written on
HistoriQA-ThirdRepublic: Multi-Hop Question Answering Corpus for Historical Research, Parliamentary Debates from the
We present HistoriQA-ThirdRepublic: a French-language dataset of multi-hop historical questions derived from parliamentary debates and newspapers of the French Third Republic. Designed in collaboration with a historian, the corpus captures complex reasoning patterns typical of historical inquiry, including cross-source synthesis, temporal reasoning, and the integration of sparse evidence. The dataset is made of 1782 questions and emphasizes multi-hop connections across heterogeneous historical documents, providing a resource for evaluating retrieval-augmented and large language model systems in domain-specific contexts. We describe the methodology for constructing the corpus, including the selection and alignment of sources, question validation, and metadata integration. While the dataset focuses on French historical documents, our methodology can be readily adapted to other languages and national corpora. Finally, we demonstrate how the corpus can support realistic evaluation scenarios for multi-hop question answering, bridging the gap between NLP benchmarks and the needs of historical scholarship.</div
Écrire sur l’encyclopédie collaborative Wikipédia: Les biais de genre sur le 8e site internet le plus consulté au monde
International audienceThis presentation discusses the various gender biases found in Wikipedia articles.Cette présentation aborde les différents biais de genre que l'on retrouve dans les articles de Wikipédia
Under-resourced studies of under-resourced languages: lemmatization and POS-tagging with LLM annotators for historical Armenian, Georgian, Greek and Syriac
Low-resource languages pose persistent challenges for Natural Language Processing tasks such as lemmatization and part-of-speech (POS) tagging. This paper investigates the capacity of recent large language models (LLMs), including GPT-4 variants and open-weight Mistral models, to address these tasks in few-shot and zero-shot settings for four historically and linguistically diverse under-resourced languages: Ancient Greek, Classical Armenian, Old Georgian, and Syriac. Using a novel benchmark comprising aligned training and out-of-domain test corpora, we evaluate the performance of foundation models across lemmatization and POS-tagging, and compare them with PIE, a task-specific RNN baseline. Our results demonstrate that LLMs, even without fine-tuning, achieve competitive or superior performance in POS-tagging and lemmatization across most languages in few-shot settings. Significant challenges persist for languages characterized by complex morphology and non-Latin scripts, but we demonstrate that LLMs are a credible and relevant option for initiating linguistic annotation tasks in the absence of data, serving as an effective aid for annotation.
Comment Hélinand de Froidmont travaillait-il ? Découvrir ses notes grâce à des collections d’exempla cisterciennes
International audienc
The Patrologia Graeca Corpus: OCR, Annotation, and Open Release of Noisy Nineteenth-Century Polytonic Greek Editions
International audienceWe present the Patrologia Graeca Corpus, the first large-scale open OCR and linguistic resource for nineteenthcentury editions of Ancient Greek. The collection covers the remaining undigitized volumes of the Patrologia Graeca (PG), printed in complex bilingual (Greek-Latin) layouts and characterized by highly degraded polytonic Greek typography. Through a dedicated pipeline combining YOLO-based layout detection and CRNN-based text recognition, we achieve a character error rate (CER) of 1.05% and a word error rate (WER) of 4.69%, largely outperforming existing OCR systems for polytonic Greek. The resulting corpus contains around six million lemmatized and part-of-speech tagged tokens, aligned with full OCR and layout annotations. Beyond its philological value, this corpus establishes a new benchmark for OCR on noisy polytonic Greek and provides training material for future models, including LLMs