File synchronization as a way to add quality metadata to research data
Research data placed in long-term storage needs quality metadata attached so that it can be found in the future. Metadata facilitates the reuse of data by third parties and makes it citable in new research contexts and for new research questions. However, better tools are needed to help researchers add metadata and prepare their data for publication. These tools should integrate well into researchers' existing workflows, so that metadata can be enriched even while the data is being created, gathered, or collected. In this thesis, an existing data publication tool from the DARIAH-DE project was connected to proven file synchronization software, allowing researchers to prepare data on their personal computers and mobile devices and make it ready for publication. The goal of the thesis was to find out whether the use of file synchronization software eases the data publication process for researchers.
Bestimmung relevanter Worte eines Textes und Darstellung unter der Oberfläche TextGrid-Workbench [Determining the relevant words of a text and displaying them in the TextGrid Workbench interface]
This bachelor thesis describes the development of an Eclipse-based plug-in for the TextGrid Workbench. To give a quick overview of what a given, presumably unfamiliar text is about, the plug-in offers an easy way to find the most relevant words in the text. In addition, the plug-in extends the TextGrid search form with a way to look up the meanings of the search terms entered. Both features are presented in two newly implemented Eclipse perspectives, in each of which the user can display the hypernyms and hyponyms linked to each word shown and thus navigate effectively through the semantic context. To achieve this, the RDF version of WordNet is used together with a text mining tool.
Making the Repository Programmable: The TextGrid Repository as a multi-layered Research Environment
The TextGrid Repository (TGR) is a dedicated research-data repository for the humanities and cultural studies that specializes in XML/TEI-encoded texts. Developed in the DFG-funded TextGrid project from 2006 to 2015, TGR was a pioneering infrastructure, as it embraced the TEI format, a de facto standard in digital humanities and an essential foundation for computational philology. Initially, TGR offered basic storage, download, archiving, and structured metadata for literary texts. Over time, it has evolved into a sophisticated research environment that transcends conventional archival functions. To support advanced scholarly workflows, TGR integrates tools for automated text analysis and annotation, such as Voyant Tools [1], the Language Resource Switchboard [2], and the Annotation Sandbox [3], with direct export capabilities, thereby lowering technical barriers and streamlining complex analyses. Its incorporation into the NFDI consortium Text+ ushers in a new era of modernization, component upgrades, and enhanced user engagement, opening TGR to emerging generations of researchers. Contemporary literary and linguistic scholars demand virtual research environments that differ markedly from those of earlier years. Text-editing projects now emphasize rich presentation layers and custom transformations for reading and highlighting annotated data. Computational literary studies require straightforward access to plain text, programmatic interfaces, and libraries. Library-driven initiatives prioritize authority data integration. Some digital humanities inquiries hinge on author attributes, such as gender, while corpus linguistics projects center on detailed linguistic annotations. TGR addresses these diverse requirements by unifying multiple services and access modalities.
From the end user's vantage point, data can be retrieved via direct reading links, faceted search in the portal's graphical interface, persistent identifiers (PIDs), or programmable interfaces, including the Python client library tg_client [5]. Prospective data publishers receive expert guidance on metadata quality. In the Text+ context, TGR now offers new services alongside established tools: Notebook Actions [6], which provide a graphical import interface in Jupyter Notebooks, and tg_model [7], which generates the metadata documents required for data ingestion, complement tg-crud [8] and tg_admin [9], which handle repository maintenance and document management. Collectively, these enhancements simplify and accelerate data import and publication workflows. A clear indicator of TGR's transformation is the surge in new projects over recent years, which has greatly enriched the repository's content. Whereas TGR once catered primarily to German studies, it now houses materials in over one hundred languages and multiple script systems (including Coptic, Cyrillic, Arabic, Hebrew, Amharic, Chinese, Japanese, Korean, and Armenian), reflecting the needs of a broad spectrum of disciplines. In our presentation, we will demonstrate key new functionalities and illustrate how TGR's current multi-layered research environment departs from its original archival role. Special emphasis will be placed on the latest automated processes, which not only facilitate but actively promote computer-assisted analyses, all while ensuring the highest standards of metadata quality.
