Recherche Data Gouv
Not a member yet
4753 research outputs found
Sort by
Historical pictures (1875-1975) of astronomical instruments at Paris, Meudon, Pic du Midi, and Saint Véran Observatories
This dataset of historical pictures and photographs was gathered by the author during his career from many sources, such as observatory archives, articles, scientific reviews or annals. It concerns the instrumental history of some telescopes and spectrographs, in particular those dedicated to solar physics (as this is the speciality of the author). Some images were extracted from old astronomical collections or periodic journals (such as annals of Meudon Observatory digitized by Gallica/BNF). The archive contains also a few movies. Several papers of the author about the history of instruments (such as Meudon spectroheliograph, heliographs, Pic du Midi Turret Dome and flare spectrographs, Saint Véran coronagraph) are included in the deposit. All pictures are for non commercial use only; for most of them, the author could not be identified and are now in the public domain; however, a credit must be given to the observatories
Metabolites characterization in boosting orphan legumes from the Mediterranean Basin: INRAE 2024 trial datasets of accession Rayhane
Results of the greenhouse trial on Rayhane accession at INRAE-Versailles/ France
The trial was conducted from september 2023 to March 2024.
These data refer to metabolites (Chlorophyll, Anthocyanins, Flavonols and NBI ratio) measurements in Rayhane accession.
The dataset is organised following the MIAPPE (Minimum Information About Plant Phenotyping Experiments) guidelines
The biological material file contains the information on the biological materials used for the trial.
The study and the observed variables files explains the observed variables and describe the study.
In the dataset file, the study_ID and the biologicalmaterial_ID refers to the identification of the trial from which the dataset is build.
The "Species_ID" column refers to the specy used in the study.
The "Accessions" column refers to the accession used in the study.
The "Treatment_ID" column refers to the treatment that is applied.
The "Days" columns refers to the different stage of development of the plant when the metabolites measurements are collected.
The 'Chlorophyll', 'Anthocyanins', 'Flavonols' and 'NBI (Nitrogen Balance Ratio)' columns refer to the measurement results of each of these metabolites. The measurement unit is µg/cm².
</br
Indicateurs des séries annuelles issus des projections hydrologiques Explore2 pour le modèle SIM2 sous RCP 2.6
Indicateurs des séries annuelles issus des débits journaliers simulés par le modèle hydrologique SIM2 pour l'ensemble des projections climatiques Explore2 sous RCP 2.6. Ces fichiers résultent de l'agrégation temporelle des simulations hydrologiques sous runs historiques (avant 2005) et des projections hydrologiques (post 2005), fichiers NetCDF disponibles au téléchargement dans la collection Explore2 - Projections hydrologiques.
Ce dépôt regroupe un tableau par indicateur et chaîne de simulation, c'est-à-dire, scénario d'émission RCP, couple GCM/RCM, correction de biais BC et modèle hydrologique HM. Ces données sont brutes et contiennent donc des chaînes de projections jugées aberrantes / horsains qu'il est possible de filtrer grâce à des métadonnées supplémentaires. Pour des raisons techniques, ces indicateurs sont regroupés par dossiers compressés selon les différentes phases du régime hydrologique.
La description des chaines de modélisation du climat et celle des modèles hydrologiques sont, respectivement, disponibles dans le rapport https://doi.org/10.57745/PUR7ML et dans les annexes du rapport https://doi.org/10.57745/S6PQXD. Retrouvez le diagnostic des modèles hydrologiques résumé à l'échelle des régions hydrologiques dans les fiches téléchargeables ici : https://doi.org/10.57745/DMFUXW.
Métadonnées supplémentaires : Récapitulatif de l'ensemble des indicateurs hydrologiques : https://doi.org/10.57745/JVNHQL Récapitulatif de l'ensemble des chaînes de simulation : https://doi.org/10.57745/R6HG5X Description de l'ensemble des points de simulation : https://doi.org/10.57745/UTKWR5 Liste des chaînes de modélisation jugées aberrantes / horsains : https://doi.org/10.57745/YZNENQ Récapitulatif des années pivots utilisées pour la TRACC : https://doi.org/10.57745/DCOQM6
Décomposition des chaînes de caractères formant le nom des fichiers parquet, séparées par des "_" : {1} Indicateur : Le nom de l’indicateur, du type de statistique calculée {2} Échantillonnage : Échantillonnage temporel sur laquelle est calculé l’indicateur → {1}_{2} Variable : Variable résultante d'un indicateur temporellement contextualisé {3} EXP : Identifiant de l’expérience historique (post 2005) ou future (post 2005) {4} GCM : Identifiant du GCM forçeur {5} RCM : Identifiant du RCM {6} BC : Identifiant de la méthode de correction de biais statistique {7} HM : Identifiant du modèle hydrologique
Les colonnes des fichiers parquet sont : EXP : Voir ci-dessus GCM : Voir ci-dessus RCM : Voir ci-dessus BC : Voir ci-dessus HM : Voir ci-dessus code : Code à 10 caractères du point de simulation fourni dans la description des points de simulation date : Date du début de la période annuelle d'agrégation (i.e. 2042-05-01 indique que l'année hydrologique commence en mai, plus d'information dans les métadonnées de variable) *Variable* : Voir ci-dessus
Retrouvez des scripts d'aide pour utiliser ces données parquet
Freshwater zooplankton dataset for automated image recognition and size measurement across seasons from three large peri-alpine lakes
We present here a dataset of freshwater crustacean zooplankton species and measures compiled using ZooScan (HYDROPTIC Inc.) on three French peri-alpine lakes that can be used as a model database to train automated identification using the online interface Ecotaxa (https://ecotaxa.obs-vlfr.fr)
Behavioral Measures on lambs Exposed to Olfactory Stimuli in a Positive Context
The dataset presented comes from an experimental study on the behavioral responses of lambs exposed to olfactory stimuli in a positive context. The study involved 30 lambs aged 3 months and 30 lambs aged 4 months, Ile-de-France. The animals were categorized based on age and sex (see the Animal_Data_lamb file). They were born at the UEPAO (Unité Expérimentale de Physiologie Animale de l'Orfrasière, INRAE, Nouzilly; doi:10.15454/1.5573896321728955E12).
This individual behavioral test was conducted over a 5-minute period, during which each lamb was placed in an experimental setup containing a social resource (three companions) and a food resource, aiming to create a positive context. The lambs were exposed to various olfactory stimuli to study their behavioral responses, including water (control condition), wolf feces (repulsive), orange essential oil (attractive), and cadaverine (repulsive).
The dataset includes detailed behavioral measurements, such as the total time spent in the food zone (Food_Zone_TS), the number of eating occurrences (Eat_Total_Occ), the time spent in the social zone (Social_Zone_TS), and vocalizations (Voc_Occ), recorded during exposure to the different olfactory stimuli. Additional variables, such as food intake, were also measured to assess the behavioral responses of the ewes (see the Behavioral_Data files). The behaviors were analyzed using video recordings processed with BORIS software
Daily and hourly climatic data from 54 stations of the INRAE Agroclim network
Measured daily and hourly climatic data from 54 currently active stations of the INRAE AgroClim network in metropolitan France, Guyana and Guadeloupe. Each station is composed by 12 sensors and measures air temperature under shelter, air humidity under shelter, wind speed and direction at 2 m, precipitation, global solar radiation, photosynthetically active radiation, leaf wetness duration, air temperature at 10 and 50 cm, and soil temperature at 10 and 50 cm depths.
The daily data is filtered to remove main outliers, with season-dependent thresholds.
Hourly data is raw and not yet filtered or validated.Les données climatiques horaires et journalières sont collectées à partir de 54 stations en France, y compris en Corse, en Guadeloupe et en Guyane. Chaque station est composée de 12 capteurs et mesure la température de l'air sous abri, l'humidité de l'air sous abri, la vitesse et la direction du vent, les précipitations, le rayonnement solaire global, le rayonnement photosynthétiquement actif, la durée d'humectation des feuilles, la température de l'air à 10 et 50 cm, et la température du sol à 10 et 50 cm de profondeur. Les données sont filtrées pour éliminer les principales valeurs aberrantes, avec des seuils dépendant de la saison
Extensive feedback on soil and water bioengineering projects in the Alps
This dataset comprises the data gathered in the field for Juliette Rousset's PhD thesis. It encompasses all the measured and observed variables during the field campaign summer 2024, which aims to observe and characterise soil and water bioengineering projects in Alps.
The SWBE projects were characterised by topographic surveys (cross section and slope), granulometric surveys, vegetation surveys and failure observations
gyrB database for taxonomic assignment formatted for DADA2
This dataset contains gyrB amplicon database used for taxonomic assignment of gyrB barcode sequencing performed according to the protocol described in “Emergence Shapes the Structure of the Seed Microbiota, Barret et al 2015”.
The database includes gyrB sequences from Public repositories of genomes and sequences (Ensembl Bacteria, NCBI, JGI).
- annotated genes gyrB (TIGR01059) at JGI (2014)
- annotated sequences gyrB at ensemblBacteria (2018)
- ampilicons extracted in silico from primers described in Barret et al 2015 from bacteria genomes available at NCBI in 2019
Curated and Analysed Historical Barley (Hordeum sps.) Phenotypic Data from seven European genebanks
This dataset compiles historical phenotypic records of Barley (Genus: Hordeum) accessions evaluated across multiple years and locations by seven European genebanks participating in the AGENT project. The data originate from long-term evaluation trials conducted under diverse agro-ecological conditions and represent one of the most comprehensive multi-environment phenotypic collections of barley genetic resources in Europe.
The dataset includes raw phenotypic data, outlier-corrected data, and best linear unbiased estimates (BLUEs) for a wide range of agro-morphological and developmental traits. Traits typically include, but are not limited to, days to heading (DTH), plant height (PH), thousand kernel weight (TKW), and other agronomic descriptors recorded according to national or genebank-specific descriptors.
The outlier-corrected dataset represents cleaned phenotypic data after statistical identification and removal of erroneous values using a mixed-model approach. The BLUE dataset provides adjusted means per accession, accounting for design and replication effects, and serves as a robust input for downstream analyses such as genomic prediction and genotype x environment interaction studies.
The author gratefully acknowledges the teams from IPK, INRAE and the CARC genebank for their valuable support and guidance, as well as the collaboration of all AGENT genebanks contributing to this study.
References:
1) Bernal-Vasquez, AM., Utz , HF. & Piepho, HP. Outlier detection methods for generalized lattices: a case study on the transition from ANOVA to REML. Theor Appl Genet 129, 787–804 (2016). https://doi.org/10.1007/s00122-016-2666-6
2) Philipp N, Weise S, Oppermann M, Börner A, Graner A, Keilwagen J, Kilian B, Zhao Y, Reif JC and Schulthess AW (2018) Leveraging the Use of Historical Data Gathered During Seed Regeneration of an ex Situ Genebank Collection of Wheat. Front. Plant Sci. 9:609. doi: 10.3389/fpls.2018.0060
Dataset for common wheat (Triticum aestivum L.) grain and flour characterization using classical and advanced analyses: raw and calculated analytical data SPARQL queries
This dataset is composed of the 28 SPARQL queries executed to generate the measurement tables which are included in the files belonging to dataset containing the data tables results of the queries execution. They have the same name. They only differ by their extension. By example, CWG_reception_fallingNumber_raw.sparql is the file including the SPARQL query executed to obtain the table included in the file CWG_reception_fallingNumber_raw.tsv