Publication Server of Zuse Institute Berlin (ZIB)
Not a member yet
    6648 research outputs found

    From discrete-time policies to continuous-time diffusion samplers: Asymptotic equivalences and faster training

    No full text
    We study the problem of training neural stochastic differential equations, or diffusion models, to sample from a Boltzmann distribution without access to target samples. Existing methods for training such models enforce time-reversal of the generative and noising processes, using either differentiable simulation or off-policy reinforcement learning (RL). We prove equivalences between families of objectives in the limit of infinitesimal discretization steps, linking entropic RL methods (GFlowNets) with continuous-time objects (partial differential equations and path space measures). We further show that an appropriate choice of coarse time discretization during training allows greatly improved sample efficiency and the use of time-local objectives, achieving competitive performance on standard sampling benchmarks with reduced computational cost

    Influence of frictional drag on Kelvin-Helmholtz instability in viscous fluids

    No full text
    The Kelvin-Helmholtz instability arises at the interface between two fluid layers with a tangential velocity discontinuity, significantly impacting the safety of systems in industrial applications. Despite extensive studies, many theoretical analyses neglect viscosity and frictional drag due to the complexity of solving the dispersion equation. However, including these effects is crucial, as viscosity alters the growth rate of the instability, while frictional drag influences energy dissipation and momentum transfer. In this study, these effects are incorporated into both finite and infinite flows. The assumption of a potential flow combined with a dissipation method is employed to derive the dispersion equation, providing a more tractable approach than direct calculation methods. The results indicate that, in the case of infinite flows, the frictional drag suppresses the growth of the instability for long waves (small wavenumber k) but enhances it for short waves (large k). In contrast, the surface tension dominates, while the frictional drag only slightly affects the growth rate in the case of finite flows bounded by solid walls

    Beyond Benchmarks: Towards Robust Artificial Intelligence Bone Segmentation in Socio-Technical Systems

    No full text
    Despite the advances in automated medical image segmentation, AI models still underperform in various clinical settings, challenging real-world integration. In this multicenter evaluation, we analyzed 20 state-of-the-art mandibular segmentation models across 19,218 segmentations of 1,000 clinically resampled CT/CBCT scans. We show that segmentation accuracy varies by up to 25% depending on socio-technical factors such as voxel size, bone orientation, and patient conditions such as osteosynthesis or pathology. Higher sharpness, isotropic smaller voxels, and neutral orientation significantly improved results, while metallic osteosynthesis and anatomical complexity led to significant degradation. Our findings challenge the common view of AI models as “plug-and-play” tools and suggest evidence-based optimization recommendations for both clinicians and developers. This will in turn boost the integration of AI segmentation tools in routine healthcare

    Embedded Model Form Uncertainty Quantification with Measurement Noise for Bayesian Model Calibration

    No full text
    A key factor in ensuring the accuracy of computer simulations that model physical systems is the proper calibration of their parameters based on real-world observations or experimental data. Inevitably, uncertainties arise, and Bayesian methods provide a robust framework for quantifying and propagating these uncertainties to model predictions. Nevertheless, Bayesian methods paired with inexact models usually produce predictions unable to represent the observed datapoints. Additionally, the quantified uncertainties of these overconfident models cannot be propagated to other Quantities of Interest (QoIs) reliably. A promising solution involves embedding a model inadequacy term in the inference parameters, allowing the quantified model form uncertainty to influence non-observed QoIs. This paper introduces a more interpretable framework for embedding the model inadequacy compared to existing methods. To overcome the limitations of current approaches, we adapt the existing likelihood models to properly account for noise in the measurements and propose two new formulations designed to address their shortcomings. Moreover, we evaluate the performance of this inadequacy-embedding approach in the presence of discrepancies between measurements and model predictions, including noise and outliers. Particular attention is given to how the uncertainty associated with the model inadequacy term propagates to the QoIs, enabling a more comprehensive statistical analysis of prediction’s reliability. Finally, the proposed approach is applied to estimate the uncertainty in the predicted heat flux from a transient thermal simulation using temperature bservations

    Developing heuristic solution techniques for large-scale unit commitment models

    No full text
    Shifting towards renewable energy sources and reducing carbon emissions necessitate sophisticated energy system planning, optimization, and extension. Energy systems optimization models (ESOMs) often form the basis for political and operational decision-making. ESOMs are frequently formulated as linear (LPs) and mixed-integer linear (MIP) problems. MIPs allow continuous and discrete decision variables. Consequently, they are substantially more expressive than LPs but also more challenging to solve. The ever-growing size and complexity of ESOMs take a toll on the computational time of state-of-the-art commercial solvers. Indeed, for large-scale ESOMs, solving the LP relaxation -- the basis of modern MIP solution algorithms -- can be very costly. These time requirements can render ESOM MIPs impractical for real-world applications. This article considers a set of large-scale decarbonization-focused unit commitment models with expansion decisions based on the REMix framework (up to 83 million variables and 900,000 discrete decision variables). For these particular instances, the solution to the LP relaxation and the MIP optimum lie close. Based on this observation, we investigate the application of relaxation-enforced neighborhood search (RENS), machine learning guided rounding, and a fix-and-propagate (FP) heuristic as a standalone solution method. Our approach generated feasible solutions 20 to 100 times faster than GUROBI, achieving comparable solution quality with primal-dual gaps as low as 1% and up to 35%. This enabled us to solve numerous scenarios without lowering the quality of our models. For some instances that Gurobi could not solve within two days, our FP method provided feasible solutions in under one hour

    p-Laplacians for Manifold-valued Hypergraphs

    No full text
    Hypergraphs extend traditional graphs by enabling the representation of N-ary relationships through higher-order edges. Akin to a common approach of deriving graph Laplacians, we define function spaces and corresponding symmetric products on the nodes and edges to derive hypergraph Laplacians. While this has been done before for Euclidean features, this work generalizes previous hypergraph Laplacian approaches to accommodate manifold-valued hypergraphs for many commonly encountered manifolds

    Non-linear Battery Behavior in Electric Vehicle Scheduling Problems

    No full text
    The currently most popular approach to handle non-linear battery behavior for electric vehicle scheduling is to use a linear spline interpolation of the charge curve. We show that this can lead to approximate models that underestimate the charge duration and overestimate the state of charge, which is not desirable. While the error is of second order with respect to the interpolation step size, the associated mixed-integer linear programs do not scale well with the number of spline segments. It is therefore recommendable to use coarse interpolation grids adapted to the curvature of the charge curve, and to include sufficient safety margins to ensure solutions of approximate models remain feasible subjected to the exact charge curve

    The EuroCropsML time series benchmark dataset for few-shot crop type classification in Europe

    No full text
    We introduce EuroCropsML, an analysis-ready remote sensing dataset based on the open-source EuroCrops collection, for machine learning (ML) benchmarking of time series crop type classification in Europe. It is the first time-resolved remote sensing dataset designed to benchmark transnational few-shot crop type classification algorithms that supports advancements in algorithmic development and research comparability. It comprises 706683 multi-class labeled data points across 176 crop classes. Each data point features a time series of per-parcel median pixel values extracted from Sentinel-2 L1C data and precise geospatial coordinates. EuroCropsML is publicly available on Zenodo

    0

    full texts

    6,648

    metadata records
    Updated in last 30 days.
    Publication Server of Zuse Institute Berlin (ZIB)
    Access Repository Dashboard
    Do you manage Open Research Online? Become a CORE Member to access insider analytics, issue reports and manage access to outputs from your repository in the CORE Repository Dashboard! 👇