MelissaDL x Breed: Towards Data-Efficient On-line Supervised Training of Multi-parametric Surrogates with Active Learning

Sofya Dymchenko; Abhishek Purandare; Bruno Raffin

Conference Papers Year : 2024

Sofya Dymchenko

Function : Author
PersonId : 1315621
IdHAL : sofya-dymchenko

Data Aware Large Scale Computing

Abhishek Purandare

Function : Author
PersonId : 1419794

Data Aware Large Scale Computing

Bruno Raffin

Function : Author
PersonId : 4842
IdHAL : bruno-raffin
ORCID : 0000-0002-7980-4946
IdRef : 091616999

Data Aware Large Scale Computing

Abstract

Artificial intelligence is transforming scientific computing with deep neural network surrogates that approximate solutions to partial differential equations (PDEs). Traditional off-line training methods face issues with storage and I/O efficiency, as the training dataset has to be computed with numerical solvers up front. Our previous work, the Melissa framework, addresses these problems by enabling data to be created "on-the-fly" and streamed directly into the training process. In this paper we introduce a new active learning method to enhance data efficiency for on-line surrogate training. The surrogate is direct and multi-parametric, i.e., it is trained to predict a given timestep directly for different initial and boundary condition parameters. Our approach uses Adaptive Multiple Importance Sampling guided by training loss statistics to focus neural network (NN) training on the difficult areas of the parameter space. Preliminary results for the 2D heat PDE demonstrate the potential of this method, called Breed, to improve the generalization capabilities of surrogates while reducing computational overhead.
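The idea of steering the choice of simulation parameters by training loss can be illustrated with a minimal sketch. This is not the Breed algorithm and not the Melissa API: it is a simplified loss-weighted resampling scheme mixed with uniform exploration, written under stated assumptions. The function name `propose_parameters`, the arguments `sigma` and `uniform_fraction`, the parameter bounds, and the synthetic loss are all hypothetical and chosen only for illustration.

```python
import numpy as np


def propose_parameters(prev_params, prev_losses, n_new, bounds,
                       sigma=0.05, uniform_fraction=0.5, rng=None):
    """Draw new simulation parameters, favouring regions with high training loss.

    Hypothetical helper: a simplified stand-in for loss-guided importance
    sampling, not the method described in the paper.
    """
    rng = np.random.default_rng() if rng is None else rng
    low = np.asarray(bounds[0], dtype=float)
    high = np.asarray(bounds[1], dtype=float)
    prev_params = np.asarray(prev_params, dtype=float)
    prev_losses = np.asarray(prev_losses, dtype=float)

    # Turn per-sample losses into selection probabilities (higher loss, more likely).
    total = prev_losses.sum()
    if total > 0:
        weights = prev_losses / total
    else:
        weights = np.full(len(prev_losses), 1.0 / len(prev_losses))

    n_uniform = int(round(uniform_fraction * n_new))
    n_guided = n_new - n_uniform

    # Exploration: uniform draws over the whole parameter box.
    uniform = rng.uniform(low, high, size=(n_uniform, low.size))

    # Exploitation: resample high-loss parents and perturb them with Gaussian noise.
    parents = rng.choice(len(prev_params), size=n_guided, p=weights)
    noise = rng.normal(0.0, sigma * (high - low), size=(n_guided, low.size))
    guided = np.clip(prev_params[parents] + noise, low, high)

    # Uniform samples come first, loss-guided samples last.
    return np.vstack([uniform, guided])


# Usage with synthetic data: two hypothetical condition parameters in [100, 500],
# and a made-up loss that is largest near the corner (450, 450).
rng = np.random.default_rng(0)
bounds = ([100.0, 100.0], [500.0, 500.0])
params = rng.uniform(bounds[0], bounds[1], size=(200, 2))
losses = np.exp(-np.sum((params - 450.0) ** 2, axis=1) / 5000.0) + 1e-6
new_params = propose_parameters(params, losses, n_new=100, bounds=bounds, rng=rng)
```

In an on-line setting, the returned parameters would be handed to the solver to launch the next batch of simulations, while the uniform share keeps some coverage of regions where the current loss estimate may be unreliable.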

Keywords

Active Learning; Adaptive Multiple Importance Sampling; Surrogates; On-line Training; Data-efficiency

Domains

Artificial Intelligence [cs.AI]

Main file

final-version-Sept2024.pdf (2.65 MB)

Origin : Files produced by the author(s)

https://hal.univ-brest.fr/hal-04712480

Submitted on : Monday, October 7, 2024, 2:19:55 PM

Last modification on : Tuesday, November 5, 2024, 10:52:04 AM

Dates and versions

hal-04712480 , version 1 (07-10-2024)

Identifiers

HAL Id : hal-04712480 , version 1

Cite

Sofya Dymchenko, Abhishek Purandare, Bruno Raffin. MelissaDL x Breed: Towards Data-Efficient On-line Supervised Training of Multi-parametric Surrogates with Active Learning. AI4S 2024 - 5th Workshop on artificial intelligence and machine learning for scientific applications, Nov 2024, Atlanta (Georgia), United States. pp.1-9. ⟨hal-04712480⟩

Collections

UNIV-BREST UGA CNRS INRIA LIG LIG_SRCPR INRIA2 GENCI LIG-SRCPR-DATAMOVE ANR LIG_SIDCH NUMPEX
