Giri Krishnan created AIRAVATA-3965:
---------------------------------------

             Summary: Facilitating computational experiment generation in 
AIRAVATA
                 Key: AIRAVATA-3965
                 URL: https://issues.apache.org/jira/browse/AIRAVATA-3965
             Project: Airavata
          Issue Type: New Feature
            Reporter: Giri Krishnan


Computational sciences involve extensive experimentation which often involves 
searching over space of parameters, variables, functions and workflows. 
Individual researchers and groups often perform a large number of such searches 
to identify critical functional forms and workflows for any particular study. 
The goal of this work is to provide a tool that facilitates this search 
process. This will enable visualization, identifying or learning templates and 
generate potential experiments based on past experiments using LLM and 
neurosymbolic methods.

 

This task requires the following specific goals for this work :
 # Provide visualization of past computational experiments: Tracking various 
computational experiments with various variations is often a challenging 
problem for individuals and groups of researchers. Often various adhoc 
approaches (directories, git etc) are used to track these changes, but often it 
is very difficult to provide an entire overview of past experiments. The goal 
of this work is to develop a visualization approach that allows to examine all 
the past experiments. This will require dimensionality reduction on the 
embeddings from LLMs which have been tested on its code generation abilities 
(eg. codellama, Llama 4 Maverick) for generating visualizations. Further 
comparison in the performance with standard code cloning and similarity 
measures will be required.
 # Identify template based on past experiment database: It is common for 
several computational experiments to share a common structure, in such cases 
identifying the 'template' allows for identifying common approaches in past 
experiments and to generate new ones. This work will need software engineer 
approach and AI based approaches to identify such templates. The templates will 
also be integrated with the visualization (in addition to embedding based 
visualization) allowing for examining the collections of experiments that 
belong to each template.
 # Generate new suggested experiments using templates and visualization guided 
search. Generation of new experiments is a key component of computational 
science work. To facilate this process, will require a visual interactive way 
to generate experiments based within the regions of previous experiments and 
also in the space where it was not previously explored. In addition, this will 
require generation of new experiments based on templates that were identified 
from the previous step. Template based generation could also provide a 
verifiable way to generate experiments.

 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to