[like] Miao, Yu reacted to your message: ________________________________ From: Suresh Marru <sma...@apache.org> Sent: Tuesday, April 29, 2025 4:40:06 PM To: Airavata Dev <dev@airavata.apache.org> Cc: Pamidighantam, Sudhakar <spami...@gatech.edu> Subject: Re: Seeking Guidance on Automating Parsing in SMILES Workflow
You don't often get email from sma...@apache.org. Learn why this is important<https://aka.ms/LearnAboutSenderIdentification> Hi Yu, Thanks for this summary. This post is probably old now, but might still be relavent, did you review this one - https://medium.com/@lahirujayathilake/gsoc-2018-re-architect-output-data-parsing-into-airavata-core-81da4b37057e Suresh On Apr 29, 2025, at 12:36 PM, Miao, Yu <ymia...@gatech.edu> wrote: Hi Airavata Dev Team, I’m reaching out to get feedback and suggestions on implementing a new feature in the SMILES Django Portal. Goal: Once a SMILES experiment completes(marked as COMPLETED), we would like the corresponding Gaussian log file to be automatically parsed into JSON and uploaded to the Data Catalog. The parser itself is already implemented. To enable this, I currently believe the following steps are needed: 1. Build a Docker image with the parser, so that it can process a .log file and generate a .json file. Register the container in Airavata. 1. Modify DataParsingTask.java in Airavata in Helix Spectator to integrate the new parser if needed. 1. Update the SMILES Portal to add the parsing task to the experiment submission workflow. I’d really appreciate input from the community on: Whether this approach is correct, or if there’s a better integration path; Any design considerations or requirements that should be addressed; @Eroma Abeysinghe: Do we have documentation or examples related to automatic parsing functionality (e.g. from CDE/lit_data) that I could reference? Looking forward to your suggestions! Best regards, Yu