No problemo Mr Mattmann. I've added a FAQ section to the wiki on how to
specify multiple crawler actions.

Thanks to you and Brian for your help on this,
Mike

On 2/8/12 7:44 PM, "Mattmann, Chris A (388J)"
<[email protected]> wrote:

>Great job, Mike! Would you be willing to write up a quick wiki FAQ on
>this,
>here?
>
>https://cwiki.apache.org/confluence/display/OODT/CAS-PGE+Help+and+Document
>ation
>
>Cheers,
>Chris
>
>On Feb 8, 2012, at 8:34 AM, Cayanan, Michael D (388J) wrote:
>
>> Hey Brian and Chris,
>> 
>> So, I first re-tried it using Chris' suggestion of using spaces and I
>>still got an error. Here is what I'm seeing in the log:
>> 
>> Feb 8, 2012 7:55:33 AM org.apache.oodt.cas.pge.PGETaskInstance
>>runIngestCrawler
>> INFO: Executing StdProductCrawler in productPath:
>>[/Users/mcayanan/smap/staging/L1A_RADAR_SPS/20120208T155531/output]
>> Feb 8, 2012 7:55:33 AM org.apache.oodt.cas.pge.PGETaskInstance
>>runIngestCrawler
>> WARNING: Failed while attempting to ingest products while crawling
>>directory 
>>'/Users/mcayanan/smap/staging/L1A_RADAR_SPS/20120208T155531/output' (all
>>products may not have been ingested) : No bean named
>>'KickoffL1BS0LoResWorkflow MoveMetadataToBackupDir' is defined
>> org.springframework.beans.factory.NoSuchBeanDefinitionException: No
>>bean named 'KickoffL1BS0LoResWorkflow MoveMetadataToBackupDir' is defined
>>      at 
>>org.springframework.beans.factory.support.DefaultListableBeanFactory.getB
>>eanDefinition(DefaultListableBeanFactory.java:360)
>>      at 
>>org.springframework.beans.factory.support.AbstractBeanFactory.getMergedBe
>>anDefinition(AbstractBeanFactory.java:914)
>>      at 
>>org.springframework.beans.factory.support.AbstractBeanFactory.getBean(Abs
>>tractBeanFactory.java:241)
>>      at 
>>org.springframework.beans.factory.support.AbstractBeanFactory.getBean(Abs
>>tractBeanFactory.java:164)
>>      at 
>>org.springframework.context.support.AbstractApplicationContext.getBean(Ab
>>stractApplicationContext.java:761)
>>      at 
>>org.apache.oodt.cas.crawl.action.CrawlerActionRepo.loadActionsFromBeanFac
>>tory(CrawlerActionRepo.java:68)
>>      at 
>>org.apache.oodt.cas.crawl.ProductCrawler.crawl(ProductCrawler.java:87)
>>      at 
>>org.apache.oodt.cas.pge.PGETaskInstance.runIngestCrawler(PGETaskInstance.
>>java:405)
>>      at 
>>org.apache.oodt.cas.pge.PGETaskInstance.ingestProducts(PGETaskInstance.ja
>>va:349)
>>      at 
>>org.apache.oodt.cas.pge.PGETaskInstance.run(PGETaskInstance.java:454)
>>      at 
>>org.apache.oodt.cas.workflow.structs.TaskJob.execute(TaskJob.java:102)
>>      at 
>>org.apache.oodt.cas.resource.system.extern.XmlRpcBatchStub$RunnableJob.ru
>>n(XmlRpcBatchStub.java:229)
>>      at java.lang.Thread.run(Thread.java:680)
>> 
>> 
>> I then modified my tasks.xml file where I specified the
>>'PCS_ActionsIds' property with the action IDs:
>> 
>> <property name="PCS_ActionsIds"
>>value="KickoffL1BS0LoResWorkflow,MoveMetadataToBackupDir" />
>> 
>> and added a line in the PgeConfig.xml as Brian suggested:
>> 
>> <customMetadata>
>>        <metadata key="PCS_ActionsIds" val="[PCS_ActionsIds]"/>
>> 
>> and still got an error in the log as shown above.
>> 
>> So, I ended up re-naming the property name in the tasks.xml to some
>>other name (so as to not reference itself essentially) and that did the
>>trick!
>> 
>> tasks.xml:
>> 
>> <property name="Actions"
>>value="KickoffL1BS0LoResWorkflow,MoveMetadataToBackupDir" />
>> 
>> PgeConfig.xml:
>> 
>> <customMetadata>
>>        <metadata key="PCS_ActionsIds" val="[Actions]"/>
>> 
>> 
>> Also, I found out that you can't have spaces in between the commas like
>>so,
>> 
>> <property name="Actions" value="KickoffL1BS0LoResWorkflow,
>>MoveMetadataToBackupDir" />
>> 
>> 
>> Thanks for all your help guys!
>> 
>> 
>> -Mike
>> 
>> 
>> From: holenoter <[email protected]>
>> Reply-To: "[email protected]" <[email protected]>
>> Date: Tue, 7 Feb 2012 22:06:27 -0800
>> To: "[email protected]" <[email protected]>
>> Subject: Re: CAS PGE Crawler Question
>> 
>> workflow manager properties (static metadata) are passed to CAS-PGE via
>>WorkflowTaskConfiguration which only supports scalar metadata... CAS-PGE
>>doesn't re-parse out these metadata value because you have to allow for
>>workflow metadata to contain values with commas without always being
>>split out by CAS-PGE.. so you have to explicitly repoint the metadata
>>field at itself so it becomes custom (or local) metadata for CAS-PGE and
>>then the value will be parsed on comma... context: CAS-PGE initially did
>>the opposite (i.e. always re-parse workflow metadata into multi-values)
>>but chris later changed this to the way it is now because this broke OCO
>>since OCO had static workflow metadata which contained metadata with
>>commas that was not supposed to be parsed into multi-values.
>> 
>> -brian
>> 
>> On Feb 07, 2012, at 07:36 PM, "Mattmann, Chris A (388J)"
>><[email protected]> wrote:
>> 
>>> Hi Mike,
>>> 
>>> Space should work in the property value below. Can you show me
>>> the log output from CAS-PGE in the workflow log file (or the resource
>>>manager
>>> batch stubs?) It should show you what Actions are being executed
>>> and that can help to debug.
>>> 
>>> Thanks!
>>> 
>>> Cheers,
>>> Chris
>>> 
>>> On Feb 7, 2012, at 4:06 PM, Cayanan, Michael D (388J) wrote:
>>> 
>>> > If I wanted to specify more than one PCS_ActionsIds values, how
>>>would I do
>>> > this?
>>> > 
>>> > I've tried the following with no success:
>>> > 
>>> > <property name="PCS_ActionIds" value="Action1, Action2" />
>>> > 
>>> > 
>>> > and
>>> > 
>>> > <property name="PCS_ActionIds" value="Action1,Action2" />
>>> > 
>>> > 
>>> > and 
>>> > 
>>> > <property name="PCS_ActionIds" value="Action1 Action2" />
>>> > 
>>> > 
>>> > 
>>> > -Mike
>>> > 
>>> > 
>>> > On 2/7/12 3:22 PM, "Mattmann, Chris A (388J)"
>>> > <[email protected]> wrote:
>>> > 
>>> >> Hi Mike,
>>> >> 
>>> >> Glad to hear!
>>> >> 
>>> >> Cheers,
>>> >> Chris
>>> >> 
>>> >> On Feb 7, 2012, at 3:14 PM, Cayanan, Michael D (388J) wrote:
>>> >> 
>>> >>> Hi all,
>>> >>> 
>>> >>> Disregard my previous message. Turns out I had a typo. I had to set
>>> >>> 'PCS_ActionsIds' instead of 'PCS_ActionIds'.
>>> >>> 
>>> >>> -Mike
>>> >>> 
>>> >>> From: "Cayanan, Michael D (388J)" <[email protected]>
>>> >>> Reply-To: "[email protected]" <[email protected]>
>>> >>> Date: Tue, 7 Feb 2012 13:51:12 -0800
>>> >>> To: "[email protected]" <[email protected]>
>>> >>> Cc: "Wong, Cynthia L (388J)" <[email protected]>
>>> >>> Subject: CAS PGE Crawler Question
>>> >>> 
>>> >>> Hi all,
>>> >>> 
>>> >>> I would like CAS PGE to run a post ingest success action that I
>>>created
>>> >>> upon a successful ingestion of the output file(s). How do you
>>>configure
>>> >>> CAS PGE for this type of scenario?
>>> >>> 
>>> >>> I have a tasks.xml file in the workflow, where I define my post
>>>ingest
>>> >>> action under the 'PCS_ActionIds' property:
>>> >>> 
>>> >>> <task id="urn:my:Task" name="Task"
>>> >>> ...
>>> >>> <property name="PCS_ActionIds"
>>> >>> value="MyPostIngestSuccessAction" />
>>> >>> <property name="PCS_ActionRepoFile"
>>> >>> value="crawler-config.xml" />
>>> >>> </configuration>
>>> >>> </task>
>>> >>> 
>>> >>> My crawler-config.xml has the following:
>>> >>> 
>>> >>> <beans xmlns="http://www.springframework.org/schema/beans";
>>> >>> xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance";
>>> >>> xmlns:p="http://www.springframework.org/schema/p";
>>> >>> xsi:schemaLocation="http://www.springframework.org/schema/beans
>>> >>> http://www.springframework.org/schema/beans/spring-beans-2.5.xsd";>
>>> >>> 
>>> >>> <import 
>>> >>> resource="classpath:/org/apache/oodt/cas/crawl/crawler-beans.xml"/>
>>> >>> <import resource="./action-beans.xml"/>
>>> >>> <import 
>>> >>> 
>>>resource="classpath:/org/apache/oodt/cas/crawl/comparator/precondition-b
>>>e
>>> >>> ans.xml"/>
>>> >>> <import 
>>> >>> 
>>>resource="classpath:/org/apache/oodt/cas/crawl/option/cmd-line-option-be
>>>a
>>> >>> ns.xml"/>
>>> >>> </beans>
>>> >>> 
>>> >>> Within my action-beans.xml, I added the post ingest success action:
>>> >>> 
>>> >>> <beans xmlns="http://www.springframework.org/schema/beans";
>>> >>> xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance";
>>> >>> xsi:schemaLocation="http://www.springframework.org/schema/beans
>>> >>> http://www.springframework.org/schema/beans/spring-beans-2.5.xsd";>
>>> >>> 
>>> >>> <bean 
>>> >>> 
>>>class="org.apache.oodt.commons.spring.postprocessor.SetIdBeanPostProcess
>>>o
>>> >>> r"/>
>>> >>> 
>>> >>> <!-- Crawler Actions -->
>>> >>> 
>>> >>> <bean id="MyPostIngestSuccessAction" lazy-init="true"
>>> >>> class="gov.nasa.jpl.crawler.action.MyPostIngestSuccessAction">
>>> >>> <property name="description" value="Kickoff Another Workflow
>>> >>> Event"/>
>>> >>> <property name="executeCommand" value="sh
>>> >>> [WORKFLOWMGR_HOME]/bin/run_L1B_S0_LoRes.sh"/>
>>> >>> <property name="phases">
>>> >>> <list>
>>> >>> <value type="java.lang.String">postIngestSuccess</value>
>>> >>> </list>
>>> >>> </property>
>>> >>> </bean>
>>> >>> 
>>> >>> Not sure if I'm missing anything else.
>>> >>> 
>>> >>> Thanks in advance,
>>> >>> Mike
>>> >>> 
>>> >> 
>>> >> 
>>> >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> >> Chris Mattmann, Ph.D.
>>> >> Senior Computer Scientist
>>> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>> >> Office: 171-266B, Mailstop: 171-246
>>> >> Email: [email protected]
>>> >> WWW: http://sunset.usc.edu/~mattmann/
>>> >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> >> Adjunct Assistant Professor, Computer Science Department
>>> >> University of Southern California, Los Angeles, CA 90089 USA
>>> >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> >> 
>>> > 
>>> 
>>> 
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Chris Mattmann, Ph.D.
>>> Senior Computer Scientist
>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>> Office: 171-266B, Mailstop: 171-246
>>> Email: [email protected]
>>> WWW: http://sunset.usc.edu/~mattmann/
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Adjunct Assistant Professor, Computer Science Department
>>> University of Southern California, Los Angeles, CA 90089 USA
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> 
>
>
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>Chris Mattmann, Ph.D.
>Senior Computer Scientist
>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>Office: 171-266B, Mailstop: 171-246
>Email: [email protected]
>WWW:   http://sunset.usc.edu/~mattmann/
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>Adjunct Assistant Professor, Computer Science Department
>University of Southern California, Los Angeles, CA 90089 USA
>++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>

Reply via email to