Hi Rupert, yes I had already reported our discussion internally but it's
always better to discuss it on stanbol-dev.
On 3/3/12 10:13 AM, Rupert Westenthaler wrote:
As the goal is to build the first release candidate Mo-Du next week the
intension here is to find the best way on how to include the RefactorEngine in
the full launcher. For this the idea was to
* keep the Refactor engine in the full launcher. This engine is an important
feature of Apache Stanbol and should be available by default
* remove the default configuration of the Engine that uses the SEO (Search
Engine Optimization) recipe. This means the the Refactor engine is not active
in the default configuration (similar to the KeywordExtractionEngine).
* keep the SEO recipe in the default configuration. This ensures that users
that want to try this use case do not need to load/init the recipe.
this was the content of your recent commit, right? Personally it is fine
with me
* set the default values (the values in the @Property annotations) so that they
work for the SEO recipe. This requires Users that want to test/use the SEO use
case to only go to the configuration tab of the OSGI Web Console click the [+]
of the RefactorEngine and than [OK].
configuration tab of the OSGI Web Console click the [+] of the RefactorEngine
and than [OK]
+1
Longer Term plan (after the 0.9 Release)
---------
Alessandro was mentioning some points about adding native support for Clerezza
graphs for the Rule component. Alessandro/Alberto maybe you can add more
information here
Andrea should correct me if I'm wrong, but I'm told there's work under
the hood on exporting Stanbol rules as ConstructQuery Clerezza objects
natively and avoiding to go through the ordeal of Jena rule conversion.
I had also the Idea to try using an IndexMGraph in-memory graph instead of the
the file-based Jena TDB. This could considerable boost performance. However one
would need validate the memory requirements.
As I told you, I do need to measure the memory footprints of many
implementations of ours. I've checked the Oracle VisualVM you mentioned,
but if there is some cool profiler in Apache I'd be happy to know. We'll
return on this anyway.
In the meantime I've just started to use the IndexedMGraph for in-memory
work in OntoNet.
Build an other Usecase that is more simple than SEO with the goal to include it in
the default configuration. SEO includes>60 rather complex rule definition. That
is cool for showing the power of this component. For the default configuration I
would like to have a simple Usecase that just need ~5 Rules. This could also
include a 5min Tutorial how to create this 5 Rules and a 15min Tutorial that
extends this default configuration by some additional Rules to include an other
feature. Maybe the combination of IPTC, rNews, TopicEngine and RefactorEngine
could be such an Example.
It could, but on an educated guess the IPTC/rNews combination might
require a bit more than 5 rules, maybe just a subset.
best
Alessandro
On 02.03.2012, at 17:20, Alberto Musetti wrote:
Hi Rupert, all
I'm working on it, but i'm proceeding slowly because i cannot reproduce the
error.
i'm sorry
Alberto
Il giorno 02/mar/2012, alle ore 00:20, Rupert Westenthaler ha scritto:
Hi Alberto
I spent some time to look into "Build failed in Jenkins: stanbol-trunk-1.6 #772"
The most interesting file is the error.log file of the Stanbol
instance used for the integration test.
https://builds.apache.org/job/stanbol-trunk-1.6/ws/trunk/integration-tests/target/launchdir/sling/logs/error.log
For the first view calls to the enhancer everything looks fine (~5sec
for the Refactor engine).
But later - after time code "01.03.2012 19:49:21.04" (use this to
search in the file) something starts to went wrong with Jena TDB.
[Thread-36]>> seo_refactoring and
[Thread-12]>> looks like a Jena TDB demon
are the only active one (looks a little like a deadlock).
You can safely ignore the [DataFileTrackingDaemon] this checks only
every 5sec for Resources of the DataFIleProvider.
I have really no idea what is going on here. Maybe you can make more
sense of this
best
Rupert
On Thu, Mar 1, 2012 at 3:30 PM, Alberto Musetti<[email protected]> wrote:
Hi Rupert, all
Il giorno 29/feb/2012, alle ore 19:14, Rupert Westenthaler ha scritto:
Hi Alberto
On 29.02.2012, at 18:48, Alberto Musetti wrote:
Hi all,
I would like to add the refactor engine in enhancer/bundlelist.
It will be activated in the launcher full and full-war,
but not in stable because there isn't rules and ontonet.
May I add the refactor engine?
I think the full launcher should include all engines that are managed by the
Stanbol Community. So in principle a +1 from my side.
Can I ask two questions:
(1) What would that mean for the memory footprint. I usually run Stanbol with
-Xmx512m. Would that still work with the refactor engine?
Yes, it would still work.
Concerning performance and memory usage we are adding full support to Clerezza
in the Refactor with spending effort and memory in graph tansformations.
(2) Would the engine be active by default? Do you plan to ship the full
launcher with a default configuration or would a user need to manually
configure an instance?
There is a default configuration running at bundle start-up,
i.e., the refactoring for the SEO demo.
Best,
Alberto
best
Rupert
--
| Rupert Westenthaler [email protected]
| Bodenlehenstraße 11 ++43-699-11108907
| A-5500 Bischofshofen
--
M.Sc. Alessandro Adamou
Alma Mater Studiorum - Università di Bologna
Department of Computer Science
Mura Anteo Zamboni 7, 40127 Bologna - Italy
Semantic Technology Laboratory (STLab)
Institute for Cognitive Science and Technology (ISTC)
National Research Council (CNR)
Via Nomentana 56, 00161 Rome - Italy
"As for the charges against me, I am unconcerned. I am beyond their timid, lying
morality, and so I am beyond caring."
(Col. Walter E. Kurtz)
Not sent from my iSnobTechDevice