how-to query an xml repository efficiently

2009-09-07 Thread Robby Pelssers
Hi all,

 

I have following use case.  The customer has an xml repository which is
nothing more then a directory on filesystem which contains
subdirectories containing one or more xml files.  They now want to query
those xml files on some predefined criteria which might change over
time...

 

I'm looking for a solution which results in high performance search and
some things that came to my mind was 

* extracting information and storing them in a database (e.g.
HSQLDB)  

* using lucene

 

Is there somewhere detailed documentation available on using these?  And
what would you recommend for my use case?

 

I already found some stuff but no real quick-start material.

http://cocoon.apache.org/2.1/userdocs/concepts/xmlsearching.html

http://cocoon.apache.org/2.2/blocks/hsqldb-client/1.0/

http://cocoon.apache.org/2.2/blocks/hsqldb-server/1.0/

 

Thx in advance,

Robby Pelssers

 

 



Re: how-to query an xml repository efficiently

2009-09-07 Thread DAVIGNON Andre - CETE NP/DIODé/PANDOC

Hi,

For high performance search I would recommend you Apache Solr.

Regards,

André


Le 07/09/2009 15:54,  Robby Pelssers (par Internet, dépôt 
users-return-97968-andre.davignon=developpement-durable.gouv...@cocoon.apache.org) 
a écrit  :

Hi all,

 

I have following use case.  The customer has an xml repository which is 
nothing more then a directory on filesystem which contains 
subdirectories containing one or more xml files.  They now want to query 
those xml files on some predefined criteria which might change over time…


 

I’m looking for a solution which results in high performance search and 
some things that came to my mind was


· extracting information and storing them in a database (e.g. 
HSQLDB) 


· using lucene

 

Is there somewhere detailed documentation available on using these?  And 
what would you recommend for my use case?


 


I already found some stuff but no real quick-start material.

http://cocoon.apache.org/2.1/userdocs/concepts/xmlsearching.html

http://cocoon.apache.org/2.2/blocks/hsqldb-client/1.0/

http://cocoon.apache.org/2.2/blocks/hsqldb-server/1.0/

 


Thx in advance,

Robby Pelssers

 

 



-
To unsubscribe, e-mail: users-unsubscr...@cocoon.apache.org
For additional commands, e-mail: users-h...@cocoon.apache.org



Re: how-to query an xml repository efficiently

2009-09-07 Thread David Legg

Hi Robby,

I have following use case. The customer has an xml repository which is 
nothing more then a directory on filesystem which contains 
subdirectories containing one or more xml files. They now want to 
query those xml files on some predefined criteria which might change 
over time…




Maybe others with more experience could comment but what about Apache 
Xindice [1] ?. Not sure what the status is on this project but it was 
designed from the ground up to be able to store, query and retrieve XML. 
It may be that everyone abandoned it and started using Solr instead ;-)


[1] http://xml.apache.org/xindice/index.html

Regards,
David Legg


-
To unsubscribe, e-mail: users-unsubscr...@cocoon.apache.org
For additional commands, e-mail: users-h...@cocoon.apache.org



xml2pgsql with cocoon?

2009-09-07 Thread Mika Lehtonen

Hello Cocooners!

I have a little project going on with the following brief description. I 
was just wondering whether this be something that would be feasible to 
implement with Cocoon.


I have a XML schema and a Postgresql (PostGIS) table structure which fit 
together, at least as much as possible. What I would like to accomplish is:


1) Upload a XML-file (Actually GML) into my server.
2) Optionally validate it against the schema.
3) Populate the Postgresql database with the data of that uploaded GML.
4) Populate PostGIS columns of the database with the geometries in that 
GML-file.


And someday I might want to do the same other way round, that is, save a 
valid GML out of the Postgresql/PostGIS database.



Any hints/ideas are most welcome. And maybe I am just trying to 
re-invent a wheel again. If you know some framework/software which 
already does this, please share your knowledge.


With best regards,
Mika Lehtonen

-
To unsubscribe, e-mail: users-unsubscr...@cocoon.apache.org
For additional commands, e-mail: users-h...@cocoon.apache.org



Re: xml2pgsql with cocoon?

2009-09-07 Thread Jos Snellings
Hello Mika,

In your description you mention a maintenance task: data uploading and
conversion: you need to parse XML data and insert extracted entities
(polygons, ids) in a database. You do not need cocoon to do that.
I would use SAX and simply JDBC.
How do you represent your GIS-data for the users of your site?

Kind regards,
Jos


On Mon, 2009-09-07 at 18:58 +0300, Mika Lehtonen wrote:
 Hello Cocooners!
 
 I have a little project going on with the following brief description. I 
 was just wondering whether this be something that would be feasible to 
 implement with Cocoon.
 
 I have a XML schema and a Postgresql (PostGIS) table structure which fit 
 together, at least as much as possible. What I would like to accomplish is:
 
 1) Upload a XML-file (Actually GML) into my server.
 2) Optionally validate it against the schema.
 3) Populate the Postgresql database with the data of that uploaded GML.
 4) Populate PostGIS columns of the database with the geometries in that 
 GML-file.
 
 And someday I might want to do the same other way round, that is, save a 
 valid GML out of the Postgresql/PostGIS database.
 
 
 Any hints/ideas are most welcome. And maybe I am just trying to 
 re-invent a wheel again. If you know some framework/software which 
 already does this, please share your knowledge.
 
 With best regards,
 Mika Lehtonen
 
 -
 To unsubscribe, e-mail: users-unsubscr...@cocoon.apache.org
 For additional commands, e-mail: users-h...@cocoon.apache.org
 
 


-
To unsubscribe, e-mail: users-unsubscr...@cocoon.apache.org
For additional commands, e-mail: users-h...@cocoon.apache.org



Re: xml2pgsql with cocoon?

2009-09-07 Thread Mika Lehtonen

Hi Jos,

and thanks for your reply.
SAX and JDBC.. I should have mentioned, that Java is my Achilles Heel. I 
can't claim that I had never written a single line of Java, shame on me. 
And your suggestion means a lot of Java jumping, right? I do have 
written code with C#, Perl, PHP, JS and done quite a lot of things with 
XML/XSLT/XSL-FO, but still a fresh start with Java in a project like 
this.. I don't know.. on the other hand, it would be a chance to grab 
the ox from its horns and do the inevitable.
I will probably represent the geometries with Geoserver and OpenLayers 
and probably include some attribute related functions into it. But there 
will be also some other applications. It's still all open. At this 
point, the major task is to get the data into database or actually build 
a block that will make uploading into database possible.


- mika -

Jos Snellings kirjoitti:

Hello Mika,

In your description you mention a maintenance task: data uploading and
conversion: you need to parse XML data and insert extracted entities
(polygons, ids) in a database. You do not need cocoon to do that.
I would use SAX and simply JDBC.
How do you represent your GIS-data for the users of your site?

Kind regards,
Jos


On Mon, 2009-09-07 at 18:58 +0300, Mika Lehtonen wrote:
  

Hello Cocooners!

I have a little project going on with the following brief description. I 
was just wondering whether this be something that would be feasible to 
implement with Cocoon.


I have a XML schema and a Postgresql (PostGIS) table structure which fit 
together, at least as much as possible. What I would like to accomplish is:


1) Upload a XML-file (Actually GML) into my server.
2) Optionally validate it against the schema.
3) Populate the Postgresql database with the data of that uploaded GML.
4) Populate PostGIS columns of the database with the geometries in that 
GML-file.


And someday I might want to do the same other way round, that is, save a 
valid GML out of the Postgresql/PostGIS database.



Any hints/ideas are most welcome. And maybe I am just trying to 
re-invent a wheel again. If you know some framework/software which 
already does this, please share your knowledge.


With best regards,
Mika Lehtonen

-
To unsubscribe, e-mail: users-unsubscr...@cocoon.apache.org
For additional commands, e-mail: users-h...@cocoon.apache.org






-
To unsubscribe, e-mail: users-unsubscr...@cocoon.apache.org
For additional commands, e-mail: users-h...@cocoon.apache.org
  


-
To unsubscribe, e-mail: users-unsubscr...@cocoon.apache.org
For additional commands, e-mail: users-h...@cocoon.apache.org