You are right. These modules are not general purpose parsers. In fact, I
am using HTML::Parser to implement a HTML miner to extract data from
HTML/HEAD/META. There is a miner that wraps around stat() to make the
data provided by stat() in the Miner/Store framework. The beauty of the
framework is that it provides a uniform way to get Meta data on many
sources in the same application and to use several miners to obtain
information on the same file. It is also fairly easy to write a Miner
that analyzes a particular file and constructs data such as number of
words, keywords, version number. Since file formats vary, I believe that
a uniform way to obtain this information would be useful.

I am at a loss as far as the Namespace is concerned because there don't
seem to be related modules in CPAN. There is a Metadata module but it
implements interfaces that I believe are not useful for what I am trying
to do. This is what the modules do: Process files to obtain Metadata.

The best alternative I can come up with is File::Metadata. Is this any
better?


-----Original Message-----
From: Ade Olonoh [mailto:[EMAIL PROTECTED]] 
Sent: Tuesday, July 16, 2002 8:27 PM
To: Midh Mulpuri
Cc: [EMAIL PROTECTED]
Subject: Re: New Module Advice

What kind of meta data do you mean?  It sounds like there is
application-specific functionality that the Miner/Store modules provide,
rather than being a general Star/Open Office parser or an HTML parser. 
The type of meta data you're retrieving from the files would probably
hint towards a better name, since (IMHO) "MetaInfo" is too vague to
signal what kind of problem could be solved with the module.

--Ade.


On Mon, 2002-07-15 at 17:36, Midh Mulpuri wrote:
>        I am writing an application that collects and stores meta data
> from a variety of files (e.g. Star/Open Office files, HTML files) etc.
> The application is written as one set of modules that extract this
meta
> data from a file and another set of modules that store this
information.
> The information is exchanged between a miner and a store in a hash.
>        I believe that the miner modules would be useful to every one.
I
> would like to release to CPAN. At the same time the store modules are
a
> nice way to store this meta information but they do not implement
> anything that is useful separate from the Miner modules. 
>        Would it be a good thing to release both the Miner and Store
> modules. At last count I have four of the former and two of the
latter.
> There is one store Module that write the Metadata to a XML file and
> another to a DBI supported database.
>        Another problem would be the Namespace. MetaInfo sees
available.
> I could release the modules as MetaInfo::Miner::--------- and
> MetaInfo::Store::------ if I wanted to release both sets. On the other
> hand I could release just the Miners under the MetaInfo namespace.
>        Any advice and pointers would be appreciated since this would
be
> my first release to CPAN.
>        - Midh Mulpuri

Reply via email to