You are right. These modules are not general-purpose parsers. In fact, I am using HTML::Parser to implement an HTML miner that extracts data from HTML/HEAD/META. There is also a miner that wraps around stat() to make the data provided by stat() available in the Miner/Store framework. The beauty of the framework is that it provides a uniform way to get metadata on many sources in the same application, and to use several miners to obtain information on the same file. It is also fairly easy to write a miner that analyzes a particular file and constructs data such as the number of words, keywords, or a version number. Since file formats vary, I believe that a uniform way to obtain this information would be useful.
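To make the idea concrete, here is a minimal sketch of how two miners might share one interface and have their results merged into a single hash. The package names StatMiner and WordCountMiner, the mine() method, and the mine_all() helper are assumptions made for illustration; they are not the names or API of the actual modules.

```perl
use strict;
use warnings;

# Hypothetical miner: wraps Perl's built-in stat() and returns a hash ref.
package StatMiner;

sub new { my ($class) = @_; return bless {}, $class }

sub mine {
    my ($self, $path) = @_;
    my @st = stat($path);
    return {} unless @st;    # stat() returns an empty list on failure
    return {
        size  => $st[7],            # size in bytes
        mtime => $st[9],            # last modification time (epoch seconds)
        mode  => $st[2] & 07777,    # permission bits only
    };
}

# Hypothetical miner: counts whitespace-separated words in a text file.
package WordCountMiner;

sub new { my ($class) = @_; return bless {}, $class }

sub mine {
    my ($self, $path) = @_;
    open(my $fh, '<', $path) or return {};
    my $words = 0;
    while (my $line = <$fh>) {
        my @tokens = split ' ', $line;
        $words += scalar @tokens;
    }
    close $fh;
    return { words => $words };
}

package main;

# Run every miner on the same file and merge the resulting hashes.
# Later miners overwrite earlier ones if they report the same key.
sub mine_all {
    my ($path, @miners) = @_;
    my %meta;
    for my $miner (@miners) {
        my $found = $miner->mine($path);
        @meta{ keys %$found } = values %$found;
    }
    return \%meta;
}

# Usage: mine this script itself and print whatever was found.
my $meta = mine_all(__FILE__, StatMiner->new, WordCountMiner->new);
print "$_ = $meta->{$_}\n" for sort keys %$meta;
```

Because every miner returns a plain hash, a store only needs to know how to persist a hash, whichever miner produced it.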
I am at a loss as far as the namespace is concerned, because there don't seem to be any related modules on CPAN. There is a Metadata module, but it implements interfaces that I believe are not useful for what I am trying to do. This is what the modules do: process files to obtain metadata. The best alternative I can come up with is File::Metadata. Is this any better?

-----Original Message-----
From: Ade Olonoh [mailto:[EMAIL PROTECTED]]
Sent: Tuesday, July 16, 2002 8:27 PM
To: Midh Mulpuri
Cc: [EMAIL PROTECTED]
Subject: Re: New Module Advice

What kind of meta data do you mean? It sounds like there is application-specific functionality that the Miner/Store modules provide, rather than being a general Star/Open Office parser or an HTML parser.

The type of meta data you're retrieving from the files would probably hint towards a better name, since (IMHO) "MetaInfo" is too vague to signal what kind of problem could be solved with the module.

--Ade.

On Mon, 2002-07-15 at 17:36, Midh Mulpuri wrote:
> I am writing an application that collects and stores meta data
> from a variety of files (e.g. Star/Open Office files, HTML files).
> The application is written as one set of modules that extract this meta
> data from a file and another set of modules that store this information.
> The information is exchanged between a miner and a store in a hash.
>
> I believe that the miner modules would be useful to everyone. I
> would like to release them to CPAN. At the same time, the store modules are a
> nice way to store this meta information, but they do not implement
> anything that is useful separate from the Miner modules.
>
> Would it be a good thing to release both the Miner and Store
> modules? At last count I have four of the former and two of the latter.
> There is one Store module that writes the metadata to an XML file and
> another that writes it to a DBI-supported database.
>
> Another problem would be the namespace. MetaInfo seems available.
> I could release the modules as MetaInfo::Miner::--------- and
> MetaInfo::Store::------ if I wanted to release both sets. On the other
> hand, I could release just the Miners under the MetaInfo namespace.
>
> Any advice and pointers would be appreciated, since this would be
> my first release to CPAN.
>
> - Midh Mulpuri