Hi, you can look into parse-html plugin, lookout for the class "HTMLMetaProcessor.java " . the metadata is processed there so u can probably ge it from there
On Sat, Jan 10, 2009 at 3:40 AM, 郭雄 <[email protected]> wrote: > Hi,i am a newer to nutch,and i need get a webpage title and its metedata > like <meta name="description" content="nutch study"> : "nutch study". > can somebody give a guid on how to use nutch to do it.Thanks a lot! > -- Ankur Garg
