Hi, Is this currently possible with Tika 0.9 in Nutch branch 1.4? I would have thought that Tika already dealt with this, but I have seen no mention of anyone discussing it, or having problems extracting this from web documents when fetching with Nutch.
For example, say I had a geographical location in a meta tag such as "geo:long=55.1244": is it possible to extract this with parse-tika, or would I need to extend parse-html? Separately, is it possible to extract hashtags from Twitter via the above? -- *Lewis*