I have no experience in this, I have just coded a little with the libraries.
On Tue, Jun 26, 2012 at 1:08 PM, Florian Kuhlmann <[email protected]> wrote: > Hello Daniel, > > I'm new to the list, but as i also work with german language and POS-tags, i > want to give my 2 cent: > Please consider that capitalisation is probably one of the features used to > determine the correct tag. > > So if you have a noun like "Sonne" it should not begin with a small "s". Most > NLP tools are not very robust for this kind of errors (and so sometimes out > of the box not a good match for e.g. social media, because they are trained > on news which mostly have correct spelling). > > Best, > > Florian > > ________________________________________ > Von: daniel stieger [[email protected]] > Gesendet: Dienstag, 26. Juni 2012 21:56 > An: [email protected] > Betreff: Re: Newby Question on German POS Tagging > > Hi Lance, > > thanks for your answer here. Is there a way to contact you (skype) for a > small talk. That would help me a lot. Or can you recommend someone who is > experienced? > > I m not into computer linguistic in detail. In past, i wrote my own > algorithms, compined them with stemming and levinstein but openNLP would > offer a totally different approach - i guess. > > Rright now, i just need someone who pushes me into the right direction. > > Any help appreciated .... :) > Dan > > > -------- Original-Nachricht -------- >> Datum: Mon, 25 Jun 2012 01:27:07 -0700 >> Von: Lance Norskog <[email protected]> >> An: [email protected] >> Betreff: Re: Newby Question on German POS Tagging > >> The Chunking tool might help here. Chunking means finding noun and >> verb phrases. This can help you find recurring phrases. Because German >> is agglutinative, this is probably a very different problem than in >> English. Are there any de-agglutinizer algorithms? >> >> On Sun, Jun 24, 2012 at 11:43 PM, daniel stieger <[email protected]> >> wrote: >> > Hi, >> > >> > thanks a lot for your answers. My goal is to identify adjectives and >> nouns in association sentence. Eg. What do you associate with our brand? >> Answer: nice mountains, the mountains are very nice .. etc. >> > >> > >> > If appropriate, i would use the openNLP posTagger (it seams to be the >> most elaborated java postagger) in order to identify nouns and adjectives. So >> when i input the sentence "the", "mountains", "are", "nice" >> > the output is correct - also when using single words: >> > >> >>> [DT, NNS, VBP, JJ] >> >>> [DT] >> >>> [NNS] >> >>> [VBP] >> >>> [JJ] >> > >> > >> > Is the english model better than the german model? Do i have to build my >> own model - or is the de-maxent appropriate? >> > >> > Generally - is openNLP a good choice for my task? >> > >> > Thanks again, >> > Dan >> > >> > >> > -------- Original-Nachricht -------- >> >> Datum: Sat, 23 Jun 2012 16:53:36 -0700 >> >> Von: Lance Norskog <[email protected]> >> >> An: [email protected] >> >> Betreff: Re: Newby Question on German POS Tagging >> > >> >> What would you like to find out about your data? Until we know that it >> >> is difficult to recommend a technique. >> >> >> >> On Sat, Jun 23, 2012 at 4:15 AM, Thilo Goetz <[email protected]> wrote: >> >> > On 22.06.2012 20:13, daniel stieger wrote: >> >> >> >> >> >> Hi List, >> >> >> >> >> >> i m looking for some suggestions and opinions for my task. The >> >> situation >> >> >> is this: >> >> >> >> >> >> In an online survey approx. 800 participants were asked a open text >> >> >> question like "What do you associate with our brand?". Participants >> can >> >> then >> >> >> enter 5 associations. Eg. >> >> >> >> >> >> - nature >> >> >> - beautifull mountains >> >> >> - relax >> >> >> - family friendly >> >> >> - very good service >> >> >> >> >> >> >> >> >> Now i just want to run the openNLP Post tagger over all >> associations. I >> >> >> suppose that i can use one association just as one sentence. Instead >> of >> >> the >> >> >> english model, i used the de-maxent.bin model and some german >> answers. >> >> But >> >> >> the tags are somehow wrong. Eg. >> >> >> >> >> >> sonne -> KON >> >> >> familie -> ART (it is a noun, definitely not an aricle) >> >> >> >> >> >> Am I on a wrong path? Should i handle my data differently? Or should >> i >> >> >> download an other model? Where can i get trainingdata ?? >> >> >> >> >> >> So many questions.. sorry.. but every hint appreciated, >> >> >> >> >> >> best, >> >> >> Daniel >> >> >> >> >> >> >> >> > >> >> > I'm pretty sure the model was trained on complete sentences. The >> >> > tagging takes context into account, and will not work properly >> >> > without it. So just running it on a couple of words at a time >> >> > will not work. >> >> > >> >> > If all your associations are NPs like your example, >> >> > you can maybe fix things by always prefixing "I like the ". In >> >> > German, maybe "Ich liebe ". >> >> > >> >> > HTH, >> >> > Thilo >> >> > >> >> > >> >> >> >> >> >> >> >> -- >> >> Lance Norskog >> >> [email protected] >> > >> > -- >> > NEU: FreePhone 3-fach-Flat mit kostenlosem Smartphone! >> > Jetzt informieren: http://mobile.1und1.de/?ac=OM.PW.PW003K20328T7073a >> >> >> >> -- >> Lance Norskog >> [email protected] > > -- > NEU: FreePhone 3-fach-Flat mit kostenlosem Smartphone! > Jetzt informieren: http://mobile.1und1.de/?ac=OM.PW.PW003K20328T7073a -- Lance Norskog [email protected]
