HI Dr. Peter,

Here are some example addresses that the attached ruta is able to find.

There is two ruta rules which is used one is for multiline addresses and
other for single line addresses.
Also we are using some prepopulated EntityType Annotation with feature
location_indicator



//Annotation EntityType with feature location_indicator is already present
= Georgia

11175 Cicero Drive
Suite 200
Alpharetta, Georgia 30022



//EntityType with feature location_indicator is already present =
Cambridge;MA;U.S.A

One Rogers Street
Cambridge, MA
02142-1209
U.S.A

//EntityType with feature location_indicator is already present  =
Cambridge, MA, U.S.A.
1120 Avenue of the Americas
4th Floor
New York, NY 10036
U.S.A.


//EntityType with feature location_indicator is already present = U.S.A

11175 Cicero Drive
Suite 200
Alpharetta, Georgia 30022
U.S.A

//EntityType with feature location_indicator is already present = U.S.A

My new address is
8 Commerce Dr.
Suite 3B
Bedford, NH 03110
U.S.A


//EntityType with feature location_indicator is already present  = U.S.A.

400 Renaissance Center Drive
Suite 2600
Detroit, MI 48243
U.S.A.

//EntityType with feature location_indicator is already present  = U.S.A.

125 Wacker Drive
Suite 300
Chicago, IL 60606
U.S.A.

//EntityType with feature location_indicator is already present  = U.S.A.


1120 Avenue of the Americas
4th Floor
New York, NY 10036
U.S.A.


222 West Las Colinas Blvd. Suite 1650 North Tower Millennium Center Irving,
TX 75039 U.S.A.


Block No. 9A, Pritech Park SEZ, RMZ Ecospace Internal Road, Bellandur,
Bengaluru, Karnataka 560103, India



Thanks & Regard
Md Azaz Ali

On Thu, Aug 4, 2022 at 5:42 PM Peter Klügl <peter.klu...@averbis.com> wrote:

> Hi,
>
>
> yes, I can suggest some refactored rules.
>
> However, I do not know the common input data and the use cases. It is
> easier for me if I have a few representative input snippets I can test
> the refactored rules against. Can you provide some (artifical) example
> text snippets?
>
>
> Best
>
>
> Peter
>
>
> Am 04.08.2022 um 11:33 schrieb Md Azaz Ali:
> > Hi Dr. Peter Klügl,
> >
> >
> > 1. We are not able to upgrade to Ruta 3.x because we have to upgrade
> > uimaj-core also and to do that we need an stable version of cleartk-ml
> > (which is not working with uima 3.x).
> >
> > 2. using PARAM_MAX_RULE_MATCHES , PARAM_MAX_RULE_ELEMENT_MATCHES we
> > are not sure what numer will be good enough.
> >
> > 3. if possible can you please suggest an improved version for above
> > script it will really help.
> >
> > 4. Also getting a new build from main-v2 is also not possible because
> > we can only use ga versions which are available directly in mvn
> repository
> >
> > I am attaching one script file if you can suggest the possible
> > improvements it will be really helpful.
> >
> > Note: I am new to ruta and these ruta scripts are written by old
> > developers in my company who are not associated with the company any
> > more.
> >
> > Many Thanks
> >
> >
> > On Tue, Aug 2, 2022 at 8:35 PM Peter Klügl <peter.klu...@averbis.com>
> > wrote:
> >
> >     Hi,
> >
> >
> >     thanks for the pointer. I added an answer.
> >
> >     Let me know if you want to have more information about the rule
> >     refactoring.
> >
> >
> >     In my experience, the life of a Ruta rule engineer is much easier
> >     if the
> >     Ruta rules stay small :-)
> >
> >
> >     Best,
> >
> >
> >     Peter
> >
> >
> >     Am 31.07.2022 um 21:09 schrieb Md Azaz Ali:
> >     >
> >
> https://stackoverflow.com/questions/73147822/getting-oom-issue-while-running-ruta-script-with-large-texts
> >     >
> >     >
> >     >
> >     > Many Thanks
> >     >
> >     --
> >     Dr. Peter Klügl
> >     Head of Text Mining/Machine Learning
> >
> >     Averbis GmbH
> >     Salzstr. 15
> >     79098 Freiburg
> >     Germany
> >
> >     Fon: +49 761 708 394 0
> >     Fax: +49 761 708 394 10
> >     Email: peter.klu...@averbis.com
> >     Web: https://averbis.com
> >
> >     Headquarters: Freiburg im Breisgau
> >     Register Court: Amtsgericht Freiburg im Breisgau, HRB 701080
> >     Managing Directors: Dr. med. Philipp Daumke, Dr. Kornél Markó
> >
> --
> Dr. Peter Klügl
> Head of Text Mining/Machine Learning
>
> Averbis GmbH
> Salzstr. 15
> 79098 Freiburg
> Germany
>
> Fon: +49 761 708 394 0
> Fax: +49 761 708 394 10
> Email:peter.klu...@averbis.com
> Web:https://averbis.com
>
> Headquarters: Freiburg im Breisgau
> Register Court: Amtsgericht Freiburg im Breisgau, HRB 701080
> Managing Directors: Dr. med. Philipp Daumke, Dr. Kornél Markó
>

Reply via email to