Hi,

I had a quick look at the rules. Given the examples you provided, only the first rule matches three times, the second rule not once.

So I have to ask before I can refactor the rules: what should the rules annotate exactly?


Best


Peter


Am 05.08.2022 um 11:44 schrieb Md Azaz Ali:
Hi   Dr. Peter Klügl,

Yes its same in stackoverflow

On Fri, Aug 5, 2022 at 12:48 PM Peter Klügl <peter.klu...@averbis.com>
wrote:

Hi,


the attachements are removed by the mailing list. Are the rules the same
as in the StackOverflow question?


Best,


Peter

Am 04.08.2022 um 20:15 schrieb Md Azaz Ali:
HI Dr. Peter,

Here are some example addresses that the attached ruta is able to find.

There is two ruta rules which is used one is for multiline addresses
and other for single line addresses.
Also we are using some prepopulated EntityType Annotation with feature
location_indicator



//Annotation EntityType with feature location_indicator is already
present = Georgia

11175 Cicero Drive
Suite 200
Alpharetta, Georgia 30022



//EntityType with feature location_indicator is already present =
Cambridge;MA;U.S.A

One Rogers Street
Cambridge, MA
02142-1209
U.S.A

//EntityType with feature location_indicator is already present  =
Cambridge, MA, U.S.A.
1120 Avenue of the Americas
4th Floor
New York, NY 10036
U.S.A.


//EntityType with feature location_indicator is already present = U.S.A

11175 Cicero Drive
Suite 200
Alpharetta, Georgia 30022
U.S.A

//EntityType with feature location_indicator is already present = U.S.A

My new address is
8 Commerce Dr.
Suite 3B
Bedford, NH 03110
U.S.A


//EntityType with feature location_indicator is already present  = U.S.A.

400 Renaissance Center Drive
Suite 2600
Detroit, MI 48243
U.S.A.

//EntityType with feature location_indicator is already present  = U.S.A.

125 Wacker Drive
Suite 300
Chicago, IL 60606
U.S.A.

//EntityType with feature location_indicator is already present  = U.S.A.


1120 Avenue of the Americas
4th Floor
New York, NY 10036
U.S.A.


222 West Las Colinas Blvd. Suite 1650 North Tower Millennium Center
Irving, TX 75039 U.S.A.


Block No. 9A, Pritech Park SEZ, RMZ Ecospace Internal Road, Bellandur,
Bengaluru, Karnataka 560103, India



Thanks & Regard
Md Azaz Ali

On Thu, Aug 4, 2022 at 5:42 PM Peter Klügl <peter.klu...@averbis.com>
wrote:

     Hi,


     yes, I can suggest some refactored rules.

     However, I do not know the common input data and the use cases. It is
     easier for me if I have a few representative input snippets I can
     test
     the refactored rules against. Can you provide some (artifical)
     example
     text snippets?


     Best


     Peter


     Am 04.08.2022 um 11:33 schrieb Md Azaz Ali:
     > Hi Dr. Peter Klügl,
     >
     >
     > 1. We are not able to upgrade to Ruta 3.x because we have to
     upgrade
     > uimaj-core also and to do that we need an stable version of
     cleartk-ml
     > (which is not working with uima 3.x).
     >
     > 2. using PARAM_MAX_RULE_MATCHES , PARAM_MAX_RULE_ELEMENT_MATCHES we
     > are not sure what numer will be good enough.
     >
     > 3. if possible can you please suggest an improved version for above
     > script it will really help.
     >
     > 4. Also getting a new build from main-v2 is also not possible
     because
     > we can only use ga versions which are available directly in mvn
     repository
     >
     > I am attaching one script file if you can suggest the possible
     > improvements it will be really helpful.
     >
     > Note: I am new to ruta and these ruta scripts are written by old
     > developers in my company who are not associated with the company
     any
     > more.
     >
     > Many Thanks
     >
     >
     > On Tue, Aug 2, 2022 at 8:35 PM Peter Klügl
     <peter.klu...@averbis.com>
     > wrote:
     >
     >     Hi,
     >
     >
     >     thanks for the pointer. I added an answer.
     >
     >     Let me know if you want to have more information about the rule
     >     refactoring.
     >
     >
     >     In my experience, the life of a Ruta rule engineer is much
     easier
     >     if the
     >     Ruta rules stay small :-)
     >
     >
     >     Best,
     >
     >
     >     Peter
     >
     >
     >     Am 31.07.2022 um 21:09 schrieb Md Azaz Ali:
     >     >
     >

https://stackoverflow.com/questions/73147822/getting-oom-issue-while-running-ruta-script-with-large-texts
     >     >
     >     >
     >     >
     >     > Many Thanks
     >     >
     >     --
     >     Dr. Peter Klügl
     >     Head of Text Mining/Machine Learning
     >
     >     Averbis GmbH
     >     Salzstr. 15
     >     79098 Freiburg
     >     Germany
     >
     >     Fon: +49 761 708 394 0
     >     Fax: +49 761 708 394 10
     >     Email: peter.klu...@averbis.com
     >     Web: https://averbis.com
     >
     >     Headquarters: Freiburg im Breisgau
     >     Register Court: Amtsgericht Freiburg im Breisgau, HRB 701080
     >     Managing Directors: Dr. med. Philipp Daumke, Dr. Kornél Markó
     >
     --
     Dr. Peter Klügl
     Head of Text Mining/Machine Learning

     Averbis GmbH
     Salzstr. 15
     79098 Freiburg
     Germany

     Fon: +49 761 708 394 0
     Fax: +49 761 708 394 10
     Email:peter.klu...@averbis.com
     <mailto:email%3apeter.klu...@averbis.com>
     Web:https://averbis.com

     Headquarters: Freiburg im Breisgau
     Register Court: Amtsgericht Freiburg im Breisgau, HRB 701080
     Managing Directors: Dr. med. Philipp Daumke, Dr. Kornél Markó

--
Dr. Peter Klügl
Head of Text Mining/Machine Learning

Averbis GmbH
Salzstr. 15
79098 Freiburg
Germany

Fon: +49 761 708 394 0
Fax: +49 761 708 394 10
Email:peter.klu...@averbis.com
Web:https://averbis.com

Headquarters: Freiburg im Breisgau
Register Court: Amtsgericht Freiburg im Breisgau, HRB 701080
Managing Directors: Dr. med. Philipp Daumke, Dr. Kornél Markó

--
Dr. Peter Klügl
Head of Text Mining/Machine Learning

Averbis GmbH
Salzstr. 15
79098 Freiburg
Germany

Fon: +49 761 708 394 0
Fax: +49 761 708 394 10
Email: peter.klu...@averbis.com
Web: https://averbis.com

Headquarters: Freiburg im Breisgau
Register Court: Amtsgericht Freiburg im Breisgau, HRB 701080
Managing Directors: Dr. med. Philipp Daumke, Dr. Kornél Markó

Reply via email to