Hi Dr. Peter,

Sorry for not making this clearer. I have created a gist.

The gist below contains the address.ruta file, with one example attached to
each of the two rules:

https://gist.github.com/azazali30/635c3b80e02908e9f8387db3fda865db


Many Thanks





On Sat, Aug 6, 2022 at 4:11 PM Peter Klügl <peter.klu...@averbis.com> wrote:

> Hi,
>
>
> I had a quick look at the rules. Given the examples you provided, only
> the first rule matches three times, the second rule not once.
>
> So I have to ask before I can refactor the rules: what should the rules
> annotate exactly?
>
>
> Best
>
>
> Peter
>
>
> Am 05.08.2022 um 11:44 schrieb Md Azaz Ali:
> > Hi Dr. Peter Klügl,
> >
> > Yes, they are the same as in the StackOverflow question.
> >
> > On Fri, Aug 5, 2022 at 12:48 PM Peter Klügl <peter.klu...@averbis.com>
> > wrote:
> >
> >> Hi,
> >>
> >>
> >> the attachments are removed by the mailing list. Are the rules the same
> >> as in the StackOverflow question?
> >>
> >>
> >> Best,
> >>
> >>
> >> Peter
> >>
> >> Am 04.08.2022 um 20:15 schrieb Md Azaz Ali:
> >>> Hi Dr. Peter,
> >>>
> >>> Here are some example addresses that the attached Ruta script is able to find.
> >>>
> >>> Two Ruta rules are used: one for multiline addresses and the other
> >>> for single-line addresses.
> >>> We also use a prepopulated EntityType annotation with the feature
> >>> location_indicator.
> >>>
> >>>
> >>>
> >>> //Annotation EntityType with feature location_indicator is already
> >>> present = Georgia
> >>>
> >>> 11175 Cicero Drive
> >>> Suite 200
> >>> Alpharetta, Georgia 30022
> >>>
> >>>
> >>>
> >>> //EntityType with feature location_indicator is already present =
> >>> Cambridge;MA;U.S.A
> >>>
> >>> One Rogers Street
> >>> Cambridge, MA
> >>> 02142-1209
> >>> U.S.A
> >>>
> >>> //EntityType with feature location_indicator is already present  =
> >>> Cambridge, MA, U.S.A.
> >>> 1120 Avenue of the Americas
> >>> 4th Floor
> >>> New York, NY 10036
> >>> U.S.A.
> >>>
> >>>
> >>> //EntityType with feature location_indicator is already present = U.S.A
> >>>
> >>> 11175 Cicero Drive
> >>> Suite 200
> >>> Alpharetta, Georgia 30022
> >>> U.S.A
> >>>
> >>> //EntityType with feature location_indicator is already present = U.S.A
> >>>
> >>> My new address is
> >>> 8 Commerce Dr.
> >>> Suite 3B
> >>> Bedford, NH 03110
> >>> U.S.A
> >>>
> >>>
> >>> //EntityType with feature location_indicator is already present = U.S.A.
> >>>
> >>> 400 Renaissance Center Drive
> >>> Suite 2600
> >>> Detroit, MI 48243
> >>> U.S.A.
> >>>
> >>> //EntityType with feature location_indicator is already present = U.S.A.
> >>>
> >>> 125 Wacker Drive
> >>> Suite 300
> >>> Chicago, IL 60606
> >>> U.S.A.
> >>>
> >>> //EntityType with feature location_indicator is already present = U.S.A.
> >>>
> >>>
> >>> 1120 Avenue of the Americas
> >>> 4th Floor
> >>> New York, NY 10036
> >>> U.S.A.
> >>>
> >>>
> >>> 222 West Las Colinas Blvd. Suite 1650 North Tower Millennium Center
> >>> Irving, TX 75039 U.S.A.
> >>>
> >>>
> >>> Block No. 9A, Pritech Park SEZ, RMZ Ecospace Internal Road, Bellandur,
> >>> Bengaluru, Karnataka 560103, India
> >>>
> >>>
> >>>
> >>> Thanks & Regards
> >>> Md Azaz Ali
> >>>
> >>> On Thu, Aug 4, 2022 at 5:42 PM Peter Klügl <peter.klu...@averbis.com>
> >>> wrote:
> >>>
> >>>      Hi,
> >>>
> >>>
> >>>      yes, I can suggest some refactored rules.
> >>>
> >>>      However, I do not know the common input data and the use cases. It is
> >>>      easier for me if I have a few representative input snippets I can
> >>>      test the refactored rules against. Can you provide some (artificial)
> >>>      example text snippets?
> >>>
> >>>
> >>>      Best
> >>>
> >>>
> >>>      Peter
> >>>
> >>>
> >>>      Am 04.08.2022 um 11:33 schrieb Md Azaz Ali:
> >>>      > Hi Dr. Peter Klügl,
> >>>      >
> >>>      >
> >>>      > 1. We are not able to upgrade to Ruta 3.x because we would also have
> >>>      > to upgrade uimaj-core, and for that we need a stable version of
> >>>      > cleartk-ml (which does not work with UIMA 3.x).
> >>>      >
> >>>      > 2. For PARAM_MAX_RULE_MATCHES and PARAM_MAX_RULE_ELEMENT_MATCHES, we
> >>>      > are not sure what number would be good enough.
> >>>      >
> >>>      > 3. If possible, could you please suggest an improved version of the
> >>>      > above script? It would really help.
> >>>      >
> >>>      > 4. Getting a new build from main-v2 is also not possible because
> >>>      > we can only use GA versions that are available directly in the
> >>>      > Maven repository.
> >>>      >
> >>>      > I am attaching one script file. If you can suggest possible
> >>>      > improvements, it would be really helpful.
> >>>      >
> >>>      > Note: I am new to Ruta, and these Ruta scripts were written by former
> >>>      > developers at my company who are no longer with the company.
> >>>      >
> >>>      > Many Thanks
> >>>      >
> >>>      >
> >>>      > On Tue, Aug 2, 2022 at 8:35 PM Peter Klügl
> >>>      <peter.klu...@averbis.com>
> >>>      > wrote:
> >>>      >
> >>>      >     Hi,
> >>>      >
> >>>      >
> >>>      >     thanks for the pointer. I added an answer.
> >>>      >
> >>>      >     Let me know if you want to have more information about the rule
> >>>      >     refactoring.
> >>>      >
> >>>      >
> >>>      >     In my experience, the life of a Ruta rule engineer is much easier
> >>>      >     if the Ruta rules stay small :-)
> >>>      >
> >>>      >
> >>>      >     Best,
> >>>      >
> >>>      >
> >>>      >     Peter
> >>>      >
> >>>      >
> >>>      >     Am 31.07.2022 um 21:09 schrieb Md Azaz Ali:
> >>>      >     > https://stackoverflow.com/questions/73147822/getting-oom-issue-while-running-ruta-script-with-large-texts
> >>>      >     >
> >>>      >     >
> >>>      >     >
> >>>      >     > Many Thanks
> >>>      >     >
> >>>      >     --
> >>>      >     Dr. Peter Klügl
> >>>      >     Head of Text Mining/Machine Learning
> >>>      >
> >>>      >     Averbis GmbH
> >>>      >     Salzstr. 15
> >>>      >     79098 Freiburg
> >>>      >     Germany
> >>>      >
> >>>      >     Fon: +49 761 708 394 0
> >>>      >     Fax: +49 761 708 394 10
> >>>      >     Email: peter.klu...@averbis.com
> >>>      >     Web: https://averbis.com
> >>>      >
> >>>      >     Headquarters: Freiburg im Breisgau
> >>>      >     Register Court: Amtsgericht Freiburg im Breisgau, HRB 701080
> >>>      >     Managing Directors: Dr. med. Philipp Daumke, Dr. Kornél Markó
> >>>      >
> >>>
> >>
