Re: [U2] Extract first and last name from free-form name

2011-12-14 Thread Bill Haskett
Unfortunately, these days I'm having a hard time "thinking" of anything 
normal in particular; Anglo-Saxon or not!  :-)


Bill


- Original Message -
*From:* antli...@youngman.org.uk
*To:* u2-users@listserver.u2ug.org
*Date:* 12/14/2011 2:32 PM
*Subject:* Re: [U2] Extract first and last name from free-form name

On 14/12/11 16:30, Bill Haskett wrote:

As usual, one has to ask if the complexities involved in managing all
potentialities ensures that the simplest, and by far the most usual,
scenarios are at best difficult to implement. :-(

Bill

The trouble is, even in the Anglo-Saxon world, what YOU think of as 
"the norm" is very different to what we think (for example, personally 
I've NEVER come across II, III, Jr, Snr over here apart from one 
instance of an expat American).


And, still in the Anglo-Saxon world, I only have to travel 400 miles 
and I'm in a culture where apparently women keep their maiden name as 
a matter of course in marriage...


Cheers,
Wol
___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


Re: [U2] Extract first and last name from free-form name

2011-12-14 Thread Wols Lists

On 14/12/11 16:30, Bill Haskett wrote:

As usual, one has to ask if the complexities involved in managing all
potentialities ensures that the simplest, and by far the most usual,
scenarios are at best difficult to implement. :-(

Bill

The trouble is, even in the Anglo-Saxon world, what YOU think of as "the 
norm" is very different to what we think (for example, personally I've 
NEVER come across II, III, Jr, Snr over here apart from one instance of 
an expat American).


And, still in the Anglo-Saxon world, I only have to travel 400 miles and 
I'm in a culture where apparently women keep their maiden name as a 
matter of course in marriage...


Cheers,
Wol
___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


Re: [U2] Extract first and last name from free-form name

2011-12-14 Thread George Gallen
Our problem in this area, is when you purchase a list or are given a list from 
a client and all they have
is an excel file with full name. It would be nice to say, no, not good enough, 
but if they are paying you
to do a mailing, it's kinda hard to turn them away!

George Gallen
Senior Programmer/Analyst
Accounting/Data Division
ggal...@wyanokegroup.com
ph:856.848.9005 Ext 220
The Wyanoke Group
http://www.wyanokegroup.com

From: u2-users-boun...@listserver.u2ug.org 
[u2-users-boun...@listserver.u2ug.org] On Behalf Of Tony Gravagno 
[3xk547...@sneakemail.com]
Sent: Wednesday, December 14, 2011 1:02 PM
To: u2-users@listserver.u2ug.org
Subject: Re: [U2] Extract first and last name from free-form name

In today's modern world we have a luxury which is not being
utilized.  Depending on your audience, most people these days
have access to cell phones, e-mail, web pages (even Facebook),
etc.  Rather than trying to get algorithms or employees to guess
at which field is which, just let human beings modify their own
data.  Any company can publish a simple web page, a mobile app, a
SMS app, or even an email form.  With those media people can
properly identify themselves for the company database.  The data
can be audited before getting posted to the database.

[ad] I'll be happy to create such a solution for any company.

Tony Gravagno
Nebula Research and Development
TG@ remove.pleaseNebula-RnD.com
remove.pleaseNebula-RnD.com/blog



___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users
___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


Re: [U2] Extract first and last name from free-form name

2011-12-14 Thread Colin Alfke
Or you could use pre-packaged software to do it for you. I've used ParseRat
 for parsing report output and it
worked really well - although the interface is getting extremely dated.
Haven't tried it for names - but it would be way easier than reinventing the
wheel

Colin

-Original Message-
From: Bill Haskett

As usual, one has to ask if the complexities involved in managing all 
potentialities ensures that the simplest, and by far the most usual, 
scenarios are at best difficult to implement.  :-(

Bill

- Original Message -
*From:* symeon
> You need to do a proper lexical analysis in order to work these out
>
> For example
>
> Input : Dean Foster
> Lex: title word
>
> Input: Dean Reginald McGraw
> Lex: title word word
>
> Then set rules to say a lex of "title word" is probable worked out as
> "forename surname"  and a lex of "title word word" is probably "title
> forename surname"   you can assign probabilities against these and build
> some self learning in.
>
> It is a whole massive topic.
>
> -Original Message-
> From: Mecki Foerthmann
>
> And the list goes on and on and...
> That's why free form names are an absolute pain and should be avoided.
> It's so much easier to have Title, First Name(s), Last Name(s) fields in
the
> input screen and keep them as separate attributes.
> You never get it 100% right.
> In your list take Dean or Prince for instance - they could be first names
> and not titles at all.
> A colleague of mine tried a last name upper to lower case conversion
> including Irish and Scottish names and out of Machine Co it made MacHine
Co.
> And don't even ask what happened to last names starting with O.:-)
>
>
> On 14/12/2011 01:02, Charlie Noah wrote:
>> Great start, but here is a longer list, although still nowhere near
>> complete:
>>
>> Prefixes
>>
>> Code Description
>> 1st Lt   First Lieutenant
>> Adm  Admiral
>> Atty Attorney
>> Brother  Brother (religious)
>> Capt Captain
>> ChiefChief
>> Cmdr Commander
>> Col  Colonel
>> Dean University Dean (includes Assistant and Associate)
>> Dr   Doctor (Medical or Educator)
>> ElderElder (religious)
>> Father   Father (religious)
>> Gen  General
>> Gov  Governor
>> Hon  Honorable (Cabinet Officer, Commissioner, Congressman, Judge,
>> etc.)
>> Lt Col   Lieutenant Colonel
>> Maj  Major
>> MSgt Major/Master Sergeant
>> Mr   Mister
>> Mrs  Married Woman
>> Ms   Single or Married Woman
>> Prince   Prince
>> Prof Professor (includes Assistant and Associate
>> RabbiRabbi (religious)
>> Rev  Reverend (religious)
>> Sister   Sister (religious)
>>
>> Suffixes
>>
>> Code Description
>> II   The Second
>> III  The Third
>> IV   The Fourth
>> VThe Fifth
>> CPA  Certified Public Accountant
>> DDS  Doctor of Dental Medicine
>> Esq  Esquire
>> JD   Jurist Doctor
>> Jr   Junior
>> Jnr  Junior (British)
>> LLD  Doctor of Laws
>> MD   Doctor of Medicine
>> PhD  Doctorate
>> Ret  Retired from Armed Forces
>> RN   Registered Nurse
>> RPh  Registered Pharmacist
>> Sr   Senior
>> Snr  Senior (British)
>> DO   Doctor of Osteopathy
>>
>> Perhaps others can add more to the list.
>>
>> Regards,
>> Charlie Noah
>>
>> Tiny Bear's Wild Bird Store
>> "Everything For The Backyard Bird Enthusiast, Except For The Birds"
>> Info, Forum:  http://www.TinyBearMarketing.com
>> Store:http://Stores.TinyBearMarketing.com
>>
>>
>> On 12-13-2011 5:12 PM, Wjhonson wrote:
>>> 0044:  SUFFIXES = ",JR,SR,MD,III,"
>>> 0045:  S.NAME = DCOUNT(UM.NAME,' ')
>>> 0046:  LAST.WORD.IN.NAME = FIELD(UM.NAME,' ',S.NAME)
>>> 0047:  IF INDEX(SUFFIXES,",":LAST.WORD.IN.NAME:",",1) THEN
>>> 0048: LAST.NAME = FIELD(UM.NAME,' ',S.NAME-1)
>>> 0049:  END ELSE
>>> 0050: LAST.NAME = LAST.WORD.IN.NAME
>>> 0051:  END
>>> 0052:  PREFIXES = ',DR,MR,MS,MISS,MRS,'
>>> 0053:  FIRST.WORD.IN.NAME = FIELD(UM.NAME,' ',1)
>>> 0054:  IF INDEX(PREFIXES,",":FIRST.WORD.IN.NAME:",",1) THEN
>>> 0055: FIRST.NAME = FIELD(UM.NAME,' ',2)
>>> 0056:  END ELSE
>>> 0057: FIRST.NAME = FIRST.WORD.IN.NAME
>>> 0058:  END

___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


Re: [U2] Extract first and last name from free-form name

2011-12-14 Thread Wjhonson

I'm not disagreeing Tony, but when you put up a "Pay Your Bill Online" type 
page, for a general population, like say a county tax authority, you'll find 
that some people do it, most people don't.  It takes a generation to adopt 
something like that, and the generation who is paying most of the bills (50 
years and older) aren't particularly tech savvy, although the programmers in 
that generation might be.

So you can fix some of the issues by allowing the population to fix their own 
name, but you'll end up fielding ten times more phone calls because they don't 
know how to turn off their shift key.



-Original Message-
From: Tony Gravagno <3xk547...@sneakemail.com>
To: u2-users 
Sent: Wed, Dec 14, 2011 10:02 am
Subject: Re: [U2] Extract first and last name from free-form name


In today's modern world we have a luxury which is not being
tilized.  Depending on your audience, most people these days
ave access to cell phones, e-mail, web pages (even Facebook),
tc.  Rather than trying to get algorithms or employees to guess
t which field is which, just let human beings modify their own
ata.  Any company can publish a simple web page, a mobile app, a
MS app, or even an email form.  With those media people can
roperly identify themselves for the company database.  The data
an be audited before getting posted to the database.
[ad] I'll be happy to create such a solution for any company.
Tony Gravagno
ebula Research and Development
G@ remove.pleaseNebula-RnD.com
emove.pleaseNebula-RnD.com/blog

___
2-Users mailing list
2-us...@listserver.u2ug.org
ttp://listserver.u2ug.org/mailman/listinfo/u2-users

___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


Re: [U2] Extract first and last name from free-form name

2011-12-14 Thread Tony Gravagno
In today's modern world we have a luxury which is not being
utilized.  Depending on your audience, most people these days
have access to cell phones, e-mail, web pages (even Facebook),
etc.  Rather than trying to get algorithms or employees to guess
at which field is which, just let human beings modify their own
data.  Any company can publish a simple web page, a mobile app, a
SMS app, or even an email form.  With those media people can
properly identify themselves for the company database.  The data
can be audited before getting posted to the database.

[ad] I'll be happy to create such a solution for any company.

Tony Gravagno
Nebula Research and Development
TG@ remove.pleaseNebula-RnD.com
remove.pleaseNebula-RnD.com/blog



___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


Re: [U2] Extract first and last name from free-form name

2011-12-14 Thread Bill Haskett
As usual, one has to ask if the complexities involved in managing all 
potentialities ensures that the simplest, and by far the most usual, 
scenarios are at best difficult to implement.  :-(


Bill


- Original Message -
*From:* syme...@gmail.com
*To:* 'U2 Users List' 
*Date:* 12/14/2011 1:57 AM
*Subject:* Re: [U2] Extract first and last name from free-form name

You need to do a proper lexical analysis in order to work these out

For example

Input : Dean Foster
Lex: title word

Input: Dean Reginald McGraw
Lex: title word word



Then set rules to say a lex of "title word" is probable worked out as
"forename surname"  and a lex of "title word word" is probably "title
forename surname"   you can assign probabilities against these and build
some self learning in.

It is a whole massive topic.




-Original Message-
From: u2-users-boun...@listserver.u2ug.org
[mailto:u2-users-boun...@listserver.u2ug.org] On Behalf Of Mecki Foerthmann
Sent: 14 December 2011 08:22
To: u2-users@listserver.u2ug.org
Subject: Re: [U2] Extract first and last name from free-form name

And the list goes on and on and...
That's why free form names are an absolute pain and should be avoided.
It's so much easier to have Title, First Name(s), Last Name(s) fields in the
input screen and keep them as separate attributes.
You never get it 100% right.
In your list take Dean or Prince for instance - they could be first names
and not titles at all.
A colleague of mine tried a last name upper to lower case conversion
including Irish and Scottish names and out of Machine Co it made MacHine Co.
And don't even ask what happened to last names starting with O.:-)


On 14/12/2011 01:02, Charlie Noah wrote:

Great start, but here is a longer list, although still nowhere near
complete:

Prefixes

Code Description
1st Lt   First Lieutenant
Adm  Admiral
Atty Attorney
Brother  Brother (religious)
Capt Captain
ChiefChief
Cmdr Commander
Col  Colonel
Dean University Dean (includes Assistant and Associate)
Dr   Doctor (Medical or Educator)
ElderElder (religious)
Father   Father (religious)
Gen  General
Gov  Governor
Hon  Honorable (Cabinet Officer, Commissioner, Congressman, Judge,
etc.)
Lt Col   Lieutenant Colonel
Maj  Major
MSgt Major/Master Sergeant
Mr   Mister
Mrs  Married Woman
Ms   Single or Married Woman
Prince   Prince
Prof Professor (includes Assistant and Associate
RabbiRabbi (religious)
Rev  Reverend (religious)
Sister   Sister (religious)

Suffixes

Code Description
II   The Second
III  The Third
IV   The Fourth
VThe Fifth
CPA  Certified Public Accountant
DDS  Doctor of Dental Medicine
Esq  Esquire
JD   Jurist Doctor
Jr   Junior
Jnr  Junior (British)
LLD  Doctor of Laws
MD   Doctor of Medicine
PhD  Doctorate
Ret  Retired from Armed Forces
RN   Registered Nurse
RPh  Registered Pharmacist
Sr   Senior
Snr  Senior (British)
DO   Doctor of Osteopathy

Perhaps others can add more to the list.

Regards,
Charlie Noah

Tiny Bear's Wild Bird Store
"Everything For The Backyard Bird Enthusiast, Except For The Birds"
Info, Forum:  http://www.TinyBearMarketing.com
Store:http://Stores.TinyBearMarketing.com


On 12-13-2011 5:12 PM, Wjhonson wrote:

0044:  SUFFIXES = ",JR,SR,MD,III,"
0045:  S.NAME = DCOUNT(UM.NAME,' ')
0046:  LAST.WORD.IN.NAME = FIELD(UM.NAME,' ',S.NAME)
0047:  IF INDEX(SUFFIXES,",":LAST.WORD.IN.NAME:",",1) THEN
0048: LAST.NAME = FIELD(UM.NAME,' ',S.NAME-1)
0049:  END ELSE
0050: LAST.NAME = LAST.WORD.IN.NAME
0051:  END
0052:  PREFIXES = ',DR,MR,MS,MISS,MRS,'
0053:  FIRST.WORD.IN.NAME = FIELD(UM.NAME,' ',1)
0054:  IF INDEX(PREFIXES,",":FIRST.WORD.IN.NAME:",",1) THEN
0055: FIRST.NAME = FIELD(UM.NAME,' ',2)
0056:  END ELSE
0057: FIRST.NAME = FIRST.WORD.IN.NAME
0058:  END
___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users

___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users
-
No virus found in this message.
Checked by AVG - www.avg.com
Version: 10.0.1415 / Virus Database: 2102/4079 - Release Date: 12/13/11

___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://lis

Re: [U2] Extract first and last name from free-form name

2011-12-14 Thread George R Smith

Count Darling IV is alive and well. He lives in Fayetteville, AR
still works with mvBase.
George

-Original Message- 
From: Charlie Noah

Sent: Wednesday, December 14, 2011 8:00 AM
To: U2 Users List
Subject: Re: [U2] Extract first and last name from free-form name

And how about Count Darling IV? Count is actually his first name. I
haven't heard anything about him in years.

Charlie

On 12-14-2011 7:47 AM, Wjhonson wrote:

  Or in our system we have entries like
Dean Suarez Smith

In actuality the customer states that "Suarez Smith" is a double last 
name, not a middle name and last name

And Dean is their title... or no it's their first name...

Actually Doctor can be a first name as well.
It's a mess





-Original Message-
From: Mecki Foerthmann
To: u2-users
Sent: Wed, Dec 14, 2011 5:09 am
Subject: Re: [U2] Extract first and last name from free-form name


Just face it - it can't be done!
So what if Dean has 2 first names and is a plumber?

On 14/12/2011 09:57, Symeon Breen wrote:

You need to do a proper lexical analysis in order to work these out

For example

Input : Dean Foster
Lex: title word

Input: Dean Reginald McGraw
Lex: title word word



Then set rules to say a lex of "title word" is probable worked out as
"forename surname"  and a lex of "title word word" is probably "title
forename surname"   you can assign probabilities against these and build
some self learning in.

It is a whole massive topic.




-Original Message-
From: u2-users-boun...@listserver.u2ug.org
[mailto:u2-users-boun...@listserver.u2ug.org] On Behalf Of Mecki 
Foerthmann

Sent: 14 December 2011 08:22
To: u2-users@listserver.u2ug.org
Subject: Re: [U2] Extract first and last name from free-form name

And the list goes on and on and...
That's why free form names are an absolute pain and should be avoided.
It's so much easier to have Title, First Name(s), Last Name(s) fields in 
the

input screen and keep them as separate attributes.
You never get it 100% right.
In your list take Dean or Prince for instance - they could be first names
and not titles at all.
A colleague of mine tried a last name upper to lower case conversion
including Irish and Scottish names and out of Machine Co it made MacHine 
Co.

And don't even ask what happened to last names starting with O.:-)


On 14/12/2011 01:02, Charlie Noah wrote:

Great start, but here is a longer list, although still nowhere near
complete:

Prefixes

Code Description
1st Lt   First Lieutenant
Adm  Admiral
Atty Attorney
Brother  Brother (religious)
Capt Captain
ChiefChief
Cmdr Commander
Col  Colonel
Dean University Dean (includes Assistant and Associate)
Dr   Doctor (Medical or Educator)
ElderElder (religious)
Father   Father (religious)
Gen  General
Gov  Governor
Hon  Honorable (Cabinet Officer, Commissioner, Congressman, Judge,
etc.)
Lt Col   Lieutenant Colonel
Maj  Major
MSgt Major/Master Sergeant
Mr   Mister
Mrs  Married Woman
Ms   Single or Married Woman
Prince   Prince
Prof Professor (includes Assistant and Associate
RabbiRabbi (religious)
Rev  Reverend (religious)
Sister   Sister (religious)

Suffixes

Code Description
II   The Second
III  The Third
IV   The Fourth
VThe Fifth
CPA  Certified Public Accountant
DDS  Doctor of Dental Medicine
Esq  Esquire
JD   Jurist Doctor
Jr   Junior
Jnr  Junior (British)
LLD  Doctor of Laws
MD   Doctor of Medicine
PhD  Doctorate
Ret  Retired from Armed Forces
RN   Registered Nurse
RPh  Registered Pharmacist
Sr   Senior
Snr  Senior (British)
DO   Doctor of Osteopathy

Perhaps others can add more to the list.

Regards,
Charlie Noah

Tiny Bear's Wild Bird Store
"Everything For The Backyard Bird Enthusiast, Except For The Birds"
Info, Forum:  http://www.TinyBearMarketing.com
Store:http://Stores.TinyBearMarketing.com


On 12-13-2011 5:12 PM, Wjhonson wrote:

0044:  SUFFIXES = ",JR,SR,MD,III,"
0045:  S.NAME = DCOUNT(UM.NAME,' ')
0046:  LAST.WORD.IN.NAME = FIELD(UM.NAME,' ',S.NAME)
0047:  IF INDEX(SUFFIXES,",":LAST.WORD.IN.NAME:",",1) THEN
0048: LAST.NAME = FIELD(UM.NAME,' ',S.NAME-1)
0049:  END ELSE
0050: LAST.NAME = LAST.WORD.IN.NAME
0051:  END
0052:  PREFIXES = ',DR,MR,MS,MISS,MRS,'
0053:  FIRST.WORD.IN.NAME = FIELD(UM.NAME,' ',1)
0054:  IF INDEX(PREFIXES,",":FIRST.WORD.IN.NAME:",",1) THEN
0055: FIRST.NAME = FIELD(UM.NAME,' ',2)
0056:  END ELSE
0057: FIRST.NAME = FIRST.WORD.IN.NAME
0058:  END
___
U2-Users mailing list
U2-Users@listserver.u2ug.

Re: [U2] Extract first and last name from free-form name

2011-12-14 Thread Dawn Wolthuis
I'm late to the Party, but wrote this draft yesterday. Some of it has been
covered, but I'll click send without further revisions.

I did not see the original post on this (just the Re:) but you can only get
somewhat close on extracting first or last names from a free-form name, not
good enough to use the resulting information, in general. There could be a
lot more name-intelligence built into the algorithm, but it will still be
an approximation. Of course, there are many two word first names, such as
"Sally Jo" and many two and three word last names (e.g. Vander Molen, De
Vries, and 1/2 of the phone book in my little city). Even if you have an
algorithm that identifies a "Van" within the name, you need to be sure you
are not treating "Van Smith" as a last name or Mary Van Dam Zylstra as if
her former last name, now serving as the middle name, were the start of a
three-part last name. There are also more suffixes and prefixes than in
your algorithm as well as potentially multiple with any given name.

As an aside...
If starting out, rather than working with existing data, after some
investigation a couple of years ago regarding what to collect in an online
application as well as what to call the parts, I came down to 3 fields for
names. Because one part should go first and the other last, I did what some
consider politically incorrect with one tag as First name and the other as
Last name. I also have a Full name that is currently read-only (read-write
is future) defaulting initially as first : " " : last. Someone can put "Dr.
James A." as the first name and "Vander Ark Sr." as the last name, should
they choose to do so. Once implemented as a typical data field (rather than
as a materialized derived value), the full name could be "Dr. James A.
Vander Ark Sr." with a first name as Jim and a last name as Vander Ark, for
example. Skipping fields for middle name, prefix, and suffix might seem
overly simplistic, but with these three fields a large percentage of
possible requirements is covered and some risks are mitigated. Using these
names to identify members of the same family across cultures is not,
however.  --dawn

On Tue, Dec 13, 2011 at 5:12 PM, Wjhonson  wrote:

>
> 0044:  SUFFIXES = ",JR,SR,MD,III,"
> 0045:  S.NAME = DCOUNT(UM.NAME,' ')
> 0046:  LAST.WORD.IN.NAME = FIELD(UM.NAME,' ',S.NAME)
> 0047:  IF INDEX(SUFFIXES,",":LAST.WORD.IN.NAME:",",1) THEN
> 0048: LAST.NAME = FIELD(UM.NAME,' ',S.NAME-1)
> 0049:  END ELSE
> 0050: LAST.NAME = LAST.WORD.IN.NAME
> 0051:  END
> 0052:  PREFIXES = ',DR,MR,MS,MISS,MRS,'
> 0053:  FIRST.WORD.IN.NAME = FIELD(UM.NAME,' ',1)
> 0054:  IF INDEX(PREFIXES,",":FIRST.WORD.IN.NAME:",",1) THEN
> 0055: FIRST.NAME = FIELD(UM.NAME,' ',2)
> 0056:  END ELSE
> 0057: FIRST.NAME = FIRST.WORD.IN.NAME
> 0058:  END
> ___
> U2-Users mailing list
> U2-Users@listserver.u2ug.org
> http://listserver.u2ug.org/mailman/listinfo/u2-users
>



-- 
Dawn M. Wolthuis

Take and give some delight today
___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


Re: [U2] Extract first and last name from free-form name

2011-12-14 Thread Charlie Noah
And how about Count Darling IV? Count is actually his first name. I 
haven't heard anything about him in years.


Charlie

On 12-14-2011 7:47 AM, Wjhonson wrote:

  Or in our system we have entries like
Dean Suarez Smith

In actuality the customer states that "Suarez Smith" is a double last name, not 
a middle name and last name
And Dean is their title... or no it's their first name...

Actually Doctor can be a first name as well.
It's a mess





-Original Message-
From: Mecki Foerthmann
To: u2-users
Sent: Wed, Dec 14, 2011 5:09 am
Subject: Re: [U2] Extract first and last name from free-form name


Just face it - it can't be done!
So what if Dean has 2 first names and is a plumber?

On 14/12/2011 09:57, Symeon Breen wrote:

You need to do a proper lexical analysis in order to work these out

For example

Input : Dean Foster
Lex: title word

Input: Dean Reginald McGraw
Lex: title word word



Then set rules to say a lex of "title word" is probable worked out as
"forename surname"  and a lex of "title word word" is probably "title
forename surname"   you can assign probabilities against these and build
some self learning in.

It is a whole massive topic.




-Original Message-
From: u2-users-boun...@listserver.u2ug.org
[mailto:u2-users-boun...@listserver.u2ug.org] On Behalf Of Mecki Foerthmann
Sent: 14 December 2011 08:22
To: u2-users@listserver.u2ug.org
Subject: Re: [U2] Extract first and last name from free-form name

And the list goes on and on and...
That's why free form names are an absolute pain and should be avoided.
It's so much easier to have Title, First Name(s), Last Name(s) fields in the
input screen and keep them as separate attributes.
You never get it 100% right.
In your list take Dean or Prince for instance - they could be first names
and not titles at all.
A colleague of mine tried a last name upper to lower case conversion
including Irish and Scottish names and out of Machine Co it made MacHine Co.
And don't even ask what happened to last names starting with O.:-)


On 14/12/2011 01:02, Charlie Noah wrote:

Great start, but here is a longer list, although still nowhere near
complete:

Prefixes

Code Description
1st Lt   First Lieutenant
Adm  Admiral
Atty Attorney
Brother  Brother (religious)
Capt Captain
ChiefChief
Cmdr Commander
Col  Colonel
Dean University Dean (includes Assistant and Associate)
Dr   Doctor (Medical or Educator)
ElderElder (religious)
Father   Father (religious)
Gen  General
Gov  Governor
Hon  Honorable (Cabinet Officer, Commissioner, Congressman, Judge,
etc.)
Lt Col   Lieutenant Colonel
Maj  Major
MSgt Major/Master Sergeant
Mr   Mister
Mrs  Married Woman
Ms   Single or Married Woman
Prince   Prince
Prof Professor (includes Assistant and Associate
RabbiRabbi (religious)
Rev  Reverend (religious)
Sister   Sister (religious)

Suffixes

Code Description
II   The Second
III  The Third
IV   The Fourth
VThe Fifth
CPA  Certified Public Accountant
DDS  Doctor of Dental Medicine
Esq  Esquire
JD   Jurist Doctor
Jr   Junior
Jnr  Junior (British)
LLD  Doctor of Laws
MD   Doctor of Medicine
PhD  Doctorate
Ret  Retired from Armed Forces
RN   Registered Nurse
RPh  Registered Pharmacist
Sr   Senior
Snr  Senior (British)
DO   Doctor of Osteopathy

Perhaps others can add more to the list.

Regards,
Charlie Noah

Tiny Bear's Wild Bird Store
"Everything For The Backyard Bird Enthusiast, Except For The Birds"
Info, Forum:  http://www.TinyBearMarketing.com
Store:http://Stores.TinyBearMarketing.com


On 12-13-2011 5:12 PM, Wjhonson wrote:

0044:  SUFFIXES = ",JR,SR,MD,III,"
0045:  S.NAME = DCOUNT(UM.NAME,' ')
0046:  LAST.WORD.IN.NAME = FIELD(UM.NAME,' ',S.NAME)
0047:  IF INDEX(SUFFIXES,",":LAST.WORD.IN.NAME:",",1) THEN
0048: LAST.NAME = FIELD(UM.NAME,' ',S.NAME-1)
0049:  END ELSE
0050: LAST.NAME = LAST.WORD.IN.NAME
0051:  END
0052:  PREFIXES = ',DR,MR,MS,MISS,MRS,'
0053:  FIRST.WORD.IN.NAME = FIELD(UM.NAME,' ',1)
0054:  IF INDEX(PREFIXES,",":FIRST.WORD.IN.NAME:",",1) THEN
0055: FIRST.NAME = FIELD(UM.NAME,' ',2)
0056:  END ELSE
0057: FIRST.NAME = FIRST.WORD.IN.NAME
0058:  END
___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users

___
U2-Users mailing list
U2-Users@l

Re: [U2] Extract first and last name from free-form name

2011-12-14 Thread Charlie Noah

Hi Mecki,

I absolutely agree. Of course, it's hard enough to get a trained 
(hopefully) data entry person to get it right, just imagine all those 
web customers filling out online order forms. The best you can hope for 
is to get most right, and enough time to look over the rest for errors.


Then there's the problem of capitalization. There are probably as many 
exceptions as there are "rules". Oh well, that's why they pay us the big 
bucks, right? ;^)


Regards,
Charlie

Tiny Bear's Wild Bird Store
"Everything For The Backyard Bird Enthusiast, Except For The Birds"
Info, Forum:  http://www.TinyBearMarketing.com
Store:http://Stores.TinyBearMarketing.com


On 12-14-2011 2:22 AM, Mecki Foerthmann wrote:

And the list goes on and on and...
That's why free form names are an absolute pain and should be avoided.
It's so much easier to have Title, First Name(s), Last Name(s) fields 
in the input screen and keep them as separate attributes.

You never get it 100% right.
In your list take Dean or Prince for instance - they could be first 
names and not titles at all.
A colleague of mine tried a last name upper to lower case conversion 
including Irish and Scottish names and out of Machine Co it made 
MacHine Co.

And don't even ask what happened to last names starting with O.:-)


On 14/12/2011 01:02, Charlie Noah wrote:
Great start, but here is a longer list, although still nowhere near 
complete:


Prefixes

Code Description
1st Lt   First Lieutenant
Adm  Admiral
Atty Attorney
Brother  Brother (religious)
Capt Captain
ChiefChief
Cmdr Commander
Col  Colonel
Dean University Dean (includes Assistant and Associate)
Dr   Doctor (Medical or Educator)
ElderElder (religious)
Father   Father (religious)
Gen  General
Gov  Governor
Hon  Honorable (Cabinet Officer, Commissioner, Congressman, 
Judge, etc.)

Lt Col   Lieutenant Colonel
Maj  Major
MSgt Major/Master Sergeant
Mr   Mister
Mrs  Married Woman
Ms   Single or Married Woman
Prince   Prince
Prof Professor (includes Assistant and Associate
RabbiRabbi (religious)
Rev  Reverend (religious)
Sister   Sister (religious)

Suffixes

Code Description
II   The Second
III  The Third
IV   The Fourth
VThe Fifth
CPA  Certified Public Accountant
DDS  Doctor of Dental Medicine
Esq  Esquire
JD   Jurist Doctor
Jr   Junior
Jnr  Junior (British)
LLD  Doctor of Laws
MD   Doctor of Medicine
PhD  Doctorate
Ret  Retired from Armed Forces
RN   Registered Nurse
RPh  Registered Pharmacist
Sr   Senior
Snr  Senior (British)
DO   Doctor of Osteopathy

Perhaps others can add more to the list.

Regards,
Charlie Noah

Tiny Bear's Wild Bird Store
"Everything For The Backyard Bird Enthusiast, Except For The Birds"
Info, Forum:  http://www.TinyBearMarketing.com
Store:http://Stores.TinyBearMarketing.com


On 12-13-2011 5:12 PM, Wjhonson wrote:

0044:  SUFFIXES = ",JR,SR,MD,III,"
0045:  S.NAME = DCOUNT(UM.NAME,' ')
0046:  LAST.WORD.IN.NAME = FIELD(UM.NAME,' ',S.NAME)
0047:  IF INDEX(SUFFIXES,",":LAST.WORD.IN.NAME:",",1) THEN
0048: LAST.NAME = FIELD(UM.NAME,' ',S.NAME-1)
0049:  END ELSE
0050: LAST.NAME = LAST.WORD.IN.NAME
0051:  END
0052:  PREFIXES = ',DR,MR,MS,MISS,MRS,'
0053:  FIRST.WORD.IN.NAME = FIELD(UM.NAME,' ',1)
0054:  IF INDEX(PREFIXES,",":FIRST.WORD.IN.NAME:",",1) THEN
0055: FIRST.NAME = FIELD(UM.NAME,' ',2)
0056:  END ELSE
0057: FIRST.NAME = FIRST.WORD.IN.NAME
0058:  END
___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users

___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


Re: [U2] Extract first and last name from free-form name

2011-12-14 Thread Wjhonson

 Or in our system we have entries like
Dean Suarez Smith

In actuality the customer states that "Suarez Smith" is a double last name, not 
a middle name and last name
And Dean is their title... or no it's their first name...

Actually Doctor can be a first name as well.
It's a mess

 

 

-Original Message-
From: Mecki Foerthmann 
To: u2-users 
Sent: Wed, Dec 14, 2011 5:09 am
Subject: Re: [U2] Extract first and last name from free-form name


Just face it - it can't be done!
So what if Dean has 2 first names and is a plumber?

On 14/12/2011 09:57, Symeon Breen wrote:
> You need to do a proper lexical analysis in order to work these out
>
> For example
>
> Input : Dean Foster
> Lex: title word
>
> Input: Dean Reginald McGraw
> Lex: title word word
>
>
>
> Then set rules to say a lex of "title word" is probable worked out as
> "forename surname"  and a lex of "title word word" is probably "title
> forename surname"   you can assign probabilities against these and build
> some self learning in.
>
> It is a whole massive topic.
>
>
>
>
> -Original Message-
> From: u2-users-boun...@listserver.u2ug.org
> [mailto:u2-users-boun...@listserver.u2ug.org] On Behalf Of Mecki Foerthmann
> Sent: 14 December 2011 08:22
> To: u2-users@listserver.u2ug.org
> Subject: Re: [U2] Extract first and last name from free-form name
>
> And the list goes on and on and...
> That's why free form names are an absolute pain and should be avoided.
> It's so much easier to have Title, First Name(s), Last Name(s) fields in the
> input screen and keep them as separate attributes.
> You never get it 100% right.
> In your list take Dean or Prince for instance - they could be first names
> and not titles at all.
> A colleague of mine tried a last name upper to lower case conversion
> including Irish and Scottish names and out of Machine Co it made MacHine Co.
> And don't even ask what happened to last names starting with O.:-)
>
>
> On 14/12/2011 01:02, Charlie Noah wrote:
>> Great start, but here is a longer list, although still nowhere near
>> complete:
>>
>> Prefixes
>>
>> Code Description
>> 1st Lt   First Lieutenant
>> Adm  Admiral
>> Atty Attorney
>> Brother  Brother (religious)
>> Capt Captain
>> ChiefChief
>> Cmdr Commander
>> Col  Colonel
>> Dean University Dean (includes Assistant and Associate)
>> Dr   Doctor (Medical or Educator)
>> ElderElder (religious)
>> Father   Father (religious)
>> Gen  General
>> Gov  Governor
>> Hon  Honorable (Cabinet Officer, Commissioner, Congressman, Judge,
>> etc.)
>> Lt Col   Lieutenant Colonel
>> Maj  Major
>> MSgt Major/Master Sergeant
>> Mr   Mister
>> Mrs  Married Woman
>> Ms   Single or Married Woman
>> Prince   Prince
>> Prof Professor (includes Assistant and Associate
>> RabbiRabbi (religious)
>> Rev  Reverend (religious)
>> Sister   Sister (religious)
>>
>> Suffixes
>>
>> Code Description
>> II   The Second
>> III  The Third
>> IV   The Fourth
>> VThe Fifth
>> CPA  Certified Public Accountant
>> DDS  Doctor of Dental Medicine
>> Esq  Esquire
>> JD   Jurist Doctor
>> Jr   Junior
>> Jnr  Junior (British)
>> LLD  Doctor of Laws
>> MD   Doctor of Medicine
>> PhD  Doctorate
>> Ret  Retired from Armed Forces
>> RN   Registered Nurse
>> RPh  Registered Pharmacist
>> Sr   Senior
>> Snr  Senior (British)
>> DO   Doctor of Osteopathy
>>
>> Perhaps others can add more to the list.
>>
>> Regards,
>> Charlie Noah
>>
>> Tiny Bear's Wild Bird Store
>> "Everything For The Backyard Bird Enthusiast, Except For The Birds"
>> Info, Forum:  http://www.TinyBearMarketing.com
>> Store:http://Stores.TinyBearMarketing.com
>>
>>
>> On 12-13-2011 5:12 PM, Wjhonson wrote:
>>> 0044:  SUFFIXES = ",JR,SR,MD,III,"
>>> 0045:  S.NAME = DCOUNT(UM.NAME,' ')
>>> 0046:  LAST.WORD.IN.NAME = FIELD(UM.NAME,' ',S.NAME)
>>> 0047:  IF INDEX(SUFFIXES,",":LAST.WORD.IN.NAME:",",1) THEN
>>> 0048: LAST.NAME = FIELD(UM.NAME,' ',S.NAME-1)
>>> 0049:  END ELSE
>>> 0050: LAST.NAME = LAST.WORD.IN.NAME
>>> 0051:  END

Re: [U2] Extract first and last name from free-form name

2011-12-14 Thread Mecki Foerthmann

Just face it - it can't be done!
So what if Dean has 2 first names and is a plumber?

On 14/12/2011 09:57, Symeon Breen wrote:

You need to do a proper lexical analysis in order to work these out

For example

Input : Dean Foster
Lex: title word

Input: Dean Reginald McGraw
Lex: title word word



Then set rules to say a lex of "title word" is probable worked out as
"forename surname"  and a lex of "title word word" is probably "title
forename surname"   you can assign probabilities against these and build
some self learning in.

It is a whole massive topic.




-Original Message-
From: u2-users-boun...@listserver.u2ug.org
[mailto:u2-users-boun...@listserver.u2ug.org] On Behalf Of Mecki Foerthmann
Sent: 14 December 2011 08:22
To: u2-users@listserver.u2ug.org
Subject: Re: [U2] Extract first and last name from free-form name

And the list goes on and on and...
That's why free form names are an absolute pain and should be avoided.
It's so much easier to have Title, First Name(s), Last Name(s) fields in the
input screen and keep them as separate attributes.
You never get it 100% right.
In your list take Dean or Prince for instance - they could be first names
and not titles at all.
A colleague of mine tried a last name upper to lower case conversion
including Irish and Scottish names and out of Machine Co it made MacHine Co.
And don't even ask what happened to last names starting with O.:-)


On 14/12/2011 01:02, Charlie Noah wrote:

Great start, but here is a longer list, although still nowhere near
complete:

Prefixes

Code Description
1st Lt   First Lieutenant
Adm  Admiral
Atty Attorney
Brother  Brother (religious)
Capt Captain
ChiefChief
Cmdr Commander
Col  Colonel
Dean University Dean (includes Assistant and Associate)
Dr   Doctor (Medical or Educator)
ElderElder (religious)
Father   Father (religious)
Gen  General
Gov  Governor
Hon  Honorable (Cabinet Officer, Commissioner, Congressman, Judge,
etc.)
Lt Col   Lieutenant Colonel
Maj  Major
MSgt Major/Master Sergeant
Mr   Mister
Mrs  Married Woman
Ms   Single or Married Woman
Prince   Prince
Prof Professor (includes Assistant and Associate
RabbiRabbi (religious)
Rev  Reverend (religious)
Sister   Sister (religious)

Suffixes

Code Description
II   The Second
III  The Third
IV   The Fourth
VThe Fifth
CPA  Certified Public Accountant
DDS  Doctor of Dental Medicine
Esq  Esquire
JD   Jurist Doctor
Jr   Junior
Jnr  Junior (British)
LLD  Doctor of Laws
MD   Doctor of Medicine
PhD  Doctorate
Ret  Retired from Armed Forces
RN   Registered Nurse
RPh  Registered Pharmacist
Sr   Senior
Snr  Senior (British)
DO   Doctor of Osteopathy

Perhaps others can add more to the list.

Regards,
Charlie Noah

Tiny Bear's Wild Bird Store
"Everything For The Backyard Bird Enthusiast, Except For The Birds"
Info, Forum:  http://www.TinyBearMarketing.com
Store:http://Stores.TinyBearMarketing.com


On 12-13-2011 5:12 PM, Wjhonson wrote:

0044:  SUFFIXES = ",JR,SR,MD,III,"
0045:  S.NAME = DCOUNT(UM.NAME,' ')
0046:  LAST.WORD.IN.NAME = FIELD(UM.NAME,' ',S.NAME)
0047:  IF INDEX(SUFFIXES,",":LAST.WORD.IN.NAME:",",1) THEN
0048: LAST.NAME = FIELD(UM.NAME,' ',S.NAME-1)
0049:  END ELSE
0050: LAST.NAME = LAST.WORD.IN.NAME
0051:  END
0052:  PREFIXES = ',DR,MR,MS,MISS,MRS,'
0053:  FIRST.WORD.IN.NAME = FIELD(UM.NAME,' ',1)
0054:  IF INDEX(PREFIXES,",":FIRST.WORD.IN.NAME:",",1) THEN
0055: FIRST.NAME = FIELD(UM.NAME,' ',2)
0056:  END ELSE
0057: FIRST.NAME = FIRST.WORD.IN.NAME
0058:  END
___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users

___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users
-
No virus found in this message.
Checked by AVG - www.avg.com
Version: 10.0.1415 / Virus Database: 2102/4079 - Release Date: 12/13/11

___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users

___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


Re: [U2] Extract first and last name from free-form name

2011-12-14 Thread Symeon Breen
You need to do a proper lexical analysis in order to work these out

For example 

Input : Dean Foster
Lex: title word

Input: Dean Reginald McGraw
Lex: title word word



Then set rules to say a lex of "title word" is probable worked out as
"forename surname"  and a lex of "title word word" is probably "title
forename surname"   you can assign probabilities against these and build
some self learning in.  

It is a whole massive topic.




-Original Message-
From: u2-users-boun...@listserver.u2ug.org
[mailto:u2-users-boun...@listserver.u2ug.org] On Behalf Of Mecki Foerthmann
Sent: 14 December 2011 08:22
To: u2-users@listserver.u2ug.org
Subject: Re: [U2] Extract first and last name from free-form name

And the list goes on and on and...
That's why free form names are an absolute pain and should be avoided.
It's so much easier to have Title, First Name(s), Last Name(s) fields in the
input screen and keep them as separate attributes.
You never get it 100% right.
In your list take Dean or Prince for instance - they could be first names
and not titles at all.
A colleague of mine tried a last name upper to lower case conversion
including Irish and Scottish names and out of Machine Co it made MacHine Co.
And don't even ask what happened to last names starting with O.:-)


On 14/12/2011 01:02, Charlie Noah wrote:
> Great start, but here is a longer list, although still nowhere near
> complete:
>
> Prefixes
>
> Code Description
> 1st Lt   First Lieutenant
> Adm  Admiral
> Atty Attorney
> Brother  Brother (religious)
> Capt Captain
> ChiefChief
> Cmdr Commander
> Col  Colonel
> Dean University Dean (includes Assistant and Associate)
> Dr   Doctor (Medical or Educator)
> ElderElder (religious)
> Father   Father (religious)
> Gen  General
> Gov  Governor
> Hon  Honorable (Cabinet Officer, Commissioner, Congressman, Judge, 
> etc.)
> Lt Col   Lieutenant Colonel
> Maj  Major
> MSgt Major/Master Sergeant
> Mr   Mister
> Mrs  Married Woman
> Ms   Single or Married Woman
> Prince   Prince
> Prof Professor (includes Assistant and Associate
> RabbiRabbi (religious)
> Rev  Reverend (religious)
> Sister   Sister (religious)
>
> Suffixes
>
> Code Description
> II   The Second
> III  The Third
> IV   The Fourth
> VThe Fifth
> CPA  Certified Public Accountant
> DDS  Doctor of Dental Medicine
> Esq  Esquire
> JD   Jurist Doctor
> Jr   Junior
> Jnr  Junior (British)
> LLD  Doctor of Laws
> MD   Doctor of Medicine
> PhD  Doctorate
> Ret  Retired from Armed Forces
> RN   Registered Nurse
> RPh  Registered Pharmacist
> Sr   Senior
> Snr  Senior (British)
> DO   Doctor of Osteopathy
>
> Perhaps others can add more to the list.
>
> Regards,
> Charlie Noah
>
> Tiny Bear's Wild Bird Store
> "Everything For The Backyard Bird Enthusiast, Except For The Birds"
> Info, Forum:  http://www.TinyBearMarketing.com
> Store:http://Stores.TinyBearMarketing.com
>
>
> On 12-13-2011 5:12 PM, Wjhonson wrote:
>> 0044:  SUFFIXES = ",JR,SR,MD,III,"
>> 0045:  S.NAME = DCOUNT(UM.NAME,' ')
>> 0046:  LAST.WORD.IN.NAME = FIELD(UM.NAME,' ',S.NAME)
>> 0047:  IF INDEX(SUFFIXES,",":LAST.WORD.IN.NAME:",",1) THEN
>> 0048: LAST.NAME = FIELD(UM.NAME,' ',S.NAME-1)
>> 0049:  END ELSE
>> 0050: LAST.NAME = LAST.WORD.IN.NAME
>> 0051:  END
>> 0052:  PREFIXES = ',DR,MR,MS,MISS,MRS,'
>> 0053:  FIRST.WORD.IN.NAME = FIELD(UM.NAME,' ',1)
>> 0054:  IF INDEX(PREFIXES,",":FIRST.WORD.IN.NAME:",",1) THEN
>> 0055: FIRST.NAME = FIELD(UM.NAME,' ',2)
>> 0056:  END ELSE
>> 0057: FIRST.NAME = FIRST.WORD.IN.NAME
>> 0058:  END
>> ___
>> U2-Users mailing list
>> U2-Users@listserver.u2ug.org
>> http://listserver.u2ug.org/mailman/listinfo/u2-users
>>
> ___
> U2-Users mailing list
> U2-Users@listserver.u2ug.org
> http://listserver.u2ug.org/mailman/listinfo/u2-users
___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users
-
No virus found in this message.
Checked by AVG - www.avg.com
Version: 10.0.1415 / Virus Database: 2102/4079 - Release Date: 12/13/11

___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


Re: [U2] Extract first and last name from free-form name

2011-12-14 Thread George Gallen
Oops, 

This should have been it's considered a middle Only if there has been a non 
salutation or a non single letter part first found. (ie.  John L Smith, but NOT 
  J Smith - the J here is a firstname)



From: u2-users-boun...@listserver.u2ug.org 
[u2-users-boun...@listserver.u2ug.org] On Behalf Of George Gallen 
[ggal...@wyanokegroup.com]
Sent: Wednesday, December 14, 2011 4:19 AM
To: U2 Users List
Subject: Re: [U2] Extract first and last name from free-form name

If it's a single letter, it's considered a middle initial, ONLY if there has
   been no first name yet.
___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


Re: [U2] Extract first and last name from free-form name

2011-12-14 Thread George Gallen
Of course, this doesn't account for mulitple suffixes (ie. JR MD) 
Or if someone has a multi part last name (Del Rossi)
Or if a middle initial has been thrown into the name as well.

The routine I have, splits the name into a dynamic array on a space
and goes through the array one at a time, if it matches a salutation or suffix
   it's dropped into it's own dynamic array, however, if it matches a
   prefix, it must be within the first half of the name, and a suffix must
   be within the last half of the name (there are some salutation that can
   also be suffixes - ie. SR - Senior or Sister).
If it's a single letter, it's considered a middle initial, ONLY if there has
   been no first name yet.
if the part is one of a multi part last name, it's added the next part with
   a sub value mark, and rechecked (after eliminating the next part).

What's left depending on how many parts will be a FN Middle Name or LN,
  FN, LN or just a LN (if there is a first name initial).

Subvalues and Attribute marks are converted back to spaces

Returning 5 parts,  Salution, FN, Middle name, LN, Suffix

I use the same method of having salution file, and a suffix file and combined 
last name file
As we find failures based on unknown saluation, suffixes, they are added to the 
file.

It works excellent on the normal formatted names.


George Gallen
Senior Programmer/Analyst
Accounting/Data Division
ggal...@wyanokegroup.com
ph:856.848.9005 Ext 220
The Wyanoke Group
http://www.wyanokegroup.com

From: u2-users-boun...@listserver.u2ug.org 
[u2-users-boun...@listserver.u2ug.org] On Behalf Of Charlie Noah 
[cwn...@comcast.net]
Sent: Tuesday, December 13, 2011 8:02 PM
To: U2 Users List
Subject: Re: [U2] Extract first and last name from free-form name

Great start, but here is a longer list, although still nowhere near
complete:

Prefixes

Code Description
1st Lt   First Lieutenant
Adm  Admiral
Atty Attorney
Brother  Brother (religious)
Capt Captain
ChiefChief
Cmdr Commander
Col  Colonel
Dean University Dean (includes Assistant and Associate)
Dr   Doctor (Medical or Educator)
ElderElder (religious)
Father   Father (religious)
Gen  General
Gov  Governor
Hon  Honorable (Cabinet Officer, Commissioner, Congressman, Judge, etc.)
Lt Col   Lieutenant Colonel
Maj  Major
MSgt Major/Master Sergeant
Mr   Mister
Mrs  Married Woman
Ms   Single or Married Woman
Prince   Prince
Prof Professor (includes Assistant and Associate
RabbiRabbi (religious)
Rev  Reverend (religious)
Sister   Sister (religious)

Suffixes

Code Description
II   The Second
III  The Third
IV   The Fourth
VThe Fifth
CPA  Certified Public Accountant
DDS  Doctor of Dental Medicine
Esq  Esquire
JD   Jurist Doctor
Jr   Junior
Jnr  Junior (British)
LLD  Doctor of Laws
MD   Doctor of Medicine
PhD  Doctorate
Ret  Retired from Armed Forces
RN   Registered Nurse
RPh  Registered Pharmacist
Sr   Senior
Snr  Senior (British)
DO   Doctor of Osteopathy

Perhaps others can add more to the list.

Regards,
Charlie Noah

Tiny Bear's Wild Bird Store
"Everything For The Backyard Bird Enthusiast, Except For The Birds"
Info, Forum:  http://www.TinyBearMarketing.com
Store:http://Stores.TinyBearMarketing.com


On 12-13-2011 5:12 PM, Wjhonson wrote:
> 0044:  SUFFIXES = ",JR,SR,MD,III,"
> 0045:  S.NAME = DCOUNT(UM.NAME,' ')
> 0046:  LAST.WORD.IN.NAME = FIELD(UM.NAME,' ',S.NAME)
> 0047:  IF INDEX(SUFFIXES,",":LAST.WORD.IN.NAME:",",1) THEN
> 0048: LAST.NAME = FIELD(UM.NAME,' ',S.NAME-1)
> 0049:  END ELSE
> 0050: LAST.NAME = LAST.WORD.IN.NAME
> 0051:  END
> 0052:  PREFIXES = ',DR,MR,MS,MISS,MRS,'
> 0053:  FIRST.WORD.IN.NAME = FIELD(UM.NAME,' ',1)
> 0054:  IF INDEX(PREFIXES,",":FIRST.WORD.IN.NAME:",",1) THEN
> 0055: FIRST.NAME = FIELD(UM.NAME,' ',2)
> 0056:  END ELSE
> 0057: FIRST.NAME = FIRST.WORD.IN.NAME
> 0058:  END
> ___
> U2-Users mailing list
> U2-Users@listserver.u2ug.org
> http://listserver.u2ug.org/mailman/listinfo/u2-users
>
___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users
___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


Re: [U2] Extract first and last name from free-form name

2011-12-14 Thread Dan Fitzgerald

That's pretty good. 
 
A few years back, Multnomah County (Portland), Oregon, added  "Klingon" to the 
official list of interpreter services it would offer in the Mental Health 
department. 
 
> To: u2-users@listserver.u2ug.org
> From: bradley.sch...@usbank.com
> Date: Wed, 14 Dec 2011 02:57:28 -0600
> Subject: Re: [U2] Extract first and last name from free-form name
> 
> This thread reminded me of a blog posting I ran across a while ago. His 
> basic premise is that most of our assumptions about names are incorrect, 
> which makes coding a computer to handle them properly is really tough if 
> not impossible. Don't skip the comments section, there's some good stuff 
> there.
> 
> http://www.kalzumeus.com/2010/06/17/falsehoods-programmers-believe-about-names
> 
> 
> 
> U.S. BANCORP made the following annotations
> -
> Electronic Privacy Notice. This e-mail, and any attachments, contains 
> information that is, or may be, covered by electronic communications privacy 
> laws, and is also confidential and proprietary in nature. If you are not the 
> intended recipient, please be advised that you are legally prohibited from 
> retaining, using, copying, distributing, or otherwise disclosing this 
> information in any manner. Instead, please reply to the sender that you have 
> received this communication in error, and then immediately delete it. Thank 
> you in advance for your cooperation.
> 
> 
> 
> -
> 
> ___
> U2-Users mailing list
> U2-Users@listserver.u2ug.org
> http://listserver.u2ug.org/mailman/listinfo/u2-users
  
___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


Re: [U2] Extract first and last name from free-form name

2011-12-14 Thread bradley . schrag
This thread reminded me of a blog posting I ran across a while ago. His 
basic premise is that most of our assumptions about names are incorrect, 
which makes coding a computer to handle them properly is really tough if 
not impossible. Don't skip the comments section, there's some good stuff 
there.

http://www.kalzumeus.com/2010/06/17/falsehoods-programmers-believe-about-names



U.S. BANCORP made the following annotations
-
Electronic Privacy Notice. This e-mail, and any attachments, contains 
information that is, or may be, covered by electronic communications privacy 
laws, and is also confidential and proprietary in nature. If you are not the 
intended recipient, please be advised that you are legally prohibited from 
retaining, using, copying, distributing, or otherwise disclosing this 
information in any manner. Instead, please reply to the sender that you have 
received this communication in error, and then immediately delete it. Thank you 
in advance for your cooperation.



-

___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


Re: [U2] Extract first and last name from free-form name

2011-12-14 Thread Mecki Foerthmann

And the list goes on and on and...
That's why free form names are an absolute pain and should be avoided.
It's so much easier to have Title, First Name(s), Last Name(s) fields in 
the input screen and keep them as separate attributes.

You never get it 100% right.
In your list take Dean or Prince for instance - they could be first 
names and not titles at all.
A colleague of mine tried a last name upper to lower case conversion 
including Irish and Scottish names and out of Machine Co it made MacHine Co.

And don't even ask what happened to last names starting with O.:-)


On 14/12/2011 01:02, Charlie Noah wrote:
Great start, but here is a longer list, although still nowhere near 
complete:


Prefixes

Code Description
1st Lt   First Lieutenant
Adm  Admiral
Atty Attorney
Brother  Brother (religious)
Capt Captain
ChiefChief
Cmdr Commander
Col  Colonel
Dean University Dean (includes Assistant and Associate)
Dr   Doctor (Medical or Educator)
ElderElder (religious)
Father   Father (religious)
Gen  General
Gov  Governor
Hon  Honorable (Cabinet Officer, Commissioner, Congressman, Judge, 
etc.)

Lt Col   Lieutenant Colonel
Maj  Major
MSgt Major/Master Sergeant
Mr   Mister
Mrs  Married Woman
Ms   Single or Married Woman
Prince   Prince
Prof Professor (includes Assistant and Associate
RabbiRabbi (religious)
Rev  Reverend (religious)
Sister   Sister (religious)

Suffixes

Code Description
II   The Second
III  The Third
IV   The Fourth
VThe Fifth
CPA  Certified Public Accountant
DDS  Doctor of Dental Medicine
Esq  Esquire
JD   Jurist Doctor
Jr   Junior
Jnr  Junior (British)
LLD  Doctor of Laws
MD   Doctor of Medicine
PhD  Doctorate
Ret  Retired from Armed Forces
RN   Registered Nurse
RPh  Registered Pharmacist
Sr   Senior
Snr  Senior (British)
DO   Doctor of Osteopathy

Perhaps others can add more to the list.

Regards,
Charlie Noah

Tiny Bear's Wild Bird Store
"Everything For The Backyard Bird Enthusiast, Except For The Birds"
Info, Forum:  http://www.TinyBearMarketing.com
Store:http://Stores.TinyBearMarketing.com


On 12-13-2011 5:12 PM, Wjhonson wrote:

0044:  SUFFIXES = ",JR,SR,MD,III,"
0045:  S.NAME = DCOUNT(UM.NAME,' ')
0046:  LAST.WORD.IN.NAME = FIELD(UM.NAME,' ',S.NAME)
0047:  IF INDEX(SUFFIXES,",":LAST.WORD.IN.NAME:",",1) THEN
0048: LAST.NAME = FIELD(UM.NAME,' ',S.NAME-1)
0049:  END ELSE
0050: LAST.NAME = LAST.WORD.IN.NAME
0051:  END
0052:  PREFIXES = ',DR,MR,MS,MISS,MRS,'
0053:  FIRST.WORD.IN.NAME = FIELD(UM.NAME,' ',1)
0054:  IF INDEX(PREFIXES,",":FIRST.WORD.IN.NAME:",",1) THEN
0055: FIRST.NAME = FIELD(UM.NAME,' ',2)
0056:  END ELSE
0057: FIRST.NAME = FIRST.WORD.IN.NAME
0058:  END
___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users

___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


Re: [U2] Extract first and last name from free-form name

2011-12-13 Thread Tony Gravagno
This reminds me that around 1990 I wrote functions like this in F
correlatives that wrapped around the screen several times, one
dict item for each name part.  Oh joy.
 

> From: Charlie Noah
> Great start, but here is a longer list, although still nowhere
near 
> complete:
> [snip]

> Wjhonson wrote:
> > 0044:  SUFFIXES = ",JR,SR,MD,III,"
> > 0045:  S.NAME = DCOUNT(UM.NAME,' ')
> > 0046:  LAST.WORD.IN.NAME = FIELD(UM.NAME,' ',S.NAME)
> > [snip]

___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


Re: [U2] Extract first and last name from free-form name

2011-12-13 Thread Charlie Noah
Great start, but here is a longer list, although still nowhere near 
complete:


Prefixes

Code Description
1st Lt   First Lieutenant
Adm  Admiral
Atty Attorney
Brother  Brother (religious)
Capt Captain
ChiefChief
Cmdr Commander
Col  Colonel
Dean University Dean (includes Assistant and Associate)
Dr   Doctor (Medical or Educator)
ElderElder (religious)
Father   Father (religious)
Gen  General
Gov  Governor
Hon  Honorable (Cabinet Officer, Commissioner, Congressman, Judge, etc.)
Lt Col   Lieutenant Colonel
Maj  Major
MSgt Major/Master Sergeant
Mr   Mister
Mrs  Married Woman
Ms   Single or Married Woman
Prince   Prince
Prof Professor (includes Assistant and Associate
RabbiRabbi (religious)
Rev  Reverend (religious)
Sister   Sister (religious)

Suffixes

Code Description
II   The Second
III  The Third
IV   The Fourth
VThe Fifth
CPA  Certified Public Accountant
DDS  Doctor of Dental Medicine
Esq  Esquire
JD   Jurist Doctor
Jr   Junior
Jnr  Junior (British)
LLD  Doctor of Laws
MD   Doctor of Medicine
PhD  Doctorate
Ret  Retired from Armed Forces
RN   Registered Nurse
RPh  Registered Pharmacist
Sr   Senior
Snr  Senior (British)
DO   Doctor of Osteopathy

Perhaps others can add more to the list.

Regards,
Charlie Noah

Tiny Bear's Wild Bird Store
"Everything For The Backyard Bird Enthusiast, Except For The Birds"
Info, Forum:  http://www.TinyBearMarketing.com
Store:http://Stores.TinyBearMarketing.com


On 12-13-2011 5:12 PM, Wjhonson wrote:

0044:  SUFFIXES = ",JR,SR,MD,III,"
0045:  S.NAME = DCOUNT(UM.NAME,' ')
0046:  LAST.WORD.IN.NAME = FIELD(UM.NAME,' ',S.NAME)
0047:  IF INDEX(SUFFIXES,",":LAST.WORD.IN.NAME:",",1) THEN
0048: LAST.NAME = FIELD(UM.NAME,' ',S.NAME-1)
0049:  END ELSE
0050: LAST.NAME = LAST.WORD.IN.NAME
0051:  END
0052:  PREFIXES = ',DR,MR,MS,MISS,MRS,'
0053:  FIRST.WORD.IN.NAME = FIELD(UM.NAME,' ',1)
0054:  IF INDEX(PREFIXES,",":FIRST.WORD.IN.NAME:",",1) THEN
0055: FIRST.NAME = FIELD(UM.NAME,' ',2)
0056:  END ELSE
0057: FIRST.NAME = FIRST.WORD.IN.NAME
0058:  END
___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users


Re: [U2] Extract first and last name from free-form name

2011-12-13 Thread Wjhonson

0044:  SUFFIXES = ",JR,SR,MD,III,"
0045:  S.NAME = DCOUNT(UM.NAME,' ')
0046:  LAST.WORD.IN.NAME = FIELD(UM.NAME,' ',S.NAME)
0047:  IF INDEX(SUFFIXES,",":LAST.WORD.IN.NAME:",",1) THEN
0048: LAST.NAME = FIELD(UM.NAME,' ',S.NAME-1)
0049:  END ELSE
0050: LAST.NAME = LAST.WORD.IN.NAME
0051:  END
0052:  PREFIXES = ',DR,MR,MS,MISS,MRS,'
0053:  FIRST.WORD.IN.NAME = FIELD(UM.NAME,' ',1)
0054:  IF INDEX(PREFIXES,",":FIRST.WORD.IN.NAME:",",1) THEN
0055: FIRST.NAME = FIELD(UM.NAME,' ',2)
0056:  END ELSE
0057: FIRST.NAME = FIRST.WORD.IN.NAME
0058:  END
___
U2-Users mailing list
U2-Users@listserver.u2ug.org
http://listserver.u2ug.org/mailman/listinfo/u2-users