Re: [Pharo-users] Parsing text to discover general data of interest (phone, email, address, ...)

2019-03-07 Thread Cédrick Béler
> > > You say that named entity recognition is not generalised beyond Mail, > but the support library is there for anyone to use. See for > example > https://developer.apple.com/documentation/foundation/nslinguistictagger/identifying_people_places_and_organizations > >

Re: [Pharo-users] Parsing text to discover general data of interest (phone, email, address, ...)

2019-03-07 Thread Cédrick Béler
Hi Hernan, Really nice. I try it today. It might be what I need. I come back if installation pb. Cheers, Cédrick > Le 8 mars 2019 à 03:34, Hernán Morales Durand a > écrit : > > Hi Cédrick, > > I wrote some years ago an interface to a named-entity recognizer: >

Re: [Pharo-users] Parsing text to discover general data of interest (phone, email, address, ...)

2019-03-07 Thread Cédrick Béler
> Couldn't find anything in Smalltalk but that should you give ideas and > inspire you or get you started... > > https://github.com/search?q=contact+scraping=Repositories > > I guess we have all that's needed in Pharo : parsers (HTML, XML, > PetitParser), Soup & regex ! Yes for markup, I

Re: [Pharo-users] Parsing text to discover general data of interest (phone, email, address, ...)

2019-03-07 Thread Richard O'Keefe
You say that named entity recognition is not generalised beyond Mail, but the support library is there for anyone to use. See for example https://developer.apple.com/documentation/foundation/nslinguistictagger/identifying_people_places_and_organizations In Python, you can use NLTK to do roughly

Re: [Pharo-users] Parsing text to discover general data of interest (phone, email, address, ...)

2019-03-07 Thread Hernán Morales Durand
Hi Cédrick, I wrote some years ago an interface to a named-entity recognizer: https://80738163270632.blogspot.com/2015/02/stner-interface-to-stanford-named.html I think that was Pharo 5, so you may want to check if there are load problems in current Pharo. The blogger post didn't parsed

Re: [Pharo-users] Pharo 7.0 image size

2019-03-07 Thread Trussardi Dario Romano
Ciao, I thought that using Transcript as a report to analyze the operation of the code was a good thing - solution. Unfortunately, however, the size of the image continues to increase ( and I think it's due to the use of the transcript ) and i can not find the

Re: [Pharo-users] Load Tonel package from local folder

2019-03-07 Thread BrunoBB
Hi, Fixed: Iceberg enableMetacelloIntegration: false. I felt with this one again :) regards, bruno -- Sent from: http://forum.world.st/Pharo-Smalltalk-Users-f1310670.html

Re: [Pharo-users] Noob Question - slicing

2019-03-07 Thread Craig Johnson
From: Pharo-users [mailto:pharo-users-boun...@lists.pharo.org] On Behalf Of K K Subbu Sent: Wednesday, 06 March 2019 16:45 To: pharo-users@lists.pharo.org Subject: Re: [Pharo-users] Noob Question - slicing > '1234567890' copyFrom: 5 to: 8 "5678" > > Page 207 in Updated Pharo by Example book

Re: [Pharo-users] Load Tonel package from local folder

2019-03-07 Thread BrunoBB
Hi, It seems that my previous image was corrupted. In new Fresh Image the error is the following: IceLibgitErrorVisitor>>visitGenericError: IceLibgitErrorVisitor>>visitERROR: LGit_GIT_ERROR>>acceptError: [ :error | location exists ifTrue: [ location ensureDeleteAll ]. error acceptError:

[Pharo-users] Load Tonel package from local folder

2019-03-07 Thread BrunoBB
Hi, I have a very simple Baseline: baselineOf: spec spec for: #common do: [ spec baseline: 'NeoJSON' with: [ spec repository: 'github://svenvc/NeoJSON:master/repository' ]. spec baseline: 'P3' with: [

Re: [Pharo-users] Pharo 7.0 image size

2019-03-07 Thread Trussardi Dario Romano
Ciao, thanks. i have a Pharo 7.0 alpha build 1262. I development a Seaside application. > May be you have a lot of still open Seaside session, which if I remember > correctly are automatically shutdown after 10 min when not active. > > Looking at Seaside session

Re: [Pharo-users] Parsing text to discover general data of interest (phone, email, address, ...)

2019-03-07 Thread Cédrick Béler
> > > Cedrick > > Sorry for not answering sooner. Do not worry, this is because I haden’t properly searched before answering. > You have probably found what you want. Not yet. Loading. > I meant that ‘island + sea’ is now a standard pattern. Follow the link to the > tutorial from the

Re: [Pharo-users] Parsing text to discover general data of interest (phone, email, address, ...)

2019-03-07 Thread PBKResearch
Cedrick Sorry for not answering sooner. You have probably found what you want. I meant that ‘island + sea’ is now a standard pattern. Follow the link to the tutorial from the readme you referenced – the example of javascript in HTML might carry over well enough to your situation. Peter

Re: [Pharo-users] Parsing text to discover general data of interest (phone, email, address, ...)

2019-03-07 Thread Cédrick Béler
> > When you say "unstructured material ... is now a standard pattern in > PetitParser », how could I begin exploring that ? Any tutorials ? I’ll load it and play around. https://github.com/kursjan/petitparser2/blob/master/README.md

Re: [Pharo-users] Parsing text to discover general data of interest (phone, email, address, ...)

2019-03-07 Thread Cédrick Béler
> >> This also assumes that the items of interest are really structured; there >> could be many ways of writing phone numbers, for instance. > > Phone numbers are actually not easy… I see them as a limited sequence of > number (if not well structure) + eventually the +contrycode). > I’d like

Re: [Pharo-users] Parsing text to discover general data of interest (phone, email, address, ...)

2019-03-07 Thread Cédrick Béler
> > > Cédrick > > In principle, what you are asking for is to identify 'islands' of structured > information in a 'sea' of otherwise unstructured material, which is now a > standard pattern in PetitParser. Exactly :) > You could imagine a parser spec of the form: > > (sea optional,

Re: [Pharo-users] Parsing text to discover general data of interest (phone, email, address, ...)

2019-03-07 Thread PBKResearch
Cédrick In principle, what you are asking for is to identify 'islands' of structured information in a 'sea' of otherwise unstructured material, which is now a standard pattern in PetitParser. You could imagine a parser spec of the form: (sea optional, (email/phone/address/), sea optional)

Re: [Pharo-users] IMAP in Pharo

2019-03-07 Thread Tim Mackinnon
> On 7 Mar 2019, at 09:37, Cédrick Béler > wrote: > > I think maybe what we miss would be a tutorial and a sandbox project so > people can play with (quite intimidating to do his first PR the wrong way). Actually - when you take a deep breathe and plunge in, its not

Re: [Pharo-users] Parsing text to discover general data of interest (phone, email, address, ...)

2019-03-07 Thread Benoit St-Jean via Pharo-users
--- Begin Message --- Couldn't find anything in Smalltalk but that should you give ideas and inspire you or get you started... https://github.com/search?q=contact+scraping=Repositories I guess we have all that's needed in Pharo : parsers (HTML, XML, PetitParser), Soup & regex ! On

[Pharo-users] Loading Baseline from Smart HTTP git

2019-03-07 Thread Sebastián Filippini
Hi, My name is Sebastian and I'm new to Pharo and Smalltalk in general. I'm trying to load a baseline from a git server using Smart HTTP protocol. I'm using a self hosted git server with gitolite. I can clone the repository by http using: git clone http://user:password@host/repo-name. I've

Re: [Pharo-users] Why can't a Bag answer as a dictionary?

2019-03-07 Thread Cédrick Béler
> > #(1 2 3) asDictionary > For this one, I would expect a dictionary with integer keys I see a dictionary as an indexed collection with key that are not only integer (like for an array). 1->1 2->2 3->3 I doubt this would be useful though. My 2 cents, Cédrick

[Pharo-users] Parsing text to discover general data of interest (phone, email, address, ...)

2019-03-07 Thread Cédrick Béler
Hi all, I’ve often got the need to analyse some random unstructured text to discover (structured) information (in email for instance), to extract : - emails - telephone numbers - addresses - events - person names (according to a list of known persons), - etc… Apple do it in email for instance

Re: [Pharo-users] IMAP in Pharo

2019-03-07 Thread Cédrick Béler
> > >> On 6 Mar 2019, at 16:08, Sean P. DeNigris > > wrote: >> >> cedreek wrote >>> Still havent get my head around this FORK/PR cycle but I will for sure one >>> day :) >> >> It took me quite a while to grok. Fork = 'personal remote clone' and: [ PR = >> 'UI

Re: [Pharo-users] Fail on condense when image name is drgeo.image

2019-03-07 Thread Hilaire
Thanks Alistair. Great! I can now build DrGeo with the P7.0.1 image. I will check with the P7.0.2, but should be fine. Now the OS X DrGeo build behave much more gently, but I need to test a bit further. Hilaire Le 06/03/2019 à 08:53, Alistair Grant a écrit : > For future reference: > > Issue:

Re: [Pharo-users] Google Protobuf and usage of Slots

2019-03-07 Thread Holger Freyther
> On 4. Mar 2019, at 16:12, Marcus Denker wrote: > Thanks for the help! [ToC] I try to keep it brief so please tell me when to elaborate more. (1) Handling of default values. (2) Fast lazy initialization of OrderedCollection for repeated fields (3) Installation of selectors based on the

Re: [Pharo-users] Why can't a Bag answer as a dictionary?

2019-03-07 Thread Steffen Märcker
This is exactly how I think and feel about bags. Thanks Richard. Am .03.2019, 01:35 Uhr, schrieb Richard O'Keefe : To me, a bag is first and foremost a multiset. Yes, it's a collection, but it's not *just* a collection. It's a very specific kind of collection with a very salient

Re: [Pharo-users] Google Protobuf and usage of Slots

2019-03-07 Thread Holger Freyther
> On 5. Mar 2019, at 19:06, HenrikNergaard5 wrote: > > Hi Holger, > >> My plan of action is to start with a protobuf compiler/model and then look >> into binding the gRPC C implementation to Pharo. > > I have an almost complete parser and compiler (class generation) > implementation of