Re: machine learning forums

2018-08-06 Thread Sharan Basappa
> > I am quite new to Python. I am learning Python as I am interested in
> > machine learning. The issue is, I have not found any ML forum where
> > novices like me can get help. I have tried reddit and each of my posts
> > have gone unanswered. 
> 
> Which subreddits have you posted to?

its called machine learning

> > Looks like reddit forum prefers either abstract
> > topics on ML or very complex issues for discussions.
> > 
> > I have tried stackoverflow also but there only programming issues are
> > entertained 
> 
> I believe Stackoverflow has a dedicated machine-learning site, "Cross 
> Validated":
> 
> https://meta.stackexchange.com/questions/130524/which-stack-exchange-
> website-for-machine-learning-and-computational-algorithms
> 
> https://meta.stackexchange.com/questions/227757/where-to-ask-basic-
> questions-about-machine-learning

Thanks. This looks like a very good sight. I hope folks are friendlier than 
stackoverflow.
-- 
https://mail.python.org/mailman/listinfo/python-list


Re: machine learning forums

2018-08-06 Thread Sharan Basappa
On Monday, 6 August 2018 09:21:03 UTC+5:30, downtime  wrote:
> You might also try courses on Udemy and Udacity. I know Udemy is always
> having sales on courses for like $10.99. They have beginner/novice
> courses for all kinds of topics and in my experience, there are some
> pretty good ones.

Actually, I have already completed machine learning course by Andrew NG.
And then I spent close to 3 months learning Python as I quickly realized than 
without Python, it is nearly impossible to do any coding in ML area.

But the issue is, from time to time, I do get doubts. For Python, I have this 
forum but for ML, not much options. Now I know about cross validated though.
-- 
https://mail.python.org/mailman/listinfo/python-list


NLTK

2018-08-06 Thread mausg
I like to analyse text. my method consisted of something like 
words=text.split(), which would split the text into space-seperated 
units. then I tried to use the Python NLTK library, which had alot
of features I wanted, but using `word-tokenize' gives a different
 answer.-

What gives?.



-- 
m...@ireland.com
Will Rant For Food
-- 
https://mail.python.org/mailman/listinfo/python-list


Fishing from PyPI ?

2018-08-06 Thread Vincent Vande Vyvre

I've received a mail askink me to confirm my identity on PyPI.

The begining of the message:

-

Your account VinsS on PyPI 
 
does not have a verified primary email address.


Please take 2 minutes to verify your email:

 * To ensure that every email we send is going to its intended recipient
 * To ensure email sent from PyPI isn’t marked as spam by email clients
   and service
 * *If you do not verify, you will not be able to upload to PyPI, or be
   added as a maintainer to projects. You will no longer receive
   notifications from PyPI.*

   To verify, visit your Account Settings
   

   page.

-

The Account Settings 
 
is : 
https://pypi.us18.list-manage.com/track/click?u=b96f861be540deb98fd700265&id=1763135c31&e=8824682d0f


Phishing ? yes, no ?


Vincent

--
https://mail.python.org/mailman/listinfo/python-list


Re: Fishing from PyPI ?

2018-08-06 Thread Abdur-Rahmaan Janhangeer
me too received it but what makes you suspect?

Abdur-Rahmaan Janhangeer
https://github.com/Abdur-rahmaanJ
Mauritius

On Mon, 6 Aug 2018, 20:35 Vincent Vande Vyvre, <
vincent.vande.vy...@telenet.be> wrote:

> I've received a mail askink me to confirm my identity on PyPI.
>
> The begining of the message:
>
> -
>
> Your account VinsS on PyPI
> <
> https://pypi.us18.list-manage.com/track/click?u=b96f861be540deb98fd700265&id=9a19d150cf&e=8824682d0f>
>
> does not have a verified primary email address.
>
> Please take 2 minutes to verify your email:
>
>   * To ensure that every email we send is going to its intended recipient
>   * To ensure email sent from PyPI isn’t marked as spam by email clients
> and service
>   * *If you do not verify, you will not be able to upload to PyPI, or be
> added as a maintainer to projects. You will no longer receive
> notifications from PyPI.*
>
> To verify, visit your Account Settings
> <
> https://pypi.us18.list-manage.com/track/click?u=b96f861be540deb98fd700265&id=1763135c31&e=8824682d0f
> >
> page.
>
> -
>
> The Account Settings
> <
> https://pypi.us18.list-manage.com/track/click?u=b96f861be540deb98fd700265&id=1763135c31&e=8824682d0f>
>
> is :
>
> https://pypi.us18.list-manage.com/track/click?u=b96f861be540deb98fd700265&id=1763135c31&e=8824682d0f
>
> Phishing ? yes, no ?
>
>
> Vincent
>
> --
> https://mail.python.org/mailman/listinfo/python-list
>
-- 
https://mail.python.org/mailman/listinfo/python-list


Re: Fishing from PyPI ?

2018-08-06 Thread MRAB

On 2018-08-03 22:43, Vincent Vande Vyvre wrote:

I've received a mail askink me to confirm my identity on PyPI.

The begining of the message:

-

Your account VinsS on PyPI

does not have a verified primary email address.

Please take 2 minutes to verify your email:

   * To ensure that every email we send is going to its intended recipient
   * To ensure email sent from PyPI isn’t marked as spam by email clients
 and service
   * *If you do not verify, you will not be able to upload to PyPI, or be
 added as a maintainer to projects. You will no longer receive
 notifications from PyPI.*

 To verify, visit your Account Settings
 

 page.

-

The Account Settings

is :
https://pypi.us18.list-manage.com/track/click?u=b96f861be540deb98fd700265&id=1763135c31&e=8824682d0f

Phishing ? yes, no ?

If you want to be sure, ignore the links in the email, and check PyPI at 
the URL that you know is correct.


Given that the email says "https://pypi.us18.list-manage.com"; and PyPI 
is at "https://pypi.org/";, it does look suspicious.

--
https://mail.python.org/mailman/listinfo/python-list


Re: Fishing from PyPI ?

2018-08-06 Thread Chris Warrick
On Mon, 6 Aug 2018 at 19:31, MRAB  wrote:
> > https://pypi.us18.list-manage.com/track/[snip]
> If you want to be sure, ignore the links in the email, and check PyPI at
> the URL that you know is correct.
>
> Given that the email says "https://pypi.us18.list-manage.com"; and PyPI
> is at "https://pypi.org/";, it does look suspicious.
> --
> https://mail.python.org/mailman/listinfo/python-list

Those e-mails are legitimate. See [1] and [2].

The unusual domain is a common staple of Mailchimp, which is an e-mail
newsletter platform (it was used to mail out the announcement), and
they replace all links with tracking ones in their list-manage.com
domain. (They also implement the GDPR in an anti-user/pro-spam
fashion, but I digress.)

[1]: https://status.python.org/incidents/nk7cyn2vh4wr
[2]: https://github.com/pypa/warehouse/issues/3632

-- 
Chris Warrick 
PGP: 5EAAEA16
-- 
https://mail.python.org/mailman/listinfo/python-list


Re: NLTK

2018-08-06 Thread Rhodri James

On 03/08/18 08:49, ma...@mail.com wrote:

I like to analyse text. my method consisted of something like
words=text.split(), which would split the text into space-seperated
units. then I tried to use the Python NLTK library, which had alot
of features I wanted, but using `word-tokenize' gives a different
  answer.-

What gives?.


I don't know or use NLTK and you haven't provided any examples, so I 
can't say for certain.  However vague memories from the natural language 
work I did decades ago suggest that perhaps splitting text by spaces is 
a little too simplistic, and NLTK may be trying to do something more 
comprehensive.


--
Rhodri James *-* Kynesim Ltd
--
https://mail.python.org/mailman/listinfo/python-list


Re: Fishing from PyPI ?

2018-08-06 Thread Gregory Ewing

Chris Warrick wrote:

The unusual domain is a common staple of Mailchimp, which is an e-mail
newsletter platform (it was used to mail out the announcement), and
they replace all links with tracking ones in their list-manage.com
domain.


Sounds like you need to find a mail service that doesn't
screw around with the contents of your messages. This is
really quite obnoxious, IMO.

--
Greg
--
https://mail.python.org/mailman/listinfo/python-list


Re: NLTK

2018-08-06 Thread Steven D'Aprano
On Fri, 03 Aug 2018 07:49:40 +, mausg wrote:

> I like to analyse text. my method consisted of something like
> words=text.split(), which would split the text into space-seperated
> units. 

In natural language, words are more complicated than just space-separated 
units. Some languages don't use spaces as a word delimiter. Some don't 
use word delimiters at all. Even in English, the we have *compound words* 
which exist in three forms:

- open: "ice cream"
- closed: "notebook"
- hyphenated: "long-term"

Recognising open compound words is difficult. "Real estate" is an open 
compound word, but "real cheese" and "my estate" are both two words.

Another problem for English speakers is deciding whether to treat 
contractions as a single word, or split them?

"don't" --> "do" "n't"

"they'll" --> "they" "'ll"

Punctuation marks should either be stripped out of sentences before 
splitting into words, or treated as distinct tokens. We don't want 
"tokens" and "tokens." to be treated as distinct words, just because one 
happened to fall at the end of a sentence and one didn't.


> then I tried to use the Python NLTK library, which had alot of
> features I wanted, but using `word-tokenize' gives a different
>  answer.-
> 
> What gives?.

I'm pretty sure the function isn't called "word-tokenize". That would 
mean "word subtract tokenize" in Python code. Do you mean word_tokenize?

Have you compared the output of the two and looked at how they differ? If 
there is too much output to compare by eye, you could convert to sets and 
check the set difference.

Or try reading the documentation for word_tokenize:

http://www.nltk.org/api/nltk.tokenize.html#nltk.tokenize.treebank.TreebankWordTokenizer



-- 
Steven D'Aprano
"Ever since I learned about confirmation bias, I've been seeing
it everywhere." -- Jon Ronson

-- 
https://mail.python.org/mailman/listinfo/python-list


Good reason not to obfuscate URLs (was: Fishing from PyPI ?)

2018-08-06 Thread Ben Finney
Vincent Vande Vyvre  writes:

>To verify, visit your Account Settings
> information …]>
>page.
>
> -
>
> The Account Settings
>  information …]>
> is :
> https://pypi.us18.list-manage.com/track/click?u=[… personally-identifying 
> information …]
>
> Phishing ? yes, no ?

It's impossible to tell, from those links alone. The links are
obfuscated deliberately.

What we can say for certain, is that following those links allows
parties unknown, to track the fact you've followed that link, before you
ever get to PyPI.

You are right to be concerned.

This is one good reason why I argue that link obfuscation like this is
bad practice: we can't tell what domain they will redirect to, so
there's no way to know before visiting the link whether it will go to a
‘python.org’ URL.

Instead, sending people links that you want them to follow should be
direct links. That way we can see where it is the person wants us to
visit.

As a bonus, we avoid more layers of surveillance that these
man-inthe-middle providers like ‘list-manage.com’ try to gather about
our online behaviour.

-- 
 \  “Programs must be written for people to read, and only |
  `\incidentally for machines to execute.” —Abelson & Sussman, |
_o__)  _Structure and Interpretation of Computer Programs_ |
Ben Finney

-- 
https://mail.python.org/mailman/listinfo/python-list