[GitHub] [spark] HyukjinKwon commented on a change in pull request #27912: [SPARK-31155] Enable pydocstyle tests

2020-03-14 Thread GitBox
HyukjinKwon commented on a change in pull request #27912: [SPARK-31155] Enable 
pydocstyle tests
URL: https://github.com/apache/spark/pull/27912#discussion_r392638387
 
 

 ##
 File path: dev/requirements.txt
 ##
 @@ -1,5 +1,8 @@
-flake8==3.5.0
+pycodestyle==2.5.0
 
 Review comment:
   On the other hand, not pinning pushes us to fix problems quickly so that we 
keep supporting the latest versions. Pinning forces developers to install 
versions that differ from the ones they already have. If we hit something we 
can't easily fix, we can pin the version at that point. Unless I am mistaken, 
using the latest version by default is the conventional approach on the Python 
side (and on the R side; it reminds me of the [`testthat` 
case](https://github.com/apache/spark/pull/27359)). 
[pandas](https://github.com/pandas-dev/pandas/blob/master/requirements-dev.txt) 
takes this approach, for example, and [I used it in Koalas 
too](https://github.com/databricks/koalas/blob/master/requirements-dev.txt).
   
   I agree that, in a way, pinning makes more sense when the file is used for 
CI or for a release Docker image, which are meant to be discarded after a 
single use; however, I don't quite agree when it affects other developers' 
environments.
   
   cc @dongjoon-hyun. IIRC, we had a related discussion at 
https://github.com/apache/spark/pull/26824.
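
   To make the distinction concrete, here is a minimal sketch (not part of 
this PR) that separates exactly pinned entries from unpinned ones. The helper 
name `split_pinned` is hypothetical, and the sample lines in the usage note 
are only loosely based on the diff above.

```python
import re

# Matches an exact pin such as "pycodestyle==2.5.0".
_PIN = re.compile(r"^([A-Za-z0-9_.\-]+)\s*==\s*([^\s;#]+)")


def split_pinned(lines):
    """Return (pinned, unpinned): a name->version dict and a list of names.

    Hypothetical helper for illustration; it handles only simple
    requirement lines, not the full requirements-file syntax.
    """
    pinned, unpinned = {}, []
    for line in lines:
        # Drop trailing comments and surrounding whitespace.
        stripped = line.split("#", 1)[0].strip()
        if not stripped:
            continue
        match = _PIN.match(stripped)
        if match:
            pinned[match.group(1)] = match.group(2)
        else:
            # Keep only the package name, before any version specifier.
            name = re.split(r"[<>=!~;\s\[]", stripped, maxsplit=1)[0]
            unpinned.append(name)
    return pinned, unpinned
```

   For example, `split_pinned(["flake8==3.5.0", "pycodestyle"])` reports 
`flake8` as pinned to `3.5.0` and `pycodestyle` as unpinned; only the pinned 
entry would force a developer to replace a version they already have.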


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HyukjinKwon commented on a change in pull request #27912: [SPARK-31155] Enable pydocstyle tests

2020-03-14 Thread GitBox
HyukjinKwon commented on a change in pull request #27912: [SPARK-31155] Enable 
pydocstyle tests
URL: https://github.com/apache/spark/pull/27912#discussion_r392638919
 
 

 ##
 File path: dev/tox.ini
 ##
 @@ -18,4 +18,4 @@ ignore=E226,E241,E305,E402,E722,E731,E741,W503,W504
 max-line-length=100
 
exclude=cloudpickle.py,heapq3.py,shared.py,python/docs/conf.py,work/*/*.py,python/.eggs/*,dist/*,.git/*
 [pydocstyle]
-ignore=D100,D101,D102,D103,D104,D105,D106,D107,D200,D201,D202,D203,D204,D205,D206,D207,D208,D209,D210,D211,D212,D213,D214,D215,D300,D301,D302,D400,D401,D402,D403,D404,D405,D406,D407,D408,D409,D410,D411,D412,D413,D414
+ignore=D100,D101,D102,D103,D104,D105,D106,D107,D200,D201,D202,D203,D204,D205,D206,D207,D208,D209,D210,D211,D212,D213,D214,D215,D300,D301,D302,D400,D401,D402,D403,D404,D405,D406,D407,D408,D409,D410,D411,D412,D413,D414,D415,D416,D417
 
 Review comment:
   Also, I don't think we have stated anywhere yet that we comply with 
[PEP-257](https://www.python.org/dev/peps/pep-0257/). Of course, it's good to 
fix things incrementally to comply with PEP-257; however, the configuration 
currently ignores too many rules. I wonder if we added this too early, without 
a comprehensive investigation. IIRC, several rules looked counterintuitive to 
me. cc @BryanCutler who, IIRC, also pointed this out.
   
   BTW, many rules were disabled because of our legacy Epydoc, which I managed 
to remove in 
https://github.com/apache/spark/commit/fe75ff8bea3330a10aba1a61f3aba42e541195a8.
 So several rules could probably be re-enabled now.
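
   As a quick way to see what a change like this adds to the ignore list, 
here is a small sketch that compares two `ignore=` values. The function name 
`added_codes` is hypothetical, and the lists in the usage note are shortened 
for brevity, not the full values from `dev/tox.ini`.

```python
def added_codes(old_ignore, new_ignore):
    """Return the codes in new_ignore that are absent from old_ignore.

    Both arguments are comma-separated strings, as in the `ignore=` option.
    The order of the new list is preserved.
    """
    old = {code.strip() for code in old_ignore.split(",") if code.strip()}
    return [
        code.strip()
        for code in new_ignore.split(",")
        if code.strip() and code.strip() not in old
    ]
```

   With the tails of the two lists in this diff, 
`added_codes("D412,D413,D414", "D412,D413,D414,D415,D416,D417")` returns 
`["D415", "D416", "D417"]`, i.e. the three rules newly ignored here.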








[GitHub] [spark] HyukjinKwon commented on a change in pull request #27912: [SPARK-31155] Enable pydocstyle tests

2020-03-14 Thread GitBox
HyukjinKwon commented on a change in pull request #27912: [SPARK-31155] Enable 
pydocstyle tests
URL: https://github.com/apache/spark/pull/27912#discussion_r392638671
 
 

 ##
 File path: dev/tox.ini
 ##
 @@ -18,4 +18,4 @@ ignore=E226,E241,E305,E402,E722,E731,E741,W503,W504
 max-line-length=100
 
exclude=cloudpickle.py,heapq3.py,shared.py,python/docs/conf.py,work/*/*.py,python/.eggs/*,dist/*,.git/*
 [pydocstyle]
-ignore=D100,D101,D102,D103,D104,D105,D106,D107,D200,D201,D202,D203,D204,D205,D206,D207,D208,D209,D210,D211,D212,D213,D214,D215,D300,D301,D302,D400,D401,D402,D403,D404,D405,D406,D407,D408,D409,D410,D411,D412,D413,D414
+ignore=D100,D101,D102,D103,D104,D105,D106,D107,D200,D201,D202,D203,D204,D205,D206,D207,D208,D209,D210,D211,D212,D213,D214,D215,D300,D301,D302,D400,D401,D402,D403,D404,D405,D406,D407,D408,D409,D410,D411,D412,D413,D414,D415,D416,D417
 
 Review comment:
   I don't think it's intentional. Actually, I have been considering removing 
`pydocstyle` altogether, since it doesn't verify anything very useful.




