GitHub user seratch opened a pull request:

    https://github.com/apache/spark/pull/22070

    Fix typos detected by github.com/client9/misspell

    ## What changes were proposed in this pull request?
    
    Fixing typos is sometimes very hard. It's not so easy to visually review 
them. Recently, I discovered a very useful tool for it, 
[misspell](https://github.com/client9/misspell). 
    
    This pull request fixes minor typos detected by 
[misspell](https://github.com/client9/misspell) except for the false positives. 
If you would like me to work on other files as well, let me know. 
    
    ## How was this patch tested?
    
    ### before
    
    ```
    $ misspell . | grep -v '.js'
    R/pkg/R/SQLContext.R:354:43: "definiton" is a misspelling of "definition"
    R/pkg/R/SQLContext.R:424:43: "definiton" is a misspelling of "definition"
    R/pkg/R/SQLContext.R:445:43: "definiton" is a misspelling of "definition"
    R/pkg/R/SQLContext.R:495:43: "definiton" is a misspelling of "definition"
    NOTICE-binary:454:16: "containd" is a misspelling of "contained"
    R/pkg/R/context.R:46:43: "definiton" is a misspelling of "definition"
    R/pkg/R/context.R:74:43: "definiton" is a misspelling of "definition"
    R/pkg/R/DataFrame.R:591:48: "persistance" is a misspelling of "persistence"
    R/pkg/R/streaming.R:166:44: "occured" is a misspelling of "occurred"
    R/pkg/inst/worker/worker.R:65:22: "ouput" is a misspelling of "output"
    R/pkg/tests/fulltests/test_utils.R:106:25: "environemnt" is a misspelling 
of "environment"
    
common/kvstore/src/test/java/org/apache/spark/util/kvstore/InMemoryStoreSuite.java:38:39:
 "existant" is a misspelling of "existent"
    
common/kvstore/src/test/java/org/apache/spark/util/kvstore/LevelDBSuite.java:83:39:
 "existant" is a misspelling of "existent"
    
common/network-common/src/main/java/org/apache/spark/network/crypto/TransportCipher.java:243:46:
 "transfered" is a misspelling of "transferred"
    
common/network-common/src/main/java/org/apache/spark/network/sasl/SaslEncryption.java:234:19:
 "transfered" is a misspelling of "transferred"
    
common/network-common/src/main/java/org/apache/spark/network/sasl/SaslEncryption.java:238:63:
 "transfered" is a misspelling of "transferred"
    
common/network-common/src/main/java/org/apache/spark/network/sasl/SaslEncryption.java:244:46:
 "transfered" is a misspelling of "transferred"
    
common/network-common/src/main/java/org/apache/spark/network/sasl/SaslEncryption.java:276:39:
 "transfered" is a misspelling of "transferred"
    
common/network-common/src/main/java/org/apache/spark/network/util/AbstractFileRegion.java:27:20:
 "transfered" is a misspelling of "transferred"
    
common/unsafe/src/test/scala/org/apache/spark/unsafe/types/UTF8StringPropertyCheckSuite.scala:195:15:
 "orgin" is a misspelling of "origin"
    core/src/main/scala/org/apache/spark/api/python/PythonRDD.scala:621:39: 
"gauranteed" is a misspelling of "guaranteed"
    core/src/main/scala/org/apache/spark/status/storeTypes.scala:113:29: "ect" 
is a misspelling of "etc"
    core/src/main/scala/org/apache/spark/storage/DiskStore.scala:282:18: 
"transfered" is a misspelling of "transferred"
    core/src/main/scala/org/apache/spark/util/ListenerBus.scala:64:17: 
"overriden" is a misspelling of "overridden"
    core/src/test/scala/org/apache/spark/ShuffleSuite.scala:211:7: 
"substracted" is a misspelling of "subtracted"
    
core/src/test/scala/org/apache/spark/scheduler/DAGSchedulerSuite.scala:1922:49: 
"agriculteur" is a misspelling of "agriculture"
    
core/src/test/scala/org/apache/spark/scheduler/DAGSchedulerSuite.scala:2468:84: 
"truely" is a misspelling of "truly"
    
core/src/test/scala/org/apache/spark/storage/FlatmapIteratorSuite.scala:25:18: 
"persistance" is a misspelling of "persistence"
    
core/src/test/scala/org/apache/spark/storage/FlatmapIteratorSuite.scala:26:69: 
"persistance" is a misspelling of "persistence"
    data/streaming/AFINN-111.txt:1219:0: "humerous" is a misspelling of 
"humorous"
    dev/run-pip-tests:55:28: "enviroments" is a misspelling of "environments"
    dev/run-pip-tests:91:37: "virutal" is a misspelling of "virtual"
    dev/merge_spark_pr.py:377:72: "accross" is a misspelling of "across"
    dev/merge_spark_pr.py:378:66: "accross" is a misspelling of "across"
    dev/run-pip-tests:126:25: "enviroments" is a misspelling of "environments"
    docs/configuration.md:1830:82: "overriden" is a misspelling of "overridden"
    docs/structured-streaming-programming-guide.md:525:45: "processs" is a 
misspelling of "processes"
    docs/structured-streaming-programming-guide.md:1165:61: "BETWEN" is a 
misspelling of "BETWEEN"
    docs/sql-programming-guide.md:1891:810: "behaivor" is a misspelling of 
"behavior"
    examples/src/main/python/sql/arrow.py:98:8: "substract" is a misspelling of 
"subtract"
    examples/src/main/python/sql/arrow.py:103:27: "substract" is a misspelling 
of "subtract"
    licenses/LICENSE-heapq.txt:5:63: "Stichting" is a misspelling of "Stitching"
    licenses/LICENSE-heapq.txt:6:2: "Mathematisch" is a misspelling of 
"Mathematics"
    licenses/LICENSE-heapq.txt:262:29: "Stichting" is a misspelling of 
"Stitching"
    licenses/LICENSE-heapq.txt:262:39: "Mathematisch" is a misspelling of 
"Mathematics"
    licenses/LICENSE-heapq.txt:269:49: "Stichting" is a misspelling of 
"Stitching"
    licenses/LICENSE-heapq.txt:269:59: "Mathematisch" is a misspelling of 
"Mathematics"
    licenses/LICENSE-heapq.txt:274:2: "STICHTING" is a misspelling of 
"STITCHING"
    licenses/LICENSE-heapq.txt:274:12: "MATHEMATISCH" is a misspelling of 
"MATHEMATICS"
    licenses/LICENSE-heapq.txt:276:29: "STICHTING" is a misspelling of 
"STITCHING"
    licenses/LICENSE-heapq.txt:276:39: "MATHEMATISCH" is a misspelling of 
"MATHEMATICS"
    licenses-binary/LICENSE-heapq.txt:5:63: "Stichting" is a misspelling of 
"Stitching"
    licenses-binary/LICENSE-heapq.txt:6:2: "Mathematisch" is a misspelling of 
"Mathematics"
    licenses-binary/LICENSE-heapq.txt:262:29: "Stichting" is a misspelling of 
"Stitching"
    licenses-binary/LICENSE-heapq.txt:262:39: "Mathematisch" is a misspelling 
of "Mathematics"
    licenses-binary/LICENSE-heapq.txt:269:49: "Stichting" is a misspelling of 
"Stitching"
    licenses-binary/LICENSE-heapq.txt:269:59: "Mathematisch" is a misspelling 
of "Mathematics"
    licenses-binary/LICENSE-heapq.txt:274:2: "STICHTING" is a misspelling of 
"STITCHING"
    licenses-binary/LICENSE-heapq.txt:274:12: "MATHEMATISCH" is a misspelling 
of "MATHEMATICS"
    licenses-binary/LICENSE-heapq.txt:276:29: "STICHTING" is a misspelling of 
"STITCHING"
    licenses-binary/LICENSE-heapq.txt:276:39: "MATHEMATISCH" is a misspelling 
of "MATHEMATICS"
    
mllib/src/main/resources/org/apache/spark/ml/feature/stopwords/hungarian.txt:170:0:
 "teh" is a misspelling of "the"
    
mllib/src/main/resources/org/apache/spark/ml/feature/stopwords/portuguese.txt:53:0:
 "eles" is a misspelling of "eels"
    mllib/src/main/scala/org/apache/spark/ml/stat/Summarizer.scala:99:20: 
"Euclidian" is a misspelling of "Euclidean"
    mllib/src/main/scala/org/apache/spark/ml/stat/Summarizer.scala:539:11: 
"Euclidian" is a misspelling of "Euclidean"
    
mllib/src/main/scala/org/apache/spark/mllib/clustering/LDAOptimizer.scala:77:36:
 "Teh" is a misspelling of "The"
    
mllib/src/main/scala/org/apache/spark/mllib/clustering/StreamingKMeans.scala:230:24:
 "inital" is a misspelling of "initial"
    
mllib/src/main/scala/org/apache/spark/mllib/stat/MultivariateOnlineSummarizer.scala:276:9:
 "Euclidian" is a misspelling of "Euclidean"
    
mllib/src/test/scala/org/apache/spark/ml/clustering/KMeansSuite.scala:237:26: 
"descripiton" is a misspelling of "descriptions"
    python/pyspark/find_spark_home.py:30:13: "enviroment" is a misspelling of 
"environment"
    python/pyspark/context.py:937:12: "supress" is a misspelling of "suppress"
    python/pyspark/context.py:938:12: "supress" is a misspelling of "suppress"
    python/pyspark/context.py:939:12: "supress" is a misspelling of "suppress"
    python/pyspark/context.py:940:12: "supress" is a misspelling of "suppress"
    python/pyspark/heapq3.py:6:63: "Stichting" is a misspelling of "Stitching"
    python/pyspark/heapq3.py:7:2: "Mathematisch" is a misspelling of 
"Mathematics"
    python/pyspark/heapq3.py:263:29: "Stichting" is a misspelling of "Stitching"
    python/pyspark/heapq3.py:263:39: "Mathematisch" is a misspelling of 
"Mathematics"
    python/pyspark/heapq3.py:270:49: "Stichting" is a misspelling of "Stitching"
    python/pyspark/heapq3.py:270:59: "Mathematisch" is a misspelling of 
"Mathematics"
    python/pyspark/heapq3.py:275:2: "STICHTING" is a misspelling of "STITCHING"
    python/pyspark/heapq3.py:275:12: "MATHEMATISCH" is a misspelling of 
"MATHEMATICS"
    python/pyspark/heapq3.py:277:29: "STICHTING" is a misspelling of "STITCHING"
    python/pyspark/heapq3.py:277:39: "MATHEMATISCH" is a misspelling of 
"MATHEMATICS"
    python/pyspark/heapq3.py:713:8: "probabilty" is a misspelling of 
"probability"
    python/pyspark/ml/clustering.py:1038:8: "Currenlty" is a misspelling of 
"Currently"
    python/pyspark/ml/stat.py:339:23: "Euclidian" is a misspelling of 
"Euclidean"
    python/pyspark/ml/regression.py:1378:20: "paramter" is a misspelling of 
"parameter"
    python/pyspark/mllib/stat/_statistics.py:262:8: "probabilty" is a 
misspelling of "probability"
    python/pyspark/rdd.py:1363:32: "paramter" is a misspelling of "parameter"
    python/pyspark/streaming/tests.py:825:42: "retuns" is a misspelling of 
"returns"
    python/pyspark/sql/tests.py:768:29: "initalization" is a misspelling of 
"initialization"
    python/pyspark/sql/tests.py:3616:31: "initalize" is a misspelling of 
"initialize"
    
resource-managers/mesos/src/main/scala/org/apache/spark/scheduler/cluster/mesos/MesosSchedulerBackendUtil.scala:120:39:
 "arbitary" is a misspelling of "arbitrary"
    
resource-managers/mesos/src/test/scala/org/apache/spark/deploy/mesos/MesosClusterDispatcherArgumentsSuite.scala:26:45:
 "sucessfully" is a misspelling of "successfully"
    
resource-managers/mesos/src/main/scala/org/apache/spark/scheduler/cluster/mesos/MesosSchedulerUtils.scala:358:27:
 "constaints" is a misspelling of "constraints"
    
resource-managers/yarn/src/test/scala/org/apache/spark/deploy/yarn/YarnClusterSuite.scala:111:24:
 "senstive" is a misspelling of "sensitive"
    
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/SessionCatalog.scala:1063:5:
 "overwirte" is a misspelling of "overwrite"
    
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/datetimeExpressions.scala:1348:17:
 "compatability" is a misspelling of "compatibility"
    
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/basicLogicalOperators.scala:77:36:
 "paramter" is a misspelling of "parameter"
    
sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala:1374:22:
 "precendence" is a misspelling of "precedence"
    
sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/analysis/AnalysisSuite.scala:238:27:
 "unnecassary" is a misspelling of "unnecessary"
    
sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/expressions/ConditionalExpressionSuite.scala:212:17:
 "whn" is a misspelling of "when"
    
sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/StreamingSymmetricHashJoinHelper.scala:147:60:
 "timestmap" is a misspelling of "timestamp"
    sql/core/src/test/scala/org/apache/spark/sql/TPCDSQuerySuite.scala:150:45: 
"precentage" is a misspelling of "percentage"
    
sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/csv/CSVInferSchemaSuite.scala:135:29:
 "infered" is a misspelling of "inferred"
    
sql/hive/src/test/resources/golden/udf_instr-1-2e76f819563dbaba4beb51e3a130b922:1:52:
 "occurance" is a misspelling of "occurrence"
    
sql/hive/src/test/resources/golden/udf_instr-2-32da357fc754badd6e3898dcc8989182:1:52:
 "occurance" is a misspelling of "occurrence"
    
sql/hive/src/test/resources/golden/udf_locate-1-6e41693c9c6dceea4d7fab4c02884e4e:1:63:
 "occurance" is a misspelling of "occurrence"
    
sql/hive/src/test/resources/golden/udf_locate-2-d9b5934457931447874d6bb7c13de478:1:63:
 "occurance" is a misspelling of "occurrence"
    
sql/hive/src/test/resources/golden/udf_translate-2-f7aa38a33ca0df73b7a1e6b6da4b7fe8:9:79:
 "occurence" is a misspelling of "occurrence"
    
sql/hive/src/test/resources/golden/udf_translate-2-f7aa38a33ca0df73b7a1e6b6da4b7fe8:13:110:
 "occurence" is a misspelling of "occurrence"
    
sql/hive/src/test/resources/ql/src/test/queries/clientpositive/annotate_stats_join.q:46:105:
 "distint" is a misspelling of "distinct"
    
sql/hive/src/test/resources/ql/src/test/queries/clientpositive/auto_sortmerge_join_11.q:29:3:
 "Currenly" is a misspelling of "Currently"
    
sql/hive/src/test/resources/ql/src/test/queries/clientpositive/avro_partitioned.q:72:15:
 "existant" is a misspelling of "existent"
    
sql/hive/src/test/resources/ql/src/test/queries/clientpositive/decimal_udf.q:25:3:
 "substraction" is a misspelling of "subtraction"
    
sql/hive/src/test/resources/ql/src/test/queries/clientpositive/groupby2_map_multi_distinct.q:16:51:
 "funtion" is a misspelling of "function"
    
sql/hive/src/test/resources/ql/src/test/queries/clientpositive/groupby_sort_8.q:15:30:
 "issueing" is a misspelling of "issuing"
    
sql/hive/src/test/scala/org/apache/spark/sql/sources/HadoopFsRelationTest.scala:669:52:
 "wiht" is a misspelling of "with"
    
sql/hive-thriftserver/src/main/java/org/apache/hive/service/cli/session/HiveSessionImpl.java:474:9:
 "Refering" is a misspelling of "Referring"
    ```
    
    ### after
    
    ```
    $ misspell . | grep -v '.js'
    
common/network-common/src/main/java/org/apache/spark/network/util/AbstractFileRegion.java:27:20:
 "transfered" is a misspelling of "transferred"
    core/src/main/scala/org/apache/spark/status/storeTypes.scala:113:29: "ect" 
is a misspelling of "etc"
    
core/src/test/scala/org/apache/spark/scheduler/DAGSchedulerSuite.scala:1922:49: 
"agriculteur" is a misspelling of "agriculture"
    data/streaming/AFINN-111.txt:1219:0: "humerous" is a misspelling of 
"humorous"
    licenses/LICENSE-heapq.txt:5:63: "Stichting" is a misspelling of "Stitching"
    licenses/LICENSE-heapq.txt:6:2: "Mathematisch" is a misspelling of 
"Mathematics"
    licenses/LICENSE-heapq.txt:262:29: "Stichting" is a misspelling of 
"Stitching"
    licenses/LICENSE-heapq.txt:262:39: "Mathematisch" is a misspelling of 
"Mathematics"
    licenses/LICENSE-heapq.txt:269:49: "Stichting" is a misspelling of 
"Stitching"
    licenses/LICENSE-heapq.txt:269:59: "Mathematisch" is a misspelling of 
"Mathematics"
    licenses/LICENSE-heapq.txt:274:2: "STICHTING" is a misspelling of 
"STITCHING"
    licenses/LICENSE-heapq.txt:274:12: "MATHEMATISCH" is a misspelling of 
"MATHEMATICS"
    licenses/LICENSE-heapq.txt:276:29: "STICHTING" is a misspelling of 
"STITCHING"
    licenses/LICENSE-heapq.txt:276:39: "MATHEMATISCH" is a misspelling of 
"MATHEMATICS"
    licenses-binary/LICENSE-heapq.txt:5:63: "Stichting" is a misspelling of 
"Stitching"
    licenses-binary/LICENSE-heapq.txt:6:2: "Mathematisch" is a misspelling of 
"Mathematics"
    licenses-binary/LICENSE-heapq.txt:262:29: "Stichting" is a misspelling of 
"Stitching"
    licenses-binary/LICENSE-heapq.txt:262:39: "Mathematisch" is a misspelling 
of "Mathematics"
    licenses-binary/LICENSE-heapq.txt:269:49: "Stichting" is a misspelling of 
"Stitching"
    licenses-binary/LICENSE-heapq.txt:269:59: "Mathematisch" is a misspelling 
of "Mathematics"
    licenses-binary/LICENSE-heapq.txt:274:2: "STICHTING" is a misspelling of 
"STITCHING"
    licenses-binary/LICENSE-heapq.txt:274:12: "MATHEMATISCH" is a misspelling 
of "MATHEMATICS"
    licenses-binary/LICENSE-heapq.txt:276:29: "STICHTING" is a misspelling of 
"STITCHING"
    licenses-binary/LICENSE-heapq.txt:276:39: "MATHEMATISCH" is a misspelling 
of "MATHEMATICS"
    
mllib/src/main/resources/org/apache/spark/ml/feature/stopwords/hungarian.txt:170:0:
 "teh" is a misspelling of "the"
    
mllib/src/main/resources/org/apache/spark/ml/feature/stopwords/portuguese.txt:53:0:
 "eles" is a misspelling of "eels"
    mllib/src/main/scala/org/apache/spark/ml/stat/Summarizer.scala:99:20: 
"Euclidian" is a misspelling of "Euclidean"
    mllib/src/main/scala/org/apache/spark/ml/stat/Summarizer.scala:539:11: 
"Euclidian" is a misspelling of "Euclidean"
    
mllib/src/main/scala/org/apache/spark/mllib/clustering/LDAOptimizer.scala:77:36:
 "Teh" is a misspelling of "The"
    
mllib/src/main/scala/org/apache/spark/mllib/stat/MultivariateOnlineSummarizer.scala:276:9:
 "Euclidian" is a misspelling of "Euclidean"
    python/pyspark/heapq3.py:6:63: "Stichting" is a misspelling of "Stitching"
    python/pyspark/heapq3.py:7:2: "Mathematisch" is a misspelling of 
"Mathematics"
    python/pyspark/heapq3.py:263:29: "Stichting" is a misspelling of "Stitching"
    python/pyspark/heapq3.py:263:39: "Mathematisch" is a misspelling of 
"Mathematics"
    python/pyspark/heapq3.py:270:49: "Stichting" is a misspelling of "Stitching"
    python/pyspark/heapq3.py:270:59: "Mathematisch" is a misspelling of 
"Mathematics"
    python/pyspark/heapq3.py:275:2: "STICHTING" is a misspelling of "STITCHING"
    python/pyspark/heapq3.py:275:12: "MATHEMATISCH" is a misspelling of 
"MATHEMATICS"
    python/pyspark/heapq3.py:277:29: "STICHTING" is a misspelling of "STITCHING"
    python/pyspark/heapq3.py:277:39: "MATHEMATISCH" is a misspelling of 
"MATHEMATICS"
    python/pyspark/ml/stat.py:339:23: "Euclidian" is a misspelling of 
"Euclidean"
    ```

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/seratch/spark fix-typo

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/22070.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #22070
    
----
commit 9e95df24206bbcc51ae09bd488d72a2bcf84ee7b
Author: Kazuhiro Sera <seratch@...>
Date:   2018-08-10T10:44:34Z

    Fix typos detected by github.com/client9/misspell
    
    Signed-off-by: Kazuhiro Sera <sera...@gmail.com>

----


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to