[ 
https://issues.apache.org/jira/browse/SPARK-13620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15177784#comment-15177784
 ] 

Daniel Darabos commented on SPARK-13620:
----------------------------------------

I just tested with the latest 2.0 nightly. Starting {{spark-shell}} is 
subjectively fast. {{strace}} does not see the {{RESOLVE-ADDRESS}} request. 
Looks like it's fixed then! I should have checked {{master}} before filing an 
issue, sorry.

> Avoid reverse DNS lookup for 0.0.0.0 on startup
> -----------------------------------------------
>
>                 Key: SPARK-13620
>                 URL: https://issues.apache.org/jira/browse/SPARK-13620
>             Project: Spark
>          Issue Type: Improvement
>          Components: Web UI
>    Affects Versions: 1.6.0
>            Reporter: Daniel Darabos
>            Priority: Minor
>             Fix For: 2.0.0
>
>
> I noticed we spend 5+ seconds during application startup with the following 
> stack trace:
> {code}
>         at java.net.Inet6AddressImpl.getHostByAddr(Native Method)
>         at java.net.InetAddress$1.getHostByAddr(InetAddress.java:926)
>         at java.net.InetAddress.getHostFromNameService(InetAddress.java:611)
>         at java.net.InetAddress.getHostName(InetAddress.java:553)
>         at java.net.InetAddress.getHostName(InetAddress.java:525)
>         at java.net.InetSocketAddress$InetSocketAddressHolder.getHostName(InetSocketAddress.java:82)
>         at java.net.InetSocketAddress$InetSocketAddressHolder.access$600(InetSocketAddress.java:56)
>         at java.net.InetSocketAddress.getHostName(InetSocketAddress.java:345)
>         at org.spark-project.jetty.server.Server.<init>(Server.java:115)
>         at org.apache.spark.ui.JettyUtils$.org$apache$spark$ui$JettyUtils$$connect$1(JettyUtils.scala:243)
>         at org.apache.spark.ui.JettyUtils$$anonfun$5.apply(JettyUtils.scala:262)
>         at org.apache.spark.ui.JettyUtils$$anonfun$5.apply(JettyUtils.scala:262)
>         at org.apache.spark.util.Utils$$anonfun$startServiceOnPort$1.apply$mcVI$sp(Utils.scala:1964)
>         at scala.collection.immutable.Range.foreach$mVc$sp(Range.scala:141)
>         at org.apache.spark.util.Utils$.startServiceOnPort(Utils.scala:1955)
>         at org.apache.spark.ui.JettyUtils$.startJettyServer(JettyUtils.scala:262)
>         at org.apache.spark.ui.WebUI.bind(WebUI.scala:136)
>         at org.apache.spark.SparkContext$$anonfun$13.apply(SparkContext.scala:481)
>         at org.apache.spark.SparkContext$$anonfun$13.apply(SparkContext.scala:481)
>         at scala.Option.foreach(Option.scala:236)
>         at org.apache.spark.SparkContext.<init>(SparkContext.scala:481)
> {code}
> Spark wants to start the web UI server bound to the wildcard address, so it
> [creates an {{InetSocketAddress}}|https://github.com/apache/spark/blob/v1.6.0/core/src/main/scala/org/apache/spark/ui/JettyUtils.scala#L243]
> [with hostname {{"0.0.0.0"}}|https://github.com/apache/spark/blob/v1.6.0/core/src/main/scala/org/apache/spark/ui/WebUI.scala#L136].
> Spark passes in a hostname string, but Java
> [recognizes that it is actually an address|https://github.com/openjdk-mirror/jdk/blob/adea42765ae4e7117c3f0e2d618d5e6aed44ced2/src/share/classes/java/net/InetSocketAddress.java#L220]
> and so sets the hostname to {{null}}. So when Jetty
> [calls {{getHostName}}|https://github.com/eclipse/jetty.project/blob/jetty-8.1.14.v20131031/jetty-server/src/main/java/org/eclipse/jetty/server/Server.java#L115],
> Java has to do a reverse DNS lookup for {{0.0.0.0}}. That takes 5+ seconds
> on my machine. Maybe it's just me? It's a very vanilla Ubuntu 14.04.
> There is a simple fix. Instead of passing in {{"0.0.0.0"}}, we should not set
> a hostname at all. In that case
> [{{InetAddress.anyLocalAddress()}} is used|https://github.com/openjdk-mirror/jdk/blob/adea42765ae4e7117c3f0e2d618d5e6aed44ced2/src/share/classes/java/net/InetSocketAddress.java#L166],
> which is the same address but does not need resolving.
> {code}
> scala> { val t0 = System.currentTimeMillis; new java.net.InetSocketAddress("0.0.0.0", 8000).getHostName; System.currentTimeMillis - t0 }
> res0: Long = 5432
> scala> { val t0 = System.currentTimeMillis; new java.net.InetSocketAddress(8000).getHostName; System.currentTimeMillis - t0 }
> res1: Long = 0
> {code}
> I'll send a pull request for this.
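
The two ways of constructing the address described above can be compared in a small standalone sketch. This is only an illustration, not the actual Spark patch: the object name {{WildcardBind}} is hypothetical, and port 8000 is reused from the REPL session. It also uses {{InetSocketAddress.getHostString}} (available since Java 7), which returns the hostname if one is set and otherwise the literal address text, without a reverse lookup. Whether Spark or Jetty use that method is not something the issue states.

```scala
import java.net.InetSocketAddress

// Hypothetical demo object; the name is an assumption for illustration only.
object WildcardBind {
  // Port-only constructor: Java fills in InetAddress.anyLocalAddress().
  val wildcard = new InetSocketAddress(8000)

  // Literal "0.0.0.0": Java parses it as an address and leaves the hostname
  // unset, so a later getHostName call would trigger a reverse DNS lookup.
  val literal = new InetSocketAddress("0.0.0.0", 8000)

  def main(args: Array[String]): Unit = {
    // Both sockets are bound to a wildcard ("any local") address.
    println(s"wildcard is any-local: ${wildcard.getAddress.isAnyLocalAddress}")
    println(s"literal is any-local:  ${literal.getAddress.isAnyLocalAddress}")

    // getHostString does not resolve; for the literal it yields the address text.
    println(s"literal host string:   ${literal.getHostString}")
  }
}
```

Calling {{getHostName}} on {{literal}} is deliberately left out here, since that is the call that was slow in the session above.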


