[ https://issues.apache.org/jira/browse/BEAM-7890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ismaël Mejía resolved BEAM-7890. -------------------------------- Fix Version/s: Not applicable Resolution: Won't Fix > Rework dependency stack to ensure beam stay lightweight + embeddable > -------------------------------------------------------------------- > > Key: BEAM-7890 > URL: https://issues.apache.org/jira/browse/BEAM-7890 > Project: Beam > Issue Type: Bug > Components: runner-core, runner-spark, sdk-java-core > Affects Versions: 2.14.0 > Reporter: Romain Manni-Bucau > Priority: Major > Fix For: Not applicable > > > Currently, beam entry cost is > 30M: > > {code:java} > -rw-r--r-- 1 rmannibucau rmannibucau 13M févr. 17 11:45 > beam-vendor-grpc-1_13_1-0.2.jar > -rw-r--r-- 1 rmannibucau rmannibucau 8,7M août 5 10:22 > beam-sdks-java-core-2.14.0.jar > -rw-r--r-- 1 rmannibucau rmannibucau 2,6M août 5 10:25 > beam-vendor-sdks-java-extensions-protobuf-2.14.0.jar > -rw-r--r-- 1 rmannibucau rmannibucau 2,6M févr. 17 11:45 > beam-vendor-guava-20_0-0.1.jar > -rw-r--r-- 1 rmannibucau rmannibucau 1,4M août 5 10:21 > beam-model-pipeline-2.14.0.jar > -rw-r--r-- 1 rmannibucau rmannibucau 825K août 5 10:25 > beam-model-fn-execution-2.14.0.jar > -rw-r--r-- 1 rmannibucau rmannibucau 470K août 5 10:21 > beam-model-job-management-2.14.0.jar > -rw-r--r-- 1 rmannibucau rmannibucau 446K août 5 10:25 > beam-runners-core-construction-java-2.14.0.jar > -rw-r--r-- 1 rmannibucau rmannibucau 378K août 5 10:24 > beam-runners-core-java-2.14.0.jar{code} > Due to its embed nature (generally sent with the job) it should stay as > lightweight as possible. I see a few actions which can help to make back beam > integrable: > > # Make all the polyglotism layer optional and excludable, this is never > needed for several jobs and this additional weight is a clear regression on > the packaging side of beam, > # Vendoring and sdk dependencies are generally luxuray (who needs a library > to do a new ArrayList<>() in 2019 ;)) so most of the dependencies can be > dropped, vendoring can be made very lightweight - to not say optional for the > sdk java core > At the end a reasonable limit for a runner like spark - not the direct one > which reimplements all the logic by design - would be around 5M of deps IMHO. -- This message was sent by Atlassian Jira (v8.3.4#803005)