[ https://issues.apache.org/jira/browse/TIKA-2306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15968600#comment-15968600 ]
ASF GitHub Bot commented on TIKA-2306: -------------------------------------- thammegowda commented on a change in pull request #163: TIKA-2306: Update Inception v3 to Inception v4 in Object recognition parser URL: https://github.com/apache/tika/pull/163#discussion_r111525259 ########## File path: tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionRestDockerfile ########## @@ -16,26 +16,34 @@ # FROM ubuntu -MAINTAINER Thamme Gowda <thammego...@apache.org> +MAINTAINER Thamme Gowda <thammego...@apache.org> and Kranthi Kiran GV <kkran...@student.nitw.ac.in> -# install missing part of ubuntu core + python stuff +# Install missing part of ubuntu core + python stuff RUN apt-get update && \ - apt-get install -y python-pip python-dev wget + apt-get install -y python-pip python-dev wget libtcmalloc-minimal4 git # Install tensorflow and other dependencies RUN \ - pip install --upgrade https://storage.googleapis.com/tensorflow/linux/cpu/tensorflow-0.9.0-cp27-none-linux_x86_64.whl && \ + pip install --upgrade https://storage.googleapis.com/tensorflow/linux/cpu/tensorflow-1.0.1-cp27-none-linux_x86_64.whl && \ pip install flask requests +# Get the TF-slim dependencies +# Forked for future compatibility +RUN git clone https://github.com/KranthiGV/models + # TODO: Change the URL to Apache/Tika Repo when this PR gets merged RUN \ - wget https://raw.githubusercontent.com/thammegowda/tika/TIKA-1993/tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/inceptionapi.py -O /usr/bin/inceptionapi.py && \ + wget https://raw.githubusercontent.com/KranthiGV/tika/TIKA-2306/tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/inceptionapi.py -O /usr/bin/inceptionapi.py && \ Review comment: TODO: when we merge this PR, we modify it and then merge it ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org > Update Inception v3 to Inception v4 in Object recognition parser > ----------------------------------------------------------------- > > Key: TIKA-2306 > URL: https://issues.apache.org/jira/browse/TIKA-2306 > Project: Tika > Issue Type: Improvement > Components: parser > Affects Versions: 1.14 > Reporter: Kranthi Kiran GV > Priority: Minor > Labels: inception, object_recognition > Original Estimate: 72h > Remaining Estimate: 72h > > Object Recognition Parser currently uses Inception V3 model for the object > classification task. Google released a newer Inception V4 model [1][2]. > It has an improved Top -1 accuracy of 80.2 and Top-5 accuracy of 95.2 [3]. > I believe that Tika community would benefit from it. I would be working on > this issue in the next few days. > [1] https://research.googleblog.com/2016/08/improving-inception-and-image.html > [2] https://arxiv.org/abs/1602.07261 > [3] https://github.com/tensorflow/models/tree/master/slim -- This message was sent by Atlassian JIRA (v6.3.15#6346)